Skip to content

Fix docstring parameter names in compute_dpo_loss function#953

Merged
rasbt merged 1 commit intorasbt:mainfrom
wozniakos10:main
Jan 29, 2026
Merged

Fix docstring parameter names in compute_dpo_loss function#953
rasbt merged 1 commit intorasbt:mainfrom
wozniakos10:main

Conversation

@wozniakos10
Copy link
Contributor

Fixed a small inconsistency in the compute_dpo_loss function where the docstring was referring to parameters as policy_chosen_logprobs and policy_rejected_logprobs, but the actual parameters are named model_chosen_logprobs and model_rejected_logprobs. The docstring now matches the actual parameter names.

Proposing this fix to improve documentation clarity and consistency.

@review-notebook-app
Copy link

Check out this pull request on  ReviewNB

See visual diffs & provide feedback on Jupyter Notebooks.


Powered by ReviewNB

@rasbt
Copy link
Owner

rasbt commented Jan 29, 2026

Ah yes, I tried to make it less jargon-heavy and might have forgotten to rename it there. Thanks!

@rasbt rasbt merged commit 82010e2 into rasbt:main Jan 29, 2026
13 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants