trl
trl copied to clipboard
Fix `start` index under `batched_forward_pass`
Start index under batched_forward_pass
starts from the last query token which does not align with the comment in line 1032. This also causes a problem when handling the response tokens as stated in #1781.
Set start
to the first response token instead of the last query token.