open_lm Fix too many tokens requested edge case.

Fix too many tokens requested edge case.

Open GeorgiosSmyrnis opened this issue 1 year ago • 0 comments

Sometimes, the model needs to do a few more training steps in a new epoch, and it would load an entire checkpoints worth of data for that. This PR limits the number of tokens requested by how many steps are actually left over.

Jan 16 '24 14:01 GeorgiosSmyrnis

open_lm open_lm copied to clipboard

Fix too many tokens requested edge case.

open_lm
open_lm copied to clipboard