alpaca-lora
--group_by_length flag train/loss anomaly
What is the "odd train/loss" that this flag is said to cause? I have it on (and haven't tested with it off), and my train/loss looks like this:
Is this consistent with the specific "oddness" the flag creates, or am I doing something else terribly wrong?