[Question] Loss does not decrease

Open awzhgw opened this issue 4 months ago • 4 comments

Question

When I fine-tune with LoRA, the loss drops to about 0.8 in the first 20k steps, then rises to 6.2 over time and keeps oscillating around that value. Why does this happen, and how can I fix it?

awzhgw avatar Apr 06 '24 00:04 awzhgw

Hi friend, I guess you could consider switching to a lower learning rate for your SFT.

Sprinter1999 avatar Apr 19 '24 02:04 Sprinter1999

Same question

OliverLeeXZ avatar Apr 29 '24 05:04 OliverLeeXZ

Same problem too! Curious to know if lower learning rate helped you

Vignesh-Valaboju avatar May 01 '24 19:05 Vignesh-Valaboju

I changed the learning rate from 2e-4 to 2e-5. It really works!

OliverLeeXZ avatar May 02 '24 16:05 OliverLeeXZ
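
For anyone wondering why a 10x smaller learning rate can turn a diverging run into a converging one, here is a minimal toy sketch. It is not the LLaVA training loop: it uses plain gradient descent on a one-dimensional quadratic, and the function name `run_gradient_descent` and the `curvature` value are assumptions chosen only so that 2e-4 lands above the stability limit (`2 / curvature`) and 2e-5 lands below it. Real LoRA fine-tuning uses an adaptive optimizer and a schedule, so treat this as intuition only.

```python
def run_gradient_descent(lr, curvature=12000.0, w0=1.0, steps=50):
    """Minimize f(w) = 0.5 * curvature * w**2 with plain gradient descent.

    `curvature` is a hypothetical value picked for illustration; plain
    gradient descent on this function is stable only when lr < 2 / curvature.
    """
    w = w0
    losses = []
    for _ in range(steps):
        losses.append(0.5 * curvature * w * w)  # loss before the update
        grad = curvature * w                    # df/dw
        w -= lr * grad                          # gradient descent step
    return losses


high = run_gradient_descent(lr=2e-4)  # above the stability limit: loss grows
low = run_gradient_descent(lr=2e-5)   # below the stability limit: loss shrinks

print(f"lr=2e-4: first loss {high[0]:.3g}, last loss {high[-1]:.3g}")
print(f"lr=2e-5: first loss {low[0]:.3g}, last loss {low[-1]:.3g}")
```

In a Hugging Face `Trainer`-based setup the corresponding knob is the `learning_rate` field of `TrainingArguments`; where exactly it is set depends on your training script.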