
loss nan

Open Lzyin opened this issue 1 year ago • 0 comments

When I run the segmentation code on other public datasets, the loss becomes nan after training for more than 20 epochs. What could be the reason?

Thank you for your reply!

Lzyin • Mar 05 '24 02:03

Thank you for sharing your MLM work! I tried chatting with the "Lion" bot in Chinese, and it gave brief answers almost entirely in English. I would like to know why chineseBERT is used as the Q-Former, as stated in your readme.md. Do you have any plans to make "Lion" answer in Chinese? Thank you!

Hi sunyoe, we used both Chinese and English data during pre-training, but during SFT we found that using purely English data performed better on MME. As a result, this demo model tends to answer in English. Our Chinese and English chat model will be released as soon as possible, so please look forward to it.

mynameischaos • Oct 13 '23 03:10