LLaVA-Mini
LLaVA-Mini copied to clipboard
Can someone reproduce this project
Can someone reproduce this project, why the training loss is 0 in the four node training under zero2 setting. The effect is also super poor under zero3.