Chai_Ivy
Chai_Ivy
Hey kamyar, I fixed the validation dataset, loss weight, and iterations, however I still see this" Training epoch 1" ending. And as I check ls -trl in my checkpoints folder,...
> . Based on your configuration, the model stops training when the number of iterations is larger than 999 "Based on your configuration, the model stops training when the number...
> Dear Yaqiong > > How you solve the problem of the training process ending up instantly. Thank you very much! > > Best wishes! @Hgit007 Knazeri suggested a few...
It is really great work! @sanchezirina Could you possibly upload the models you trained? Or something in the checkpoints_restore_dir would be very appreciated. It keeps complaining " in restore raise...