IMoS icon indicating copy to clipboard operation
IMoS copied to clipboard

The arm model result cannot be replicated through train_arm.py

Open wzyabcas opened this issue 1 year ago • 6 comments

Hi, I ran data_preprocess.py and train_arms.py with default configs. The validation loss during the teacher forcing phase was around 0.006 and did not continue to decrease. After removing teacher forcing, it stayed around 0.01 without further improvement. This validation loss curve is quite different from the training curve you trained. The visualization dosen't look good as well. I want to ask whether you used the script "train_arms.py" for training and whether there were any special hyperparameters used. I am also curious why there is a joint mismatch between the pretrained model and the model we trained ourselves. Thx for answering!!

wzyabcas avatar Jul 01 '23 20:07 wzyabcas