EMOPIA

Training problem with family token (y_type)

Open yen52205 opened this issue 2 years ago • 0 comments

Hi, while using your training code, I found something I didn't understand in the model's forward pass. During training, the model first predicts the family token (y_type) and then predicts the other kinds of tokens. The code below shows that you directly use the ground-truth family token to predict the other kinds of tokens. [image]

But during generation, you use the family token predicted earlier to predict the other kinds of tokens. I'm wondering why you chose to train and run inference this way, and whether this could cause an inconsistency between training and inference?
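To make the question concrete, here is a minimal sketch of the two code paths as I understand them. It is not the repository's code: the sizes, the random weights, and the names `forward_train`, `forward_generate`, and `other_logits` are all hypothetical stand-ins. Conditioning on the ground-truth token during training is commonly called teacher forcing; the sketch only shows where the conditioning input can differ between the two paths.

```python
import random

random.seed(0)
N_TYPES, HIDDEN, N_OTHER = 3, 4, 5  # hypothetical sizes, not taken from the repo


def rand_matrix(rows, cols):
    """Random weights standing in for trained parameters."""
    return [[random.uniform(-1, 1) for _ in range(cols)] for _ in range(rows)]


W_type = rand_matrix(N_TYPES, HIDDEN)    # family-token (y_type) head
type_emb = rand_matrix(N_TYPES, HIDDEN)  # embedding of the family token
W_other = rand_matrix(N_OTHER, HIDDEN)   # one "other" token head


def matvec(W, x):
    return [sum(w * xi for w, xi in zip(row, x)) for row in W]


def argmax(v):
    return max(range(len(v)), key=v.__getitem__)


def other_logits(hidden, type_id):
    # The other head sees the hidden state plus the chosen family-token embedding.
    cond = [h + e for h, e in zip(hidden, type_emb[type_id])]
    return matvec(W_other, cond)


def forward_train(hidden, gold_type):
    type_logits = matvec(W_type, hidden)
    # Training path: condition on the ground-truth family token.
    return type_logits, other_logits(hidden, gold_type)


def forward_generate(hidden):
    type_logits = matvec(W_type, hidden)
    pred_type = argmax(type_logits)  # could also be sampled
    # Generation path: condition on the model's own prediction.
    return pred_type, other_logits(hidden, pred_type)
```

In this toy setup the two paths give identical outputs whenever the predicted family token equals the ground-truth one, and they diverge only when the family prediction is wrong. That is the situation I am asking about.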

Thanks to anyone who can help me figure this out!

yen52205, Jul 23 '22 17:07