EMOPIA
Training problem with family token (y_type)
Hi,
while using your training code, I found something I didn't understand in the model's forward pass.
During training, the model first predicts the family token (y_type) and then predicts the other kinds of tokens.
In the code below, you directly use the ground-truth family token to predict the other kinds of tokens.
But in the generation process, you use the family token predicted earlier to predict the other kinds of tokens. I'm wondering why you chose to train and run inference this way, and whether it could cause an inconsistency between training and inference.
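To make sure I'm describing the pattern correctly, here is a minimal sketch of what I mean (all names are hypothetical stand-ins, not the actual EMOPIA code): during training the second-stage heads are conditioned on the ground-truth family token (teacher forcing), while during generation they are conditioned on the model's own sampled family token.

```python
# Toy stand-in for the two-stage prediction; hypothetical names,
# not the actual EMOPIA code.

def predict_family(h):
    # pretend these are logits -> a sampled family token
    return "Note" if h % 2 == 0 else "Metric"

def predict_others(h, family):
    # the second-stage prediction is conditioned on the family token
    return {"Note": ("pitch", h), "Metric": ("beat", h)}[family]

def train_step(h, gt_family):
    pred_family = predict_family(h)        # loss is computed on this
    others = predict_others(h, gt_family)  # teacher forcing: ground truth
    return pred_family, others

def generate_step(h):
    pred_family = predict_family(h)
    others = predict_others(h, pred_family)  # inference: own prediction
    return pred_family, others
```

With this toy model, `train_step(3, "Note")` conditions the second stage on the ground-truth "Note" even though the model itself predicts "Metric", whereas `generate_step(3)` conditions on the predicted "Metric" — which is exactly the train/inference mismatch I'm asking about.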
Thanks to anyone who can help me figure this out!