transfer-learning-conv-ai icon indicating copy to clipboard operation
transfer-learning-conv-ai copied to clipboard

Confusion in pretrained model.

Open Han8931 opened this issue 5 years ago • 1 comments

Hello.

I am trying to run your model and I have some confusion in your pre-trained model.

It seems that train.py trained the model with doublehead model, but in the interact.py, the code loads LMHeadmodel.

Why are they using different models?

So in the pre-trained model, actually, next sentence classification is not implemented?

Han8931 avatar Jan 07 '20 09:01 Han8931

Hi Han8931. I think doublehead model is trained to solve two tasks: seq2seq and seq2label. You can check the code in their transformer. In interact.py, we can't predict the label, we need to generate a new answer. So only LMHead is enough(seq2seq).

ZHANG45 avatar Jan 08 '20 05:01 ZHANG45