transfer-learning-conv-ai
transfer-learning-conv-ai copied to clipboard
Confusion in pretrained model.
Hello.
I am trying to run your model and I have some confusion in your pre-trained model.
It seems that train.py
trained the model with doublehead model, but in the interact.py
, the code loads LMHeadmodel
.
Why are they using different models?
So in the pre-trained model, actually, next sentence classification is not implemented?
Hi Han8931.
I think doublehead model is trained to solve two tasks: seq2seq and seq2label. You can check the code in their transformer.
In interact.py
, we can't predict the label, we need to generate a new answer. So only LMHead is enough(seq2seq).