Pretrained-Language-Model
Pretrained-Language-Model copied to clipboard
请问General Distillation部分的teachermodel是什么,语料库如何获取?
The teacher model for general distillation is BERT-base-uncased and the corpus is the original one, Totonto Book Corpus. You can search pretrained BERT model on huggingface as reference.