Bert-Chinese-Text-Classification-Pytorch Question about masked language model

Question about masked language model

Open paprika0741 opened this issue 2 years ago • 0 comments

代码中masked language modeling labels中-1标记的是被masked的token，loss计算忽略被mask的token，但是BERT论文中写的是”the final hidden vectors corresponding to the mask tokens are fed into an output softmax over the vocabulary“ 只计算masked token处的loss

Sep 23 '23 13:09 paprika0741

Bert-Chinese-Text-Classification-Pytorch Bert-Chinese-Text-Classification-Pytorch copied to clipboard

Question about masked language model

Bert-Chinese-Text-Classification-Pytorch
Bert-Chinese-Text-Classification-Pytorch copied to clipboard