bert_ner
bert_ner copied to clipboard
The ##word should not be predicted
In bert paper, it seems that the words start with '##' should not be predicted. And you did compute is_head variable, but why this variable is not used when computing loss ?
Hello,I want to ask this code do you have run and the result of F1 is good? @ AnblueWang