rtmaww comments

Results 38 comments of


                                            rtmaww

关于Label Mapping

你好，感谢你关注我们的论文。关于你提出的问题，我们使用的label word确实是"John" （discrete）或者['Michael', 'John', 'David', 'Thomas', 'Martin', 'Paul'] （virtual）这样的形式，其中virtual实际上是由多个词一起构建得到label word的。在代码train_transformer.py中的第912行的add_label_token_bert中，我们（1）先向tokenizer的词表中插入一些新token作为label word，这些label word在词表中的key为其对应的标签（比如"I-PER"）（2）然后我们为这些词表中的新token初始化embedding，这里的embedding就是使用"John" 或者['Michael', 'John', 'David', 'Thomas', 'Martin', 'Paul'] 对应的词的embedding进行初始化。比如，'I-PER':['Michael', 'John', 'David', 'Thomas', 'Martin', 'Paul']时，词表里实际插入了一个名称为"I-PER"的新token，它初始化的embedding为['Michael', 'John',...

rtmaww

关于Label Mapping

Could you provide the dev set?

How do you reproduce the NN model in the baseline

MIT-Movie

关于MLM的问题

请问MSRA的dev数据集怎么得到的

使用bert之后得到的向量是与原始随机初始化embeding直接拼接了吗？

关于GPU使用率问题，以及LSTM比Transformer速度”更快“的问题

论文是用jieba工具分词的吗？

关于bert