bert4pytorch issues

和bert4torch实现有点类似

看到另外一个仓库[bert4torch](https://github.com/Tongjilibo/bert4torch)，感觉改写的有点类似啊

HiNLP

导入hugging face 的bert-base-chinese存在bug

1

modelling文件中variable mapping函数中的mapping存在bug. hugging face的模型文件中layerNorm参数用的是gamma和beta, 作者给的是weight bias, 不匹配

DimariaW

返回 embedding 和 huggingface 的返回结果不完全一致

4

比如 bert-base-chinese，作者是否有做过这方面的评估测试呀～

mmmwhy

Transformer Quality in Linear Time gate control unit and FLASH code

Transformer Quality in Linear Time Weizhe Hua, Zihang Dai, Hanxiao Liu, Quoc V. Le We revisit the design choices in Transformers, and propose methods to address their weaknesses in handling...

aoom

KeyError: 'bert.embeddings.LayerNorm.gamma'

2

有人知道这个问题改怎么解决吗 File "F:\software\Anaconda\envs\tensor\lib\site-packages\bert4pytorch\modeling.py", line 71, in load_weights_from_pytorch_checkpoint state_dict[new_key] = state_dict.pop(old_key) KeyError: 'bert.embeddings.LayerNorm.gamma'

fangzhaoyi1995

请问/chinese_L-12_H-768_A-12/下的文件到哪里去下载呢

3

xtv417810

LabelSmoothingCrossEntropy中的疑问

2

LabelSmoothingCrossEntropy这个函数最终返回的总loss的前半部分: loss*self.eps/c ，这里c是类别个数，我发现有的公式里写的这里应该是除以类别个数减一。请教一下到底要不要减一

luxuantao

LayerNorm 类有个小错误

1

if conditional: self.dense1 = nn.Linear(2 * hidden_size, hidden_size, bias=False) self.dense.weight.data.uniform_(0, 0) -------> 此处应该self.dense1, 下边的self.dense2 也是一样的

di-osc

可以加载bert4keras里面提供的模型吗

6

可以加载bert4keras里面提供的模型吗

yuanjie-ai

bert4pytorch
bert4pytorch copied to clipboard

Metadata

批量加载数据问题

和bert4torch实现有点类似

导入hugging face 的bert-base-chinese存在bug

返回 embedding 和 huggingface 的返回结果不完全一致

Transformer Quality in Linear Time gate control unit and FLASH code

KeyError: 'bert.embeddings.LayerNorm.gamma'

请问/chinese_L-12_H-768_A-12/下的文件到哪里去下载呢

LabelSmoothingCrossEntropy中的疑问

LayerNorm 类有个小错误

可以加载bert4keras里面提供的模型吗

← Metadata

Owner

Metadata

bert4pytorch bert4pytorch copied to clipboard

Metadata

← Metadata

Owner

Metadata

bert4pytorch
bert4pytorch copied to clipboard