Heming Xia

Results 16 comments of Heming Xia

这个模型训的非常的简陋Hhh,只是个新手入门参考,machine translation想得到比较好的效果可以试试[fairseq](https://github.com/facebookresearch/fairseq): - [examples/translation](https://github.com/facebookresearch/fairseq/tree/main/examples/translation)

对的 可能是 @zjmwqx 提到的问题,这个repository只是一个简单实现的参考,可以根据自己需要修改代码

非finetune的应该没用过 这里是有问题的 可以自己改一下 不难的~

不好意思,这两天比较忙,下周我看一下这个问题~

评测脚本在[这里](https://github.com/hemingkx/WordSeg#dataset)有说明~ ``` 如有需要,这里是PKU数据集及评测脚本的百度网盘地址👇: 链接: https://pan.baidu.com/s/1yZrDSrvAXi-0mCxRp3L-sw 密码: 31qd 将数据集及评测脚本解压后放置在所用模型的./data/路径下即可。 ``` inference都包含在run.py中了,可以根据需要自行修改~

因为已经过了一段时间,训练好的模型已经被清除了,过几天会重新跑一遍更新一个网盘链接~

Hi @ndkmath1, I noticed that you put the NAT drafter and the AT verifier in opposite positions. They should be placed as follows. Can you try again the correct way?...

You can pre-process the official WMT14 EN-DE `test.en` file with our bpemodel [here](https://github.com/microsoft/unilm/blob/master/decoding/GAD/data/wmt14.en-de/bpe.32000). Here are the first three lines of our test file: ``` Gut@@ ach : Incre@@ ased safety...

The bpe codes we used are from the released data by Google. However, you can find a pre-process script [here](https://github.com/facebookresearch/fairseq/blob/main/examples/translation/prepare-wmt14en2de.sh), which works fine with wmt14.en-de raw data.