AmazingJ
The official DeBERTa code is extremely heavy on CPU memory and is not suitable for pretraining tasks.
A few questions: 1. When you run the sh file directly and initialize a DeBERTa model from scratch, which tokenizer is used? The sh file does not specify one via any parameter. Which vocabulary is used? 2. If I want to do continued training, what should I do? Is it enough to modify the load_ckpt_path parameter in the .sh file?
@stefan-it But the paper reports using hundreds of GB of data. How did they do it?
First, you need to linearize the AMR graph. You can use Konstas's script or Song's script, because the final performance...
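In case a concrete illustration helps, here is a minimal hand-rolled linearization sketch. This is not Konstas's or Song's actual script; the regex-based variable stripping and the example AMR are my own simplification:

```python
import re

def linearize(penman_str: str) -> str:
    # Drop variable prefixes, e.g. "(w / want-01" -> "(want-01";
    # re-entrant references like ":ARG0 b" are left as bare variables.
    no_vars = re.sub(r"\(\s*\S+\s*/\s*", "(", penman_str)
    # Put spaces around parentheses so they become separate tokens
    spaced = re.sub(r"([()])", r" \1 ", no_vars)
    return " ".join(spaced.split())

amr = "(w / want-01 :ARG0 (b / boy) :ARG1 (g / go-02 :ARG0 b))"
print(linearize(amr))
# -> ( want-01 :ARG0 ( boy ) :ARG1 ( go-02 :ARG0 b ) )
```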
Python has an [anytree](https://pypi.org/project/anytree/2.1.4/) package. You can try it.
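A quick sketch of what anytree looks like in practice (the node names here are made up for illustration):

```python
from anytree import Node, RenderTree

# Build a small tree by attaching children to parents
root = Node("root")
a = Node("a", parent=root)
Node("a1", parent=a)
Node("b", parent=root)

# RenderTree yields (prefix, fill, node) tuples for pretty-printing
for pre, _, node in RenderTree(root):
    print(f"{pre}{node.name}")
```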
After deleting "@@ ", the BLEU score should not drop; it should rise considerably. Are you sure you are running the BPE process correctly? It is worth noting that not...
What I mean is that both the source and target segments need BPE during training, while the target segment does not need BPE during testing. BPE is...
Yes. During testing, only the source side needs BPE; then compute BLEU after deleting the "@@ " markers.
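For reference, a minimal sketch of that post-processing step, assuming subword-nmt style "@@ " continuation markers (the example sentence is made up):

```python
def remove_bpe(line: str) -> str:
    # "@@ " marks a subword that continues into the next token;
    # a bare trailing "@@" can also occur at the end of a line
    return line.replace("@@ ", "").replace("@@", "")

hyp = "the boy wan@@ ts to go ho@@ me"
print(remove_bpe(hyp))  # -> "the boy wants to go home"
```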
On LDC2015E86: 10000
On LDC2017T10: 20000
train_file: cat train_source + train_target
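Assuming those numbers are BPE merge operations, a sketch of learning the codes on the concatenated file with the subword-nmt Python API (file names are placeholders):

```python
# Sketch: learn BPE merges on train_file = cat train_source + train_target,
# assuming the subword-nmt package (pip install subword-nmt).
import codecs
from subword_nmt.learn_bpe import learn_bpe

with codecs.open("train_file", encoding="utf-8") as infile, \
     codecs.open("codes.bpe", "w", encoding="utf-8") as outfile:
    # 10000 merge operations for LDC2015E86 (20000 for LDC2017T10)
    learn_bpe(infile, outfile, num_symbols=10000)
```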