Xueqing Wu

Results 6 comments of Xueqing Wu

Hi, what do you mean by the BERT scores are mismatching while training? We do not need to calculate BERT scores during training. As in the README, `bert_score` package is...

Hi, I don't think you need these two files? I think you only need ${BART_DIR}/dict.txt

@harryniuby 你可以尝试用我发布的数据训练一个模型,然后用你的文本infer。但是我不确定效果怎么样

I also feel confused about this... It seems that as x.size(1) (that is, maximum graph size of a batch) varies from time to time, the size of BatchNorm cannot be...

Same problem. Disabling fused_mlp works for me. Note: use .pt file, not .safetensors; for some reason .safetensors still triggers the error

Hi, for me the problem is resolved by running gradio demo **AFTER** the backend is setup, rather than in the second step right after controller is launched as in README....