jackeymango
jackeymango
> * In line 582, history mask is being generated. * is the multiplication operation that converts the self-attention mask of the target tokens into a corresponding history based mask...
> Okay, I think you can change the type of the history mask to bool as: "history_mask.to(torch.bool)". So, now two bool matrices can be multiplied. >Thanks,But the loss also can't...
> okay, are you using float16? If so, try to remove the float16 option. > Or could you paste your command line argument that you use to run the code....
I have modified line 215 in transformer.py ,changed batch_A = batch_A.masked_fill(mask==0,float("inf")) # Works in v0.4. Does this affect the model?
> Thanks! > Does this change make the model runs okay? >No! It can't runs okey.
I really hope you can give me some suggestions! thank you very much!
Thanks,there are the args: (1)pytorch is under the official website, and the conda download statement is this: conda install pytorch torchvision cudatoolkit=9.2 -c pytorch (2)os:linux (3)Graphics card information: 
> Thanks for the information! > I have made a couple of changes to the code to make it compatible with pytorch v1.4. It works fine on my compute machine....
Thanks,I can run this! But when I run the bpe_pipeline_MT.sh ,there is a problem: Traceback (most recent call last): File "/work/zhangzhongze/multilingual_nmt/train.py", line 457, in main() File "/work/zhangzhongze/multilingual_nmt/train.py", line 381, in...