jackeymango

Results 15 comments of jackeymango
trafficstars

> * In line 582, history mask is being generated. * is the multiplication operation that converts the self-attention mask of the target tokens into a corresponding history based mask...

> Okay, I think you can change the type of the history mask to bool as: "history_mask.to(torch.bool)". So, now two bool matrices can be multiplied. >Thanks,But the loss also can't...

> okay, are you using float16? If so, try to remove the float16 option. > Or could you paste your command line argument that you use to run the code....

I have modified line 215 in transformer.py ,changed batch_A = batch_A.masked_fill(mask==0,float("inf")) # Works in v0.4. Does this affect the model?

> Thanks! > Does this change make the model runs okay? >No! It can't runs okey.

I really hope you can give me some suggestions! thank you very much!

Thanks,there are the args: (1)pytorch is under the official website, and the conda download statement is this: conda install pytorch torchvision cudatoolkit=9.2 -c pytorch (2)os:linux (3)Graphics card information: ![image](https://user-images.githubusercontent.com/66673974/84972605-ce4f6b00-b151-11ea-9fac-3f6cd1d44f07.png)

> Thanks for the information! > I have made a couple of changes to the code to make it compatible with pytorch v1.4. It works fine on my compute machine....

Thanks,I can run this! But when I run the bpe_pipeline_MT.sh ,there is a problem: Traceback (most recent call last): File "/work/zhangzhongze/multilingual_nmt/train.py", line 457, in main() File "/work/zhangzhongze/multilingual_nmt/train.py", line 381, in...