Andy comments

Results 7 comments of


                                            Andy

set for squad 2.0

Sorry, but i did not implement the model with squad2.0 yet.

Some error when training model

https://github.com/pytorch/pytorch/issues/2341 You could take a look at this thread. Set dataloader number of workers to 0 to see the actual bug.

mask_logits function

I don't think it will make a big difference since -1e30 is already a extremely small value and should mask correctly.

This repo cannot reproduce the result of original paper.

What hidden size did you use? I have tried 96 and 128. 128 performs better. You can try tuning the hidden size.

This repo cannot reproduce the result of original paper.

The hyper parameters of the repository is mostly based on “NLPLearn/QANet”, so the results are similar. I have tried to reproduce the result of the paper. But with limited resources,...

GPU memory explode after 3 steps

I have implemented a repository [QANet](https://github.com/andy840314/QANet-pytorch-), mostly based on this repository and another Tensorflow implementation [Tensorflow QANet](https://github.com/NLPLearn/QANet). I can reach F1: 75.0 EM: 64.0 in 60000 steps. You could take...

GPU memory explode after 3 steps

@BangLiu i'm not sure, but i will try adding EMA first.