
Google AI 2018 BERT pytorch implementation

66 BERT-pytorch issues

The way the trainer is set up, the `iteration` method used for train and test is the same, except that when a train step is run, backpropagation occurs. But one...
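A minimal sketch of how such a shared iteration loop is commonly structured, assuming a `train` flag is what gates backpropagation (class, method, and batch-key names here are illustrative, not necessarily the repo's exact ones):

```python
import torch.nn as nn
from torch.optim import Adam

class Trainer:
    def __init__(self, model):
        self.model = model
        self.optim = Adam(model.parameters())
        self.criterion = nn.NLLLoss(ignore_index=0)

    def iteration(self, data_loader, train=True):
        # Same loop for train and test; only the weight update differs.
        for batch in data_loader:
            output = self.model(batch["input"])
            loss = self.criterion(output.transpose(1, 2), batch["label"])
            if train:
                # Backpropagation only on the train step;
                # evaluation reuses the loop without touching the weights.
                self.optim.zero_grad()
                loss.backward()
                self.optim.step()
```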

# trainer/pretrain.py

```python
class BERTTrainer:
    def __init__(self, ...):
        ...
        # Using Negative Log Likelihood Loss function for predicting the masked_token
        self.criterion = nn.NLLLoss(ignore_index=0)
        ...
```

I cannot understand why `ignore...
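For reference, a small hedged illustration of what `ignore_index=0` does in `nn.NLLLoss`: target positions equal to 0 (presumably the padding / non-masked label in this vocabulary convention) contribute nothing to the loss; the numbers below are made up.

```python
import torch
import torch.nn as nn

criterion = nn.NLLLoss(ignore_index=0)

# Log-probabilities for 4 token positions over a 5-word vocabulary.
log_probs = torch.log_softmax(torch.randn(4, 5), dim=-1)

# Targets: positions labeled 0 are ignored by the loss entirely.
targets = torch.tensor([3, 0, 2, 0])

loss = criterion(log_probs, targets)  # averaged over the 2 non-ignored positions only
```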

I tried to run it according to the md tutorial, but ![image](https://user-images.githubusercontent.com/30914380/110130632-2e7f5d80-7e04-11eb-807b-12dd6a842b69.png)

For example, if I set d_model = 29 and max_len = 10000: RuntimeError: The expanded size of the tensor (14) must match the existing size (15) at non-singleton dimension 1. Target sizes:...
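That mismatch is consistent with a sinusoidal positional-encoding buffer being filled via even/odd column slices: with an odd d_model, the odd (cosine) slice has one column fewer than the sin/cos source tensor. A hedged reproduction of the shape problem, not necessarily the repo's exact code:

```python
import math
import torch

d_model, max_len = 29, 10000
pe = torch.zeros(max_len, d_model)
position = torch.arange(0, max_len).float().unsqueeze(1)
div_term = torch.exp(torch.arange(0, d_model, 2).float() * -(math.log(10000.0) / d_model))

pe[:, 0::2] = torch.sin(position * div_term)  # 15 even columns, matches 15 values: OK
pe[:, 1::2] = torch.cos(position * div_term)  # only 14 odd columns vs. 15 values: RuntimeError
```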

For learning purposes, I added `example.ipynb`, which is a Google Colab Notebook that works right out of the box. I have also included an example data file that addresses #59...

Is `bert-vocab` a .py script? I can't run it.

I noticed that there is a pooler layer in the original BERT that does not seem to be present here. Did I miss something?
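For comparison, the pooler in the original BERT is just a dense layer plus tanh applied to the hidden state of the first ([CLS]) token. A minimal sketch of that layer, assuming `hidden` is the model's hidden size:

```python
import torch.nn as nn

class BERTPooler(nn.Module):
    def __init__(self, hidden):
        super().__init__()
        self.dense = nn.Linear(hidden, hidden)
        self.activation = nn.Tanh()

    def forward(self, hidden_states):
        # "Pool" by taking the hidden state of the first ([CLS]) token only.
        first_token = hidden_states[:, 0]
        return self.activation(self.dense(first_token))
```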

Hello, I have tried to run _bert_ with _--with_cuda False_, but the model keeps running the `forward` function on CUDA. These are my command line and the error message I got....
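One common pattern for honoring such a flag is to choose the device once from the flag and move both the model and each batch to it. A hedged, self-contained sketch (the linear layer and random batch are stand-ins, not the repo's model):

```python
import torch
import torch.nn as nn

with_cuda = False  # e.g. the value parsed from --with_cuda
device = torch.device("cuda" if with_cuda and torch.cuda.is_available() else "cpu")

model = nn.Linear(8, 2).to(device)    # stand-in for the BERT model
batch = torch.randn(4, 8).to(device)  # stand-in for a data batch
output = model(batch)                 # stays on CPU when with_cuda is False
```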

In dataset.py line 31, it seems to report an error: random.randint() needs two positional arguments.
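For context, `random.randint` requires both endpoints, whereas `random.randrange` accepts a single upper bound; a small illustration (the vocab size is a made-up number):

```python
import random

vocab_size = 100

random.randint(0, vocab_size - 1)  # inclusive endpoints, both required
random.randrange(vocab_size)       # single argument, upper bound exclusive
# random.randint(vocab_size)       # TypeError: missing 1 required positional argument
```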