pytorch-transformer
Attention is all you need implementation
### What have I done? - I cloned the project on my MacBook Pro and, using Jupyter Notebook, ran the file https://github.com/hkproj/pytorch-transformer/blob/main/Local_Train.ipynb How to solve the code...
According to the formula, norm = (x - mean) / sqrt(var + eps), not (x - mean) / (std + eps), since sqrt(var + eps) == sqrt(std**2 + eps) != (std + eps)...
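A minimal sketch of the discrepancy this issue reports (the tensor values below are illustrative, not from the repo):

```python
import torch

# The paper's LayerNorm denominator is sqrt(var + eps); the reported code
# uses (std + eps) instead. These differ, since
# sqrt(var + eps) == sqrt(std**2 + eps) != std + eps.
x = torch.tensor([[1.0, 2.0, 3.0, 4.0]])
eps = 1e-6

mean = x.mean(dim=-1, keepdim=True)
var = x.var(dim=-1, unbiased=False, keepdim=True)
std = var.sqrt()

correct = (x - mean) / torch.sqrt(var + eps)   # matches the formula
incorrect = (x - mean) / (std + eps)           # what the issue points out

# PyTorch's built-in layer_norm agrees with the sqrt(var + eps) form:
reference = torch.nn.functional.layer_norm(x, normalized_shape=(4,), eps=eps)
```

The two forms are numerically close for small eps, but only the sqrt(var + eps) version matches `torch.nn.functional.layer_norm`.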
If the input sentence is (A, B, C, D, PAD), the encoder mask in this implementation is [[[False, False, False, True]]], but the encoder attention is [[AA, AB, AC, AD], [BA,...
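A hedged sketch of the mask behavior the issue is asking about (shapes and token labels are illustrative): a (1, 1, seq_len) key mask broadcasts over all query rows, so PAD *columns* get zero attention weight, but the PAD *row* (queries coming from the PAD position) still produces a valid distribution over the non-PAD keys.

```python
import torch

seq_len = 5                      # tokens: A, B, C, D, PAD
# True = keep, False = mask out; shape (1, 1, seq_len) broadcasts over rows
pad_mask = torch.tensor([[[True, True, True, True, False]]])

scores = torch.randn(1, seq_len, seq_len)            # raw attention scores
scores = scores.masked_fill(~pad_mask, float("-inf"))
attn = scores.softmax(dim=-1)

# The PAD column receives zero weight in every row, yet every row
# (including the PAD query row) still sums to 1 over the non-PAD keys.
```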
Hi @hkproj , I found an issue with the latest_weights_file_path() function in config.py. The original code uses weights_files.sort() to sort the files and selects the last one as the latest...
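A sketch of the failure mode behind this issue: a plain string sort orders epoch numbers lexicographically, so a higher epoch can sort before a lower one (the filenames below are illustrative, not the repo's exact config):

```python
from pathlib import Path

files = ["tmodel_9.pt", "tmodel_10.pt", "tmodel_100.pt"]

files.sort()                       # lexicographic: wrong "latest"
print(files[-1])                   # -> tmodel_9.pt

# A numeric sort key fixes the ordering:
def epoch_num(name: str) -> int:
    # parse the epoch number out of the filename stem
    return int(Path(name).stem.split("_")[-1])

files.sort(key=epoch_num)
print(files[-1])                   # -> tmodel_100.pt
```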
Hi @hkproj, I found an issue with the BLEU score calculation in train.py. The torchmetrics.BLEUScore() function expects a list of reference sentences but receives a single sentence instead. Here is...
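A hedged sketch of the shape difference being reported (sentences are illustrative; the metric call is shown commented out since it requires torchmetrics): `BLEUScore` takes a *list* of reference sentences per prediction, to support multiple references, not a bare string.

```python
predicted = ["the cat sat on the mat"]

targets_buggy = ["the cat sat on the mat"]     # bare sentence (the reported bug)
targets_fixed = [["the cat sat on the mat"]]   # one list of references per prediction

# With torchmetrics installed, the corrected call shape would be:
# metric = torchmetrics.BLEUScore()
# score = metric(predicted, targets_fixed)
```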
```
class Encoder(nn.Module):

    def __init__(self, features: int, layers: nn.ModuleList) -> None:
        super().__init__()
        self.layers = layers
        self.norm = LayerNormalization(features)

    def forward(self, x, mask):
        for layer in self.layers:
            x = layer(x, mask)
        return ...
```
Even after training for 30 epochs with batch size 32 and a lr of 1e-4, the predicted results are very poor. What can be done? ``` -------------------------------------------------------------------------------- SOURCE: Karenin was arguing...
Colab training Fix: Clean up unused imports and update the local configuration for Colab training. 1. Removing unused dependencies. 2. Updating local file paths in the configuration for smooth execution...