I have the same issue. Why was the loss calculated on all tokens?
@EmaadKhwaja `return logits[~mask], target[~mask]` seems problematic; we should compute the loss on the masked tokens instead: `return logits[mask], target[mask]`.
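For reference, here is a minimal sketch (not the repository's code) of how the loss would be restricted to the masked positions, assuming `logits` of shape `(batch, seq_len, vocab)`, `target` of shape `(batch, seq_len)`, and a boolean `mask` that is `True` at the masked positions:

```python
import torch
import torch.nn.functional as F

def masked_token_loss(logits: torch.Tensor, target: torch.Tensor, mask: torch.Tensor) -> torch.Tensor:
    # Boolean indexing keeps only the masked positions:
    #   logits[mask] -> (num_masked, vocab), target[mask] -> (num_masked,)
    # so cross-entropy is averaged over masked tokens only.
    return F.cross_entropy(logits[mask], target[mask])
```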
> @xuesongnie it's because the calculated mask is applied to the wrong values. The other option would be to do `r = math.floor((1 - self.gamma(np.random.uniform())) * z_indices.shape[1])`, but I don't like that...
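To make the two options concrete, here is a hedged sketch using the variable names from the snippets above (`self.gamma`, `z_indices`); it is an illustration under those assumptions, not the released code:

```python
import math
import numpy as np
import torch

def sample_mask(z_indices: torch.Tensor, gamma) -> torch.Tensor:
    """Sample a boolean mask over token positions (illustrative only)."""
    L = z_indices.shape[1]
    # Option A: mask r = floor(gamma(u) * L) positions, so `mask` below marks
    # the MASKED tokens and the loss must use logits[mask], target[mask].
    # Option B (the alternative quoted above) would instead use
    # r = math.floor((1 - gamma(np.random.uniform())) * L), which flips the
    # semantics: `mask` would then mark the KEPT tokens and the masked
    # positions would be `~mask`.
    r = math.floor(gamma(np.random.uniform()) * L)
    sample = torch.rand(z_indices.shape, device=z_indices.device).topk(r, dim=1).indices
    mask = torch.zeros(z_indices.shape, dtype=torch.bool, device=z_indices.device)
    mask.scatter_(dim=1, index=sample, value=True)
    return mask
```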
When will the code be released?