Issues by Michaelikarasik (1 result)

I'm attempting to zero-ablate all self-attention outputs at the last token position across all layers, so that the model's prediction depends only on the last token and is not affected...
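
A minimal sketch of that ablation, assuming a TransformerLens `HookedTransformer` (the issue doesn't name the library or model; `"gpt2"`, the prompt, and the hook function name below are illustrative): zero the attention block's contribution to the residual stream at the final position in every layer, then compare the ablated logits against a clean run.

```python
# Hedged sketch: zero-ablate the attention output at the last token
# position in every layer. Assumes TransformerLens; "gpt2" and the
# prompt are placeholder choices, not taken from the original issue.
from transformer_lens import HookedTransformer

model = HookedTransformer.from_pretrained("gpt2")
tokens = model.to_tokens("The Eiffel Tower is located in")

def zero_last_pos_attn(attn_out, hook):
    # attn_out has shape [batch, pos, d_model]: the attention block's
    # contribution to the residual stream. Zeroing it at the final
    # position removes all attention output for the last token.
    attn_out[:, -1, :] = 0.0
    return attn_out

# Attach the hook to every layer's attention output.
fwd_hooks = [(f"blocks.{layer}.hook_attn_out", zero_last_pos_attn)
             for layer in range(model.cfg.n_layers)]

clean_logits = model(tokens)
ablated_logits = model.run_with_hooks(tokens, fwd_hooks=fwd_hooks)

# Compare next-token predictions with and without the ablation.
print(model.tokenizer.decode(clean_logits[0, -1].argmax().item()))
print(model.tokenizer.decode(ablated_logits[0, -1].argmax().item()))
```

Note one subtlety the truncated issue may be getting at: with this ablation the last position still receives information from earlier tokens indirectly, since each layer's MLP and residual stream at earlier positions were themselves shaped by attention; only the direct attention writes at the final position are removed.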