Results 4 comments of Fabian Degen

Hi! After all the einsum removals that were done over the past two weeks, I took a look at this again, and it seems like the logits match very well!...

Hey! How are you loading Llama? If you load it with from_pretrained_no_processing, the residual stream values should be very close.
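
For reference, here is a minimal sketch of how such a check could look. It assumes the comment refers to TransformerLens' HookedTransformer.from_pretrained_no_processing and that the reference model comes from Hugging Face transformers. The helper names (max_abs_diff, compare_residual_streams), the prompt, and the example checkpoint name are all hypothetical; the thread does not say which Llama variant is being loaded.

```python
def max_abs_diff(a, b):
    """Largest elementwise absolute difference between two equal-length float sequences."""
    if len(a) != len(b):
        raise ValueError("sequences must have the same length")
    return max((abs(x - y) for x, y in zip(a, b)), default=0.0)


def compare_residual_streams(model_name, prompt="Hello world"):
    """Return {layer: max abs diff} between HF hidden states and TransformerLens resid_pre.

    Assumption: model_name is a checkpoint supported by both libraries.
    """
    # Heavy imports are kept local so max_abs_diff stays usable on its own.
    import torch
    from transformers import AutoModelForCausalLM
    from transformer_lens import HookedTransformer

    hf_model = AutoModelForCausalLM.from_pretrained(model_name)
    hf_model.eval()

    # No LayerNorm folding or weight centering, so activations stay comparable to HF.
    tl_model = HookedTransformer.from_pretrained_no_processing(
        model_name, hf_model=hf_model, device="cpu"
    )

    # Use the same token ids (including BOS) for both models.
    tokens = tl_model.to_tokens(prompt)

    with torch.no_grad():
        hf_out = hf_model(tokens, output_hidden_states=True)
        _, cache = tl_model.run_with_cache(tokens)

    diffs = {}
    for layer in range(tl_model.cfg.n_layers):
        # hidden_states[layer] is the input to block `layer`, i.e. resid_pre of that block.
        hf_resid = hf_out.hidden_states[layer].flatten().tolist()
        tl_resid = cache["resid_pre", layer].flatten().tolist()
        diffs[layer] = max_abs_diff(hf_resid, tl_resid)
    return diffs


# Example call (checkpoint name is only an illustration, not taken from the thread):
# print(compare_residual_streams("meta-llama/Llama-2-7b-hf"))
```

The per-layer comparison uses resid_pre rather than the last hidden state because the final Hugging Face hidden state for Llama already has the final norm applied, which would not line up with the raw residual stream.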