Fabian Degen
Fabian Degen
Hi! After all the einsum removals that were done over the past two weeks, I took a look at this again, and it seems like the logits match very well!...
Hi! After all the einsum removals that were done over the past two weeks, I took a look at this again, and it seems like the logits match very well!...
Hi! After all the einsum removals that were done over the past two weeks, I took a look at this again, and it seems like the logits match very well!...
Hey! How are you loading Llama in? When you load Llama with from_pretrained_no_processing the residual stream values should be very close.