Haoyan Luo

Results 1 comments of Haoyan Luo

Yes, i believe the hook_normalized works fine. But I think the problem is when retrieving the final normalized output after RMSNorm, say calling `llama_logits, llama_cache = model.run_with_cache(llama_tokens)` and to get...