Haoyan Luo
Results
1
comments of
Haoyan Luo
Yes, i believe the hook_normalized works fine. But I think the problem is when retrieving the final normalized output after RMSNorm, say calling `llama_logits, llama_cache = model.run_with_cache(llama_tokens)` and to get...