Jonas Zhou
Results: 2 issues of Jonas Zhou
I tried to reproduce the MEND results on gpt-j-6B and Llama-2-7b, but the ngram-entropy of gpt-j-6B is far below that of Llama-2-7b (around 350 for gpt-j-6B vs. around 550 for Llama-2-7b). Do you have...
question
Hello, your tutorial is very inspiring and helpful to me. Thanks for your nice work. However, when I run final.py, I see numerical precision differences in the outputs among the...