Any plan for releasing the evaluation code?
Great Work, Sincerely Congratulations!
I'm very interested in your work, is that possible for your team to also release the evaluation code which you used for the benchmark testing? For example, if i want to test LLaDA through the lm-evaluation-harness, what should i do?
I also found it challenging to integrate the loglikelihood and generate functions into lm-evaluation-harness. If the author could open-source the relevant parts of lm-evaluation-harness, it would be greatly helpful to me.
Hello, thank you for your interest in LLaDA. We plan to open-source the evaluation metrics for the LLaDA Base model using the lm-evaluation-harness library. This may take some time to organize the code and go through the open-source process.
Hello, thank you for your interest in LLaDA. We plan to open-source the evaluation metrics for the LLaDA Base model using the lm-evaluation-harness library. This may take some time to organize the code and go through the open-source process.
thit is really good to know, sincerely thanks. Looking forward to it : )
Thank you for your attention. We have released the code for the evaluation using the open source library.