HaluEval

I added HaluEval to lm-eval-harness, can you please double check?

Open pminervini opened this issue 1 year ago • 3 comments
Here is my pull request: https://github.com/EleutherAI/lm-evaluation-harness/pull/1076

Thanks!

pminervini • Dec 07 '23 15:12

Sorry for the late reply. We have double-checked the PR and think there is no problem with this implementation.

Xiaoxue-xx • Feb 12 '24 03:02

@Xiaoxue-xx thank you! 🙂 You can find the latest version of the tasks here: https://huggingface.co/spaces/hallucinations-leaderboard/leaderboard/tree/main/src/backend/tasks/halueval

We used it for our Hallucinations Leaderboard: https://huggingface.co/blog/leaderboards-on-the-hub-hallucinations

@Xiaoxue-xx, would you also be happy if we switched from an open-ended generation task to a multiple-choice task, where the model is only allowed to generate "yes" or "no"?

pminervini • Feb 12 '24 08:02

Sure. Thank you for your attention to HaluEval!

Xiaoxue-xx • Feb 14 '24 06:02