how can i reproduce the results on truthfulqa?

Open SuperChanS opened this issue 1 year ago • 1 comments

I notice that operating truthfulqa.sh requires "gpt_true_model_name" and "gpt_info_model_name". But it seems the original model is unavailable now.

Jul 28 '24 15:07 SuperChanS

Yes, this part of the evaluation is unfortunately no longer reproducable due to OpenAI's deprecation of GPT-3-based models. The allenai/open-instruct evaluation frameowrk has switched to finetuned judge models based on Llama2 instead! Please see their evaluation script here.

Jul 29 '24 22:07 alisawuffles