simple-evals
Has anyone run this code and cached the granular data?
I'm a student working on a final project and would like to use the granular data here (e.g. not “GPT-4o hits 88.7% on MMLU” but rather “what did it answer for this multiple-choice question?”). I don't have API credits and can't afford them, and since I'm sure this has been run before, re-running it would be wasteful anyway. I'm looking for someone who has used this repo before to point me in the right direction (or, even better, someone who has a CSV with the actual inputs and responses).
Thank you! This will go a long way to help :)