Any website where I can share evaluation results?

Open pocca2048 opened this issue 1 year ago • 7 comments

Describe the feature or improvement you're requesting

Hi.

I was wondering if there are any websites where I can share my evaluation results and see other people's. Do I have to run every eval locally myself to see the accuracy of evals made by other people? Maybe I am missing something, but I think it would be good to have a website like llm_leaderboard for evals too.

Thanks

Additional context

If someone wants to share their eval results, they could run something like:

oaieval gpt-3.5-turbo test-match --submit=True
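Until something like that exists, here is a rough sketch of pulling a shareable summary out of a run log by hand. It assumes the log is a JSONL file in which the run metadata sits under a top-level `spec` key and the metrics under a top-level `final_report` key; that layout is an assumption, so check it against your own log. `extract_summary` is a hypothetical helper, not part of evals, and the sample lines are made up for illustration.

```python
import json
from typing import Any, Dict, Iterable


def extract_summary(log_lines: Iterable[str]) -> Dict[str, Any]:
    """Collect the run spec and final report from JSONL log lines (assumed layout)."""
    summary: Dict[str, Any] = {}
    for line in log_lines:
        line = line.strip()
        if not line:
            continue  # skip blank lines
        record = json.loads(line)
        # Assumption: metadata and metrics each appear as a top-level key
        # on their own line; per-sample event lines are ignored.
        for key in ("spec", "final_report"):
            if key in record:
                summary[key] = record[key]
    return summary


# Made-up log content, for illustration only (not real results).
sample_log = [
    '{"spec": {"eval_name": "test-match", "completion_fns": ["gpt-3.5-turbo"]}}',
    '{"final_report": {"accuracy": 1.0}}',
    '{"run_id": "abc", "event_id": 0, "type": "match", "data": {"correct": true}}',
]

summary = extract_summary(sample_log)
print(json.dumps(summary, indent=2))
```

The printed JSON is small enough to paste into a comment or a gist, which is roughly what a submit option would upload.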

pocca2048, Jun 15 '23 04:06