evals
Is there a website where I can share evaluation results?
Describe the feature or improvement you're requesting
Hi.
I was wondering whether there is a website where I can share my evaluation results and see other people's.
Do I have to run every eval locally myself to see the accuracy reported for evals made by other people?
Maybe I am missing something, but I think it would be good to have a website like llm_leaderboard for evals too.
Thanks
Additional context
If someone wants to share their eval results, they could run something like:
oaieval gpt-3.5-turbo test-match --submit=True