sotabench-eval
sotabench-eval copied to clipboard
Easily evaluate machine learning models on public benchmarks
# Patching CVE-2007-4559 Hi, we are security researchers from the Advanced Research Center at [Trellix](https://www.trellix.com). We have began a campaign to patch a widespread bug named CVE-2007-4559. CVE-2007-4559 is a...
Hi, I'm proposing to integrate the Tatoeba machine translation dataset into sotabench-eval. I have included code for running the tests, modeled after WMT, and for downloading and configuring the data....
This looks like an awesome project. Would be great if there was a way to report hyper parameters with each submission.
Bumps [black](https://github.com/psf/black) from 19.3b0 to 24.3.0. Release notes Sourced from black's releases. 24.3.0 Highlights This release is a milestone: it fixes Black's first CVE security vulnerability. If you run Black...