evaluation topic
booleval
Header-only C++17 library for evaluating logical expressions.
midi_degradation_toolkit
A toolkit for generating datasets of midi files which have been degraded to be 'un-musical'.
clearmetrics
Python implementation of CLEAR multi object tracking (MOT) evaluation metrics
NAS-Benchmark
[ICLR 2020] NAS evaluation is frustratingly hard
polara
Recommender system and evaluation framework for top-n recommendations tasks that respects polarity of feedbacks. Fast, flexible and easy to use. Written in python, boosted by scientific python stack.
DallEval
DALL-Eval: Probing the Reasoning Skills and Social Biases of Text-to-Image Generation Models (ICCV 2023)
awesome-semantic-segmentation
:metal: awesome-semantic-segmentation
toolkit-legacy
Visual Object Tracking (VOT) challenge evaluation toolkit
EvalAI
:cloud: :rocket: :bar_chart: :chart_with_upwards_trend: Evaluating state of the art in AI
datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools