evaluation-metrics topic
VL-CheckList
Evaluating Vision & Language Pretraining Models with Objects, Attributes and Relations.
TopPR
NeurIPS 2023 - TopP&R: Robust Support Estimation Approach for Evaluating Fidelity and Diversity in Generative Models Official Code
kolena
Python client for Kolena's machine learning testing platform
SOD_Evaluation_Metrics
A more complete python version (GPU) of the evaluation for salient object detection (with S-measure, Fbw measure, MAE, max/mean/adaptive F-measure, max/mean/adaptive E-measure, PRcurve and F-measure c...
deepeval
The LLM Evaluation Framework
tonic_validate
Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.
athina-evals
Python SDK for running evaluations on LLM generated responses
codebleu
Pip compatible CodeBLEU metric implementation available for linux/macos/win
ClayRS
Complexly represent contents, build recommender systems, evaluate them. All in one place!
faster_coco_eval
Continuation of an abandoned project fast-coco-eval