evaluation-framework topic
BIRL
BIRL: Benchmark on Image Registration methods with Landmark validations
OD-test
OD-test: A Less Biased Evaluation of Out-of-Distribution (Outlier) Detectors (PyTorch)
RecSys2019_DeepLearning_Evaluation
This is the repository for our RecSys 2019 article, "Are We Really Making Much Progress? A Worrying Analysis of Recent Neural Recommendation Approaches", and for several follow-up studies.
PyDGN
A research library for automating experiments on Deep Graph Networks
expressive
Expressive is a cross-platform expression parsing and evaluation framework. It is compiled for .NET Standard, so it runs on practically any platform.
evalify
Evaluate your biometric verification models literally in seconds.
DialogEntailment
The implementation of the paper "Evaluating Coherence in Dialogue Systems using Entailment"
PySODEvalToolkit
PySODEvalToolkit: A Python-based Evaluation Toolbox for Salient Object Detection and Camouflaged Object Detection
lm-evaluation
Evaluation suite for large-scale language models.
CrowdFlow
Optical Flow Dataset and Benchmark for Visual Crowd Analysis