evaluation-metrics topics

A more complete python version (GPU) of the evaluation for salient object detection (with S-measure, Fbw measure, MAE, max/mean/adaptive F-measure, max/mean/adaptive E-measure, PRcurve and F-measure c...

zyjwuyan

evaluation-metrics

gpu-acceleration

salient-object-detection

sod

deepeval

9.9k

Stars

860

Forks

Watchers

The LLM Evaluation Framework

confident-ai

evaluation-framework

evaluation-metrics

llm-evaluation

llm-evaluation-framework

tonic_validate

253

Stars

26

Forks

Watchers

Metrics to evaluate the quality of responses of your Retrieval Augmented Generation (RAG) applications.

TonicAI

evaluation-framework

evaluation-metrics

large-language-models

llm

athina-evals

210

Stars

12

Forks

Watchers

Python SDK for running evaluations on LLM generated responses

athina-ai

evaluation

evaluation-framework

evaluation-metrics

llm-eval

codebleu

84

Stars

17

Forks

Watchers

Pip compatible CodeBLEU metric implementation available for linux/macos/win

k4black

code

code-evaluation

code-generation

codebleu

ClayRS

31

Stars

4

Forks

Watchers

Complexly represent contents, build recommender systems, evaluate them. All in one place!

swapUniba

content-based-recommendation

evaluation-metrics

graph-based-recommendation

python

faster_coco_eval

136

Stars

12

Forks

136

Watchers

Continuation of an abandoned project fast-coco-eval

MiXaiLL76

coco

evaluation-metrics

pycocotools

python