evaluations topic

List evaluations repositories

evaluations

15
Stars
1
Forks
Watchers

This library implements various metrics (including Kaggle Competition, Medicine) for evaluating ML, DL, AI models, and algorithms. 📐📊📈📉📏

42-Evaluations

37
Stars
0
Forks
Watchers

42 School Projects Evaluation Marking Criteria

Crunch

60
Stars
7
Forks
Watchers

The fastest java expression compiler/evaluator

log10

77
Stars
7
Forks
Watchers

Python client library for improving your LLM app accuracy

leaf-playground

21
Stars
0
Forks
Watchers

A framework to build scenario simulation projects where human and LLM based agents can participant in, with a user-friendly web UI to visualize simulation, support automatically evaluation on agent ac...

langtrace

146
Stars
12
Forks
Watchers

Langtrace 🔍 is an open-source, Open Telemetry based end-to-end observability tool for LLM applications, providing real-time tracing, evaluations and metrics for popular LLMs, LLM frameworks, vectorD...