llm-evaluation-framework topic
List
llm-evaluation-framework repositories
promptfoo
3.1k
Stars
205
Forks
Watchers
Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models wit...
deepeval
2.0k
Stars
139
Forks
Watchers
The LLM Evaluation Framework
parea-sdk-py
41
Stars
4
Forks
Watchers
Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)