llm-evaluation-framework topic

List llm-evaluation-framework repositories

promptfoo

3.1k
Stars
205
Forks
Watchers

Test your prompts, models, and RAGs. Catch regressions and improve prompt quality. LLM evals for OpenAI, Azure, Anthropic, Gemini, Mistral, Llama, Bedrock, Ollama, and other local & private models wit...

parea-sdk-py

41
Stars
4
Forks
Watchers

Python SDK for experimenting, testing, evaluating & monitoring LLM-powered applications - Parea AI (YC S23)