prompt-testing topic
List
prompt-testing repositories
LLM-RGB
122
Stars
9
Forks
Watchers
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
promptfoo
4.5k
Stars
346
Forks
Watchers
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command...