prompt-testing topic

List prompt-testing repositories

LLM-RGB

Stars: 122
Forks: 9

LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.

promptfoo

Stars: 4.5k
Forks: 346

Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command...