prompt-testing topic
List
prompt-testing repositories
LLM-RGB
122
Stars
9
Forks
Watchers
LLM Reasoning and Generation Benchmark. Evaluate LLMs in complex scenarios systematically.
promptfoo
6.9k
Stars
552
Forks
Watchers
Test your prompts, agents, and RAGs. Red teaming, pentesting, and vulnerability scanning for LLMs. Compare performance of GPT, Claude, Gemini, Llama, and more. Simple declarative configs with command...
agentic_security
1.7k
Stars
229
Forks
1.7k
Watchers
Agentic LLM Vulnerability Scanner / AI red teaming kit 🧪