deception icon indicating copy to clipboard operation
deception copied to clipboard

Benchmark evaluating LLMs on their ability to create and resist disinformation. Includes comprehensive testing across major models (Claude, GPT-4, Gemini, Llama, etc.) with standardized evaluation met...

Results 0 deception issues
Sort by recently updated
recently updated
newest added