DoomArena
DoomArena copied to clipboard
DoomArena is a Framework for Testing AI Agents Against Evolving Security Threats
DoomArena: A Framework for Testing AI Agents Against Evolving Security Threats

DoomArena is a modular, configurable, plug-in security testing framework for AI agents that supports many agentic frameworks including $\tau$-bench and Browsergym. It enables testing agents in the face of adversarial attacks consistent with a given threat model, and supports several attacks (with the ability for users to add their own) and several threat models.
🚀 Quick Start
The DoomArena Intro Notebook
is a good place for learning hands-on about the core concepts of DoomArena.
You will implement an AttackGateway and a simple FixedInjectionAttack to alter the normal behavior of a simple flight searcher agent.
If you only want to use the library just run
pip install doomarena # core library, minimal dependencies
If you want to run DoomArena integrated with TauBench, additionally run
pip install doomarena-taubench # optional
If you want to run DoomArena integrated with Browsergym, additionally run
pip install doomarena-browsergym # optional
Export relevant API keys into your environment or .env file.
OPENAI_API_KEY="<your api key>"
OPENROUTER_API_KEY="<your api key>"
🛠️ Advanced Setup
To actively develop DoomArena, please create a virtual environment and install the package locally in editable mode using
pip install -e doomarena/core
pip install -e doomarena/taubench
pip install -e doomarena/browsergym
Once the environments are set up, run the tests to make sure everything is working.
make ci-tests
make tests # requires openai key
💻 Running Experiments
Follow the environment-specific instructions for TauBench and BrowserGym
🌟 Contributors
Note: contributions made prior to the open-sourcing are not accounted for; please refer to author list for full list of contributors.
📝 Paper
If you found DoomArena helpful, please cite us
@misc{boisvert2025doomarenaframeworktestingai,
title={DoomArena: A framework for Testing AI Agents Against Evolving Security Threats},
author={Leo Boisvert and Mihir Bansal and Chandra Kiran Reddy Evuru and Gabriel Huang and Abhay Puri and Avinandan Bose and Maryam Fazel and Quentin Cappart and Jason Stanley and Alexandre Lacoste and Alexandre Drouin and Krishnamurthy Dvijotham},
year={2025},
eprint={2504.14064},
archivePrefix={arXiv},
primaryClass={cs.CR},
url={https://arxiv.org/abs/2504.14064},
}