ai-safety topics

A project to add scalable state-of-the-art out-of-distribution detection (open set recognition) support by changing two lines of code! Perform efficient inferences (i.e., do not increase inference tim...

dlmacedo

ai-safety

anomaly-detection

deep-learning

machine-learning

distinction-maximization-loss

45

Stars

5

Forks

Watchers

A project to improve out-of-distribution detection (open set recognition) and uncertainty estimation by changing a few lines of code in your project! Perform efficient inferences (i.e., do not increas...

awesome-ai-alignment

57

Stars

9

Forks

Watchers

A curated list of awesome resources for getting-started-with and staying-in-touch-with Artificial Intelligence Alignment research.

PromptInject

276

Stars

27

Forks

Watchers

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Sa...

agi