ml-safety topic
ml-safety repositories
giskard
4.0k
Stars
261
Forks
🐢 Open-Source Evaluation & Testing for ML & LLM systems
ME-Net
51
Stars
10
Forks
[ICML 2019] ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation
PromptInject
295
Stars
28
Forks
PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Sa...
awesome-ai-safety
158
Stars
13
Forks
📚 A curated list of papers & technical articles on AI Quality & Safety
langtest
545
Stars
50
Forks
545
Watchers
Deliver safe & effective language models
ProjNorm
16
Stars
1
Forks
Predicting Out-of-Distribution Error with the Projection Norm