ml-safety topic

List ml-safety repositories

giskard

4.0k

Stars

261

Forks

Watchers

🐢 Open-Source Evaluation & Testing for ML & LLM systems

artificial-intelligence

continous-delivery

ME-Net

51

Stars

10

Forks

Watchers

[ICML 2019] ME-Net: Towards Effective Adversarial Robustness with Matrix Estimation

adversarial-attacks

adversarial-example

PromptInject

295

Stars

28

Forks

Watchers

PromptInject is a framework that assembles prompts in a modular fashion to provide a quantitative analysis of the robustness of LLMs to adversarial prompt attacks. 🏆 Best Paper Awards @ NeurIPS ML Sa...

agencyenterprise

awesome-ai-safety

158

Stars

13

Forks

Watchers

📚 A curated list of papers & technical articles on AI Quality & Safety

langtest

545

Stars

50

Forks

545

Watchers

Deliver safe & effective language models

Pacific-AI-Corp

large-language-models

ProjNorm

16

Stars

1

Forks

Watchers

Predicting Out-of-Distribution Error with the Projection Norm

distribution-shift