trustworthy-ai topic
giskard
🐢 Open-Source Evaluation & Testing for ML & LLM systems
nnv
Neural Network Verification Software Tool
FSRL
🚀 A fast safe reinforcement learning library in PyTorch
SyReNN
SyReNN: Symbolic Representations for Neural Networks
FAME
Framework for Adversarial Malware Evaluation.
MERLIN
MERLIN is a global, model-agnostic, contrastive explainer for any tabular or text classifier. It provides contrastive explanations of how the behaviour of two machine learning models differs.
robust-deep-learning
A project to train your model from scratch or fine-tune a pretrained model, using the losses provided in this library to improve out-of-distribution detection and uncertainty estimation performance. C...
FNI-RL
[TPAMI, 2023] Fear-Neuro-Inspired Reinforcement Learning for Safe Autonomous Driving
TrustLLM
[ICML 2024] TrustLLM: Trustworthiness in Large Language Models
WatermarkDM
Code for the paper "A Recipe for Watermarking Diffusion Models"