llm-safety topic

List llm-safety repositories

resta

20
Stars
1
Forks
Watchers

Restore safety in fine-tuned language models through task arithmetic

OpenRedTeaming

37
Stars
2
Forks
Watchers

Papers about red teaming LLMs and Multimodal models.