reinforcement-learning-from-ai-feedback topic
List
reinforcement-learning-from-ai-feedback repositories
R2Vul
15
Stars
2
Forks
15
Watchers
R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation