reinforcement-learning-from-ai-feedback topic

List of reinforcement-learning-from-ai-feedback repositories

R2Vul

15 stars, 2 forks, 15 watchers

R2Vul: Learning to Reason about Software Vulnerabilities with Reinforcement Learning and Structured Reasoning Distillation