reinforcement-finetuning topic

List reinforcement-finetuning repositories

PURE

148
Stars
6
Forks
148
Watchers

Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"

AutoVLA

354
Stars
9
Forks
354
Watchers

[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning