reinforcement-finetuning topic
List
reinforcement-finetuning repositories
PURE
148
Stars
6
Forks
148
Watchers
Official code for the paper, "Stop Summation: Min-Form Credit Assignment Is All Process Reward Model Needs for Reasoning"
AutoVLA
354
Stars
9
Forks
354
Watchers
[NeurIPS 2025] AutoVLA: A Vision-Language-Action Model for End-to-End Autonomous Driving with Adaptive Reasoning and Reinforcement Fine-Tuning