Jinliang Zheng

Results 3 comments of Jinliang Zheng

> Great job! I have a question about rewarding learning,Is the reward calculated directly by S(φ(on), ψ(l)) or S(φ(on+1), ψ(l)) − S(φ(on), ψ(l))? I noticed that the get_reward function of...

Further, I’d like to check X-VLA’s performance after post-training with the LeRobot pipeline. Does it match the officially reported results?

Universal Actions for Enhanced Embodied Foundation Models Paper: https://arxiv.org/abs/2501.10105 Code: https://github.com/2toinf/UniAct Project: https://2toinf.github.io/UniAct/