Inquiry Regarding the Source of Actions and Post-Processing

Open HaoChenga opened this issue 1 year ago • 0 comments

Thank you for your excellent work. I would like to ask whether the ground truth actions (fine-tuning the LLM) are derived from an RL policy or human operations. If they are derived from an RL policy, do you remove the collision frames during post-processing? Looking forward to your reply~ @zijinoier

Sep 10 '24 10:09 HaoChenga