DiLu
DiLu copied to clipboard
Inquiry Regarding the Source of Actions and Post-Processing
Thank you for your excellent work. I would like to ask whether the ground truth actions (fine-tuning the LLM) are derived from an RL policy or human operations. If they are derived from an RL policy, do you remove the collision frames during post-processing? Looking forward to your reply~ @zijinoier