openpi icon indicating copy to clipboard operation
openpi copied to clipboard

Questions about Pi0.5 Model Training Details and High Level Planning Implementation

Open Ginldaj opened this issue 2 weeks ago • 0 comments

Hello, while studying the Pi0.5 model, I have two questions regarding the model implementation that I would like to ask you:

1、The paper mentions that the model adopts two-stage pre-training and designs a comprehensive loss function. However, when checking the compute_loss part in the open-source code, it is found that currently only the action loss is calculated, and the loss related to the VLM (Vision-Language Model) in the pre-training stage is not reflected. I would like to confirm whether this part is implemented elsewhere in the code or if there are other design considerations?

2、The ablation experiments in the paper show that the jointly trained Pi0.5 performs excellently in explicit and implicit High Level planning, even better than GPT4 and manual upper-level planning. However, from the open-source model code, the implementation part related to the High Level planning step has not been found for the time being. I would like to know how this part of the function is reflected in the code?

Looking forward to your reply, thank you!

Ginldaj avatar Nov 18 '25 01:11 Ginldaj