Qingyuan Yang

2 comments by Qingyuan Yang

When VPP is enabled, the `(pipeline_rank - 1)` calculation here doesn't handle the case where `num_layers_in_first_pipeline_stage` is `None`, whereas the non-VPP branch below does.

![Image](https://github.com/user-attachments/assets/73d19e79-e1fc-4d11-95c5-71c7f07f8c52)
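To make the concern concrete, here is a minimal, self-contained sketch. It is not the code under review: the function names, parameters, and the `guard_none_first` switch are hypothetical, and the layer layout (one custom-sized last stage, evenly split remaining stages, interleaved VPP chunks) is an assumption made only for illustration. It shows that subtracting 1 from `pipeline_rank` is only right when a custom first stage exists; otherwise rank 0 is itself one of the evenly split stages.

```python
from typing import Optional


def middle_stage_index(pipeline_rank: int, first_stage_layers: Optional[int]) -> int:
    """Index of this rank among the evenly split ("middle") stages (hypothetical helper)."""
    # With no custom first stage, rank 0 is itself a middle stage, so no shift.
    if first_stage_layers is None:
        return pipeline_rank
    return pipeline_rank - 1


def vpp_layer_offset(
    pipeline_rank: int,
    vp_rank: int,
    vp_size: int,
    pp_size: int,
    num_layers: int,
    first_stage_layers: Optional[int],
    last_stage_layers: Optional[int],
    guard_none_first: bool = True,
) -> int:
    """Illustrative layer offset for one virtual chunk (assumed layout, not a real API)."""
    first = first_stage_layers or 0
    last = last_stage_layers or 0
    # Stages left over after removing the custom-sized first/last stages.
    middle_stages = (
        pp_size
        - int(first_stage_layers is not None)
        - int(last_stage_layers is not None)
    )
    if middle_stages > 0:
        middle_chunk = (num_layers - first - last) // middle_stages // vp_size
    else:
        middle_chunk = 0
    first_chunk = first // vp_size
    layers_per_vp_stage = num_layers // vp_size

    if pipeline_rank == 0:
        return vp_rank * layers_per_vp_stage

    if guard_none_first:
        idx = middle_stage_index(pipeline_rank, first_stage_layers)
    else:
        # Unconditional shift: the pattern the comment is questioning.
        idx = pipeline_rank - 1
    return vp_rank * layers_per_vp_stage + first_chunk + idx * middle_chunk


# 14 layers, 4 pipeline stages, 2 virtual chunks, only the last stage customized.
common = dict(vp_rank=0, vp_size=2, pp_size=4, num_layers=14,
              first_stage_layers=None, last_stage_layers=2)
guarded = vpp_layer_offset(pipeline_rank=1, **common)
unguarded = vpp_layer_offset(pipeline_rank=1, guard_none_first=False, **common)
print(guarded, unguarded)
```

In this assumed layout, the unguarded variant gives rank 1 the same offset as rank 0, so two stages would claim the same layers. When a first-stage size is actually set, both variants agree.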