Qingyuan Yang
Here, the `(pipeline_rank - 1)` calculation in the VPP-enabled branch doesn't handle the case where `num_layers_in_first_pipeline_stage` is None, unlike the logic in the VPP-disabled branch below.
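To illustrate the concern, here is a minimal sketch, not the repository's actual code. The function names `middle_stage_index` and `chunk_offset` are hypothetical, and the offset is simplified to a single virtual chunk. It assumes the intended behavior is that rank 0 is only skipped when a custom first stage is configured.

```python
from typing import Optional


def middle_stage_index(
    pipeline_rank: int, num_layers_in_first_pipeline_stage: Optional[int]
) -> int:
    """Count the regular ("middle") stages that precede this rank (hypothetical helper)."""
    if num_layers_in_first_pipeline_stage is None:
        # No custom first stage: every earlier rank is a regular stage.
        return pipeline_rank
    # Rank 0 is the custom first stage, so it is not counted as a middle stage.
    return max(pipeline_rank - 1, 0)


def chunk_offset(
    pipeline_rank: int,
    layers_per_middle_chunk: int,
    num_layers_in_first_pipeline_stage: Optional[int],
) -> int:
    """Layers that precede this rank inside one virtual chunk (simplified assumption)."""
    if pipeline_rank == 0:
        return 0
    # Treat a missing first-stage setting as contributing no extra layers.
    first = num_layers_in_first_pipeline_stage or 0
    middle = middle_stage_index(pipeline_rank, num_layers_in_first_pipeline_stage)
    return first + middle * layers_per_middle_chunk
```

With an unconditional `(pipeline_rank - 1)` and no custom first stage, rank 1 would get the same offset as rank 0 in this simplified model; the guarded version avoids that.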
@yanring Hey, I've proposed a simple PR to fix this issue. Could someone review it?