Takuya Kato
Results
1
comments of
Takuya Kato
I bumped into a similar issue when I mistakenly specified the `--num-layers-per-virtual-pipeline-stage` larger than intended. For example, ``` --num-layers=16 --pipeline-model-parallel-size=4 --num-layers-per-virtual-pipeline-stage=4 ``` lead to `virtual_pipeline_model_parallel_size=1`, which doesn't seem to be...