torchtitan icon indicating copy to clipboard operation
torchtitan copied to clipboard

Question about Pipeline parallelism

Open vermouth1992 opened this issue 7 months ago • 5 comments

Just wonder does the current PipelineStage API supports variable length input shapes like in Megatron? https://github.com/NVIDIA/Megatron-LM/blob/e33c8f78a35765d5aa37475a144da60e8a2349d1/megatron/core/model_parallel_config.py#L212 This is particular useful for packed inputs where all the paddings are removed.

vermouth1992 avatar Jun 27 '24 15:06 vermouth1992