Megatron-LM icon indicating copy to clipboard operation
Megatron-LM copied to clipboard

Is CP support with MTP now?

Open Opdoop opened this issue 7 months ago • 1 comments

If CP does not support MTP, how to train long-context with MTP models?

https://github.com/NVIDIA/Megatron-LM/blob/bed7dbd1676f785401779291630dbc0002e0f618/megatron/training/arguments.py#L1006

Opdoop avatar May 21 '25 13:05 Opdoop

It's not supported yet. We are currently developing this feature and it's estimated to be released in MCore v0.13. Tensor parallelism can be used for now to reduce the activation memory. But the performance may be not optimal.

Victarry avatar May 22 '25 05:05 Victarry

Hi @Victarry, when will MCore v0.13 be released?

Opdoop avatar Jun 26 '25 09:06 Opdoop

This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.

github-actions[bot] avatar Jul 27 '25 02:07 github-actions[bot]