**Describe the bug** @tohtana Thanks for your great work on DeepCompile. I am trying to enable DeepCompile for a 2-node training job, but hit the exception below: MemoryProfiling error /pytorch/build/aten/src/ATen/RegisterCUDA.cpp:7280: SymIntArrayRef...
Thanks for supporting Qwen3 models! > CP is not supported currently because of RoPE embedding implementation details. Any plan to support CP + EP for Qwen3 MoE models? If no...
**Describe the bug** The ``CheckpointEngine.commit(info: CheckpointCommitInfo)`` interface does not match the way ``DeepSpeedEngine`` calls it. Line 3527 in ``runtime/engine.py`` should be ``self.checkpoint_engine.commit(commit_info)`` > [rank0]: File "/opt/conda/envs/ptca/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1381, in save_checkpoint > [rank0]: self.strategy.save_checkpoint(checkpoint,...
### 🐛 Describe the bug I am building PyTorch 2.9.1 and torchvision 0.24.1 from source against ROCm 6.2, but hit an error during the torchvision build stage: multiple definition of...