rltakashige
rltakashige
Probably quite a bit easier in EXO 1.0 - just select it from the dropdown.
I've also seen this issue when running DeepSeek V3.1 in pipeline parallel (it does not happen in tensor parallel). Have not encountered this in Qwen Coder, but do want to...
https://github.com/exo-explore/exo/issues/879#issuecomment-3670942858
Seems to be more of an issue when running transformers. This is noted. Please reraise if this is an issue.