rltakashige

Results 4 comments of rltakashige

Probably quite a bit easier in EXO 1.0 - just select it from the dropdown.

I've also seen this issue when running DeepSeek V3.1 in pipeline parallel (it does not happen in tensor parallel). Have not encountered this in Qwen Coder, but do want to...

https://github.com/exo-explore/exo/issues/879#issuecomment-3670942858

Seems to be more of an issue when running transformers. This is noted. Please reraise if this is an issue.