Open-Sora-Plan
Is DeepSpeed-Ulysses the sequence parallel method used in Open-Sora-Plan v1.2.0?
Hi, thank you for the great work!
I was wondering whether DeepSpeed-Ulysses is the sequence parallel method used for both inference and training in Open-Sora-Plan v1.2.0.
(As a side note, you might want to take a look at the hybrid Ring/Ulysses method featured in https://github.com/feifeibear/long-context-attention/tree/main)