aphrodite-engine icon indicating copy to clipboard operation
aphrodite-engine copied to clipboard

[Kernel][Comms] feat: add custom all-gather kernels

Open AlpinDale opened this issue 3 months ago • 1 comments

We don't really use all-gather all that much, but for context parallel, all-gather is used quite a lot. This adds a fair bit of overhead when doing Context Parallelism, sometimes halving the speed. Currently WIP.

CP will land in #1521

AlpinDale avatar Sep 19 '25 05:09 AlpinDale

/gemini review

AlpinDale avatar Sep 19 '25 13:09 AlpinDale