DeepEP

About the number of messages chunked in IBGDA

Open GitAlice123 opened this issue 9 months ago • 2 comments

[Image] Why is the maximum theoretically 3?

GitAlice123 avatar Mar 19 '25 02:03 GitAlice123

Assume the message size (KB level at most) is much smaller than the page size (i.e. NVSHMEM_CUMEM_GRANULARITY, normally very large, >100 MB). Then the worst case when getting the local/remote keys is that the message is split across two pages locally and across two pages remotely, at different offsets, which gives 3 chunks in total.

e.g.

| chunk 0 ---- | chunk 1 ------- | chunk 2 --------- |
| local page i | local page i + 1 ------------------ |
| remote page j ---------------- | remote page j + 1 |
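
The layout above can be checked with a small host-side sketch. This is not DeepEP's code: `count_chunks` is a hypothetical helper, and `page` is an assumed stand-in for NVSHMEM_CUMEM_GRANULARITY. It only counts how many pieces a message falls into when each piece must stay inside one local page and one remote page.

```cpp
#include <algorithm>
#include <cassert>
#include <cstdint>

// Hypothetical helper (not part of DeepEP or NVSHMEM): count the chunks a
// message is cut into when no chunk may cross a local or remote page boundary.
// `page` plays the role of NVSHMEM_CUMEM_GRANULARITY (an assumption here).
int count_chunks(uint64_t local_off, uint64_t remote_off,
                 uint64_t bytes, uint64_t page) {
    assert(page > 0);
    int chunks = 0;
    while (bytes > 0) {
        // Bytes remaining before the next page boundary on each side.
        const uint64_t local_room = page - local_off % page;
        const uint64_t remote_room = page - remote_off % page;
        // A chunk ends at whichever boundary (or the message end) comes first.
        const uint64_t chunk = std::min({bytes, local_room, remote_room});
        local_off += chunk;
        remote_off += chunk;
        bytes -= chunk;
        ++chunks;
    }
    return chunks;
}
```

With a toy page size of 1024 bytes, `count_chunks(994, 954, 100, 1024)` cuts a 100-byte message into pieces of 30, 40, and 30 bytes, i.e. 3 chunks. As long as the message is smaller than a page, it can cross at most one boundary on each side, so the count never exceeds 3. A message larger than a page would simply make the loop run more times.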

LyricZhao avatar Mar 21 '25 08:03 LyricZhao

You can ignore that note, as the while loop can handle more than 3 chunks. We tried some code simplifications and optimizations based on the theoretical maximum, but they didn't work.

LyricZhao avatar Mar 21 '25 08:03 LyricZhao