mars icon indicating copy to clipboard operation
mars copied to clipboard

Ray-based Shuffle For Mars

Open chaokunyang opened this issue 3 years ago • 3 comments

This issue is used for tracking Ray-based Shuffle For Mars progress:

  • [x] Ray Future-based Shuffle MEP:https://github.com/mars-project/meps/pull/2
  • [x] Shuffle Meta optimization #3055
  • [x] Ray Future-based Shuffle
    • [x] Basic Ray Shuffle Implementation #3040
    • [x] Shuffle object GC
    • [x] Shuffle operands compatibility
  • [ ] Push-based Ray Shuffle
    • [ ] Push-based ray shuffle MEP
    • [ ] Scheduling all ray tasks in placement group
    • [ ] Pipelined push-based Shuffle
      • [ ] Push strategy optimization. Skip push shuffle blocks whose size exceed specified size.
      • [ ] Support configure merge factor.
      • [ ] Support configure concurrent running mapper subtasks.
      • [ ] Support adjust merge factor adaptively
      • [ ] Support adjust concurrent running mapper subtasks adaptively to reduce write amplification
  • [ ] Handle shuffle slow sutbask by speculative execution
  • [ ] Dynamic merge shuffle partitions
  • [ ] Dynamic split shuffle partitions
    • [ ] Dynamic detect skewed partitions
    • [ ] Dynamic optimize skewed join
    • [ ] Support sub chunks [ ] Shuffle Failover based lineage reconstruction

chaokunyang avatar Apr 13 '22 07:04 chaokunyang

I've implemented a half-done branch https://github.com/wjsi/mars/tree/feature/shuffle_svc for shuffle. The reason why it is not merged is that currently storage service lacks the capability of remote readers and writers. Now working on it.

wjsi avatar Apr 14 '22 03:04 wjsi

https://github.com/wjsi/mars/tree/feature/shuffle_svc

Is there a design doc for this?

chaokunyang avatar Apr 14 '22 03:04 chaokunyang

https://github.com/wjsi/mars/tree/feature/shuffle_svc

Is there a design doc for this?

Not yet. I'll write it soon.

wjsi avatar Apr 14 '22 03:04 wjsi