risingwave fragmenter: remove 1v1 exchange rule & move the optimization to compute node

Open BugenZhao opened this issue 1 year ago • 5 comments

In #1745 we introduced the "1v1 exchange rewrite", which splits multiple stateful operators into different fragments and then connects them with no-shuffle exchanges. However, this breaks the assumption that every fragment can be scheduled or scaled independently: to scale out one of these fragments, we must either resolve the related upstream/downstream fragments and scale them alongside it, or replace all dispatchers with hash dispatchers. This leads to extra complexity for the meta service and the cloud manager.
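To make the coupling concrete, here is a minimal sketch of what "resolve the related upstream/downstream fragments" could look like: a breadth-first search over no-shuffle edges that collects every fragment that would have to be rescaled together. The names `FragmentId`, `Dispatcher`, and `must_scale_together` are hypothetical and only for illustration; this is not the actual meta service code.

```rust
use std::collections::{BTreeSet, HashMap, VecDeque};

/// Hypothetical fragment identifier (not RisingWave's actual type).
type FragmentId = u32;

/// Hypothetical dispatcher kinds on an edge between two fragments.
#[derive(Clone, Copy, PartialEq, Eq, Debug)]
enum Dispatcher {
    Hash,
    NoShuffle,
}

/// Returns every fragment that must be rescaled together with `start`:
/// the set reachable from `start` through no-shuffle edges, in either direction.
fn must_scale_together(
    edges: &[(FragmentId, FragmentId, Dispatcher)],
    start: FragmentId,
) -> BTreeSet<FragmentId> {
    // Build an undirected adjacency list containing only no-shuffle edges.
    let mut adj: HashMap<FragmentId, Vec<FragmentId>> = HashMap::new();
    for &(up, down, kind) in edges {
        if kind == Dispatcher::NoShuffle {
            adj.entry(up).or_default().push(down);
            adj.entry(down).or_default().push(up);
        }
    }

    // Breadth-first search starting from the fragment we want to scale.
    let mut group = BTreeSet::from([start]);
    let mut queue = VecDeque::from([start]);
    while let Some(f) = queue.pop_front() {
        for &next in adj.get(&f).into_iter().flatten() {
            if group.insert(next) {
                queue.push_back(next);
            }
        }
    }
    group
}
```

With a chain `1 -(hash)-> 2 -(no-shuffle)-> 3 -(no-shuffle)-> 4 -(hash)-> 5`, asking to scale fragment 3 pulls in fragments 2 and 4 as well, while fragment 1 can still be scaled on its own because it is only connected through a hash dispatcher.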

Considering that our purpose is to increase I/O concurrency, and the benchmark shows that compute parallelism is already good enough, I suggest removing this rule (rewrite) from the fragmenter and letting the actor on the compute node decide whether to apply this optimization: multiple stateful executors logically remain in a single actor/fragment, while the actor may join multiple ActorStage to achieve I/O concurrency.

Any ideas are welcome. cc @skyzh @st1page @fuyufjh @shanicky

Update: A more detailed doc describing the issue: https://singularity-data.quip.com/GU2ZAhJdBhCJ/The-Future-of-No-shuffle-Exchange

Aug 12 '22 07:08 BugenZhao

Aug 12 '22 08:08 fuyufjh

Any doc for ActorStage? This batch-streaming naming looks very interesting.

Aug 12 '22 08:08 BowenXiao1999

dup w/ https://github.com/singularity-data/risingwave/issues/3607, I've proposed it long before!

Aug 12 '22 15:08 skyzh

+1. No-shuffle exchange looks bad when there's scale-in / scale-out. If we don't want to handle this as a special case when scaling, this proposal looks good.

Aug 12 '22 15:08 skyzh

may also close https://github.com/singularity-data/risingwave/issues/3607

Aug 12 '22 15:08 skyzh

Closed via #5449.

Sep 28 '22 11:09 BugenZhao
