flink-remote-shuffle icon indicating copy to clipboard operation
flink-remote-shuffle copied to clipboard

Remote Shuffle Service for Flink

Results 30 flink-remote-shuffle issues
Sort by recently updated
recently updated
newest added

### Motivation SortBuffer can improve read performance significantly, but it doesn't support read data from a specific channel. The development of some new functions, for example, ReducePartition implementation, depends on...

### Motivation As described in the document, ReducePartition is a good supplement of the current MapPartition. It has several good features, for example, it can benefit streaming and hybrid shuffle...

### Motivation Currently, the high availability of ShuffleManager depends on the support of external services when it hangs up. In essence, ShuffleManager has a single point problem. We can introduce...

### Motivation Currently, network issues like unstable network may cause task failover which may further lead to reproducing of data. In fact, we can improve the behavior by reconnecting and...

### Motivation For some jobs, data loss and reproduction is not acceptable, data replication is needed to handle data loss in this scenario. ### Changes Allow to config the replication...

### Motivation Support to change log level dynamically can help to debug the shuffle system. ### Changes Add a rest API to both ShuffleManager and ShuffleWorker together with the corresponding...

### Motivation Based on the rest API, we can remove or add disk dynamically without restarting the cluster. For example, we can remove a bad disk or we can add...

We should support pod templates in k8s deployment mode, which will bring great convenience.

The shuffle manager should restore the previously managed workers when re-electing the master. Otherwise, in the next heartbeat cycle, the job will not be available when the worker is requested,...

I want to contribute this feature