celeborn icon indicating copy to clipboard operation
celeborn copied to clipboard

Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.

Results 100 celeborn issues
Sort by recently updated
recently updated
newest added

### What changes were proposed in this pull request? Adding a worker metrics for publish unreleased partition location count when worker was gracefully shutdown. ### Why are the changes needed?...

kind:deploy
module:worker

not a formal PR, just for discussion purposes. ### What changes were proposed in this pull request? Exclude JUnit 5 dependencies to enable running Java unit tests for modules such...

kind:build

### What changes were proposed in this pull request? Introduce `ApplicationTotalCount` and `ApplicationFallbackCount` metric to record the total and fallback count of application. ### Why are the changes needed? There...

…stage when fetch fail has been reported ### What changes were proposed in this pull request? for non barrier shuffle read stage, LifecycleManager#handleGetShuffleIdForApp always return appshuffleId whether fetch status is...

module:client
module:spark
module:common
module:tests

### What changes were proposed in this pull request? support different master/worker nodeSelector and tolerations ### Why are the changes needed? add possibility to run master and worker on different...

### What changes were proposed in this pull request? Introduce Celeborn Chaos Testing Framework. ### Why are the changes needed? A chaos testing framework is designed to simulate unpredictable and...

### What changes were proposed in this pull request? Current version client can't be compatibility with older worker version. ### Why are the changes needed? For backward compatibility. ### Does...

### What changes were proposed in this pull request? Set the max memory threshold to actual memory allocated to the task. This is reverse-calculated from how Spark determines it since...

module:client
module:spark
kind:documentation
module:common

### What changes were proposed in this pull request? Remove unused batch data tracking ### Why are the changes needed? When the optimization to handle skewed partition reads is enabled,...

### What changes were proposed in this pull request? Refactoring shuffleWriters, extract common methods as BasedShuffleWriter. ### Why are the changes needed? Currently, HashBasedShuffleWriter and SortBasedShuffleWriter have a lot of...

module:client
module:spark