celeborn
celeborn copied to clipboard
Apache Celeborn is an elastic and high-performance service for shuffle and spilled data.
### What changes were proposed in this pull request? Adding a worker metrics for publish unreleased partition location count when worker was gracefully shutdown. ### Why are the changes needed?...
not a formal PR, just for discussion purposes. ### What changes were proposed in this pull request? Exclude JUnit 5 dependencies to enable running Java unit tests for modules such...
### What changes were proposed in this pull request? Introduce `ApplicationTotalCount` and `ApplicationFallbackCount` metric to record the total and fallback count of application. ### Why are the changes needed? There...
…stage when fetch fail has been reported ### What changes were proposed in this pull request? for non barrier shuffle read stage, LifecycleManager#handleGetShuffleIdForApp always return appshuffleId whether fetch status is...
### What changes were proposed in this pull request? support different master/worker nodeSelector and tolerations ### Why are the changes needed? add possibility to run master and worker on different...
### What changes were proposed in this pull request? Introduce Celeborn Chaos Testing Framework. ### Why are the changes needed? A chaos testing framework is designed to simulate unpredictable and...
### What changes were proposed in this pull request? Current version client can't be compatibility with older worker version. ### Why are the changes needed? For backward compatibility. ### Does...
### What changes were proposed in this pull request? Set the max memory threshold to actual memory allocated to the task. This is reverse-calculated from how Spark determines it since...
### What changes were proposed in this pull request? Remove unused batch data tracking ### Why are the changes needed? When the optimization to handle skewed partition reads is enabled,...
### What changes were proposed in this pull request? Refactoring shuffleWriters, extract common methods as BasedShuffleWriter. ### Why are the changes needed? Currently, HashBasedShuffleWriter and SortBasedShuffleWriter have a lot of...