Ethan Feng
Using a bitset to record returned chunks can solve this data loss scenario. Most partition splits won't be larger than 8 GB. I think the default value could...
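To make the bitset idea concrete, here is a minimal sketch using `java.util.BitSet`. The class name `ReturnedChunkTracker` and its methods are hypothetical and are not taken from the Celeborn codebase. It assumes chunks are identified by a zero-based index and that the data loss scenario shows up as a chunk that was never recorded, or one that was recorded twice.

```java
import java.util.BitSet;

// Hypothetical helper for illustration only; not Celeborn's actual implementation.
final class ReturnedChunkTracker {
    private final int totalChunks;
    private final BitSet returned;

    ReturnedChunkTracker(int totalChunks) {
        if (totalChunks < 0) {
            throw new IllegalArgumentException("totalChunks must be non-negative");
        }
        this.totalChunks = totalChunks;
        this.returned = new BitSet(totalChunks);
    }

    // Records that a chunk was returned. Returns false if it was already recorded,
    // so the caller can detect a duplicate.
    boolean markReturned(int chunkIndex) {
        if (chunkIndex < 0 || chunkIndex >= totalChunks) {
            throw new IndexOutOfBoundsException("chunkIndex: " + chunkIndex);
        }
        if (returned.get(chunkIndex)) {
            return false;
        }
        returned.set(chunkIndex);
        return true;
    }

    // True once every chunk index in [0, totalChunks) has been recorded.
    boolean allReturned() {
        return returned.cardinality() == totalChunks;
    }

    // Index of the first chunk not yet recorded, or -1 if none is missing.
    int firstMissing() {
        int index = returned.nextClearBit(0);
        return index < totalChunks ? index : -1;
    }
}
```

A bitset costs one bit per chunk, so the tracking overhead stays small even for a split with many chunks.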
@s0nskar Did you forget to update the transport messages proto files?
@ChenYi015 Can you split this PR into several smaller PRs? A PR that changes a single feature is easier for reviewers.
Hi, you can run the following command to refresh the docs.
```
UPDATE=1 build/mvn clean test -pl common -am -Dtest=none -DwildcardSuites=org.apache.celeborn.ConfigurationSuite
```
@AmandeepSingh285 As you can see, the disks are split into groups evenly. Why do you still want to add a disk count weight? ``` int groupSizeSize = (int) Math.ceil(usableDisks.size() /...
@AmandeepSingh285 Thanks for your enthusiasm about this PR, but this PR's functionality can be replaced by tuning the `diskGroupGradient`. I added some calculations below to clarify that you just...
We can set logger levels per component, so this may not be a problem.
Please share some stats about this PR. I was wondering if this PR is helpful for the Spark job's e2e time.
Just hold on a second. The field is skipped for a reason. Let me find out why the field is not included in the proto.
In `org.apache.celeborn.service.deploy.master.clustermeta.AbstractMetaManager#restoreMetaFromFile`, the abstract meta manager tries to resolve the WorkerInfos whose rack is "DEFAULT_RACK".