celeborn/docs/configuration
Saurabh Dubey 7571e10ad5 [CELEBORN-1894] Allow skipping already read chunks during unreplicated shuffle read retried
### What changes were proposed in this pull request?
Whenever a `WorkerPartitionReader` is recreated (due celeborn worker restarts / any other chunk fetch failure), the entire shuffle partition file is re-read from beginning, discarding already read chunks in `CelebornInputStream` based on the batchIdSet metadata maintained.

This can be improved (only for cases where shuffle data is unreplicated) by skipping already read chunk id since they'd be discarded anyway. This improves overall shuffle read performance (reducer's total time, network usage etc).

### Why are the changes needed?
Allow skipping already read shuffle chunks

### Does this PR introduce _any_ user-facing change?
No

### How was this patch tested?
UTs added

Closes #3132 from saurabhd336/skipReadChunks.

Authored-by: Saurabh Dubey <saurabhd336@uber.com>
Signed-off-by: Shuang <lvshuang.xjs@alibaba-inc.com>
2025-03-18 11:37:33 +08:00
..
client.md [CELEBORN-1894] Allow skipping already read chunks during unreplicated shuffle read retried 2025-03-18 11:37:33 +08:00
columnar-shuffle.md [CELEBORN-1051] Add isDynamic property for CelebornConf 2024-02-20 14:20:44 +08:00
ha.md [CELEBORN-1400] Bump Ratis version from 2.5.1 to 3.0.1 2024-05-30 17:22:22 +08:00
index.md [MINOR] Add documentation for CELEBORN_NO_DAEMONIZE 2024-12-23 10:31:37 +08:00
master.md [CELEBORN-1811] Update default value for celeborn.master.slot.assign.extraSlots 2024-12-31 15:37:28 +08:00
metrics.md [CELEBORN-1745] Remove application top disk usage code 2024-11-28 10:55:34 +08:00
network-module.md [CELEBORN-1353] Document Celeborn security - authentication and SSL support 2024-04-30 14:37:56 +08:00
network.md [MINOR] Change config versions 2025-03-11 07:39:32 +08:00
quota.md [CELEBORN-1748] Deprecate identity provider configs tied with quota 2024-12-04 09:28:40 +08:00
worker.md [CELEBORN-1792][FOLLOWUP] Keep resume for a while after resumeByPinnedMemory 2025-03-05 09:37:59 +08:00