celeborn

History

Saurabh Dubey 7571e10ad5 [CELEBORN-1894] Allow skipping already read chunks during unreplicated shuffle read retried ### What changes were proposed in this pull request? Whenever a `WorkerPartitionReader` is recreated (due celeborn worker restarts / any other chunk fetch failure), the entire shuffle partition file is re-read from beginning, discarding already read chunks in `CelebornInputStream` based on the batchIdSet metadata maintained. This can be improved (only for cases where shuffle data is unreplicated) by skipping already read chunk id since they'd be discarded anyway. This improves overall shuffle read performance (reducer's total time, network usage etc). ### Why are the changes needed? Allow skipping already read shuffle chunks ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? UTs added Closes #3132 from saurabhd336/skipReadChunks. Authored-by: Saurabh Dubey <saurabhd336@uber.com> Signed-off-by: Shuang <lvshuang.xjs@alibaba-inc.com>		2025-03-18 11:37:33 +08:00
..
client.md	[CELEBORN-1894] Allow skipping already read chunks during unreplicated shuffle read retried	2025-03-18 11:37:33 +08:00
columnar-shuffle.md	[CELEBORN-1051] Add `isDynamic` property for CelebornConf	2024-02-20 14:20:44 +08:00
ha.md	[CELEBORN-1400] Bump Ratis version from 2.5.1 to 3.0.1	2024-05-30 17:22:22 +08:00
index.md	[MINOR] Add documentation for `CELEBORN_NO_DAEMONIZE`	2024-12-23 10:31:37 +08:00
master.md	[CELEBORN-1811] Update default value for `celeborn.master.slot.assign.extraSlots`	2024-12-31 15:37:28 +08:00
metrics.md	[CELEBORN-1745] Remove application top disk usage code	2024-11-28 10:55:34 +08:00
network-module.md	[CELEBORN-1353] Document Celeborn security - authentication and SSL support	2024-04-30 14:37:56 +08:00
network.md	[MINOR] Change config versions	2025-03-11 07:39:32 +08:00
quota.md	[CELEBORN-1748] Deprecate identity provider configs tied with quota	2024-12-04 09:28:40 +08:00
worker.md	[CELEBORN-1792][FOLLOWUP] Keep resume for a while after resumeByPinnedMemory	2025-03-05 09:37:59 +08:00