celeborn

History

zky.zhoukeyong a42ec85a6e [CELEBORN-943][PERF] Pre-create CelebornInputStreams in CelebornShuffleReader ### What changes were proposed in this pull request? This PR fixes performance degradation when Spark's coalescePartitions takes effect caused by RPC latency. ### Why are the changes needed? I encountered a performance degradation when testing tpcds 10T q10: \|\|Time\| \|---\|---\| \|ESS\|14s\| \|Celeborn\| 24s\| After digging into it I found out that q10 triggers partition coalescence: ![image](https://github.com/apache/incubator-celeborn/assets/948245/0b4745da-8d57-4661-a35d-683d97f56e1d) As I configured `spark.sql.adaptive.coalescePartitions.initialPartitionNum` to 1000, `CelebornShuffleReader` will call `shuffleClient.readPartition` sequentially 1000 times, causing the delay. This PR optimizes by calling `shuffleClient.readPartition` in parallel. After this PR q10 time becomes 14s. ### Does this PR introduce _any_ user-facing change? No, but introduced a new client side configuration `celeborn.client.streamCreatorPool.threads` which defaults to 32. ### How was this patch tested? TPCDS 1T and passes GA. Closes #1876 from waitinfuture/943. Lead-authored-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com> Co-authored-by: Keyong Zhou <waitinfuture@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>		2023-09-04 21:46:11 +08:00
..
client.md	[CELEBORN-943][PERF] Pre-create CelebornInputStreams in CelebornShuffleReader	2023-09-04 21:46:11 +08:00
columnar-shuffle.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
ha.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
index.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
master.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
metrics.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
network.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
quota.md	[CELEBORN-877][DOC] Document on SBT	2023-08-11 12:17:55 +08:00
worker.md	[CELEBORN-920] Worker sends its load to Master through heartbeat	2023-08-26 13:58:37 +08:00