celeborn/docs/configuration
zky.zhoukeyong a42ec85a6e [CELEBORN-943][PERF] Pre-create CelebornInputStreams in CelebornShuffleReader
### What changes were proposed in this pull request?
This PR fixes performance degradation when Spark's coalescePartitions takes effect caused
by RPC latency.

### Why are the changes needed?
I encountered a performance degradation when testing  tpcds 10T q10:
||Time|
|---|---|
|ESS|14s|
|Celeborn| 24s|

After digging into it I found out that q10 triggers partition coalescence:
![image](https://github.com/apache/incubator-celeborn/assets/948245/0b4745da-8d57-4661-a35d-683d97f56e1d)

As I configured `spark.sql.adaptive.coalescePartitions.initialPartitionNum` to 1000, `CelebornShuffleReader`
will call `shuffleClient.readPartition` sequentially 1000 times, causing the delay.

This PR optimizes by calling `shuffleClient.readPartition` in parallel. After this PR q10 time becomes 14s.

### Does this PR introduce _any_ user-facing change?
No, but introduced a new client side configuration `celeborn.client.streamCreatorPool.threads`
which defaults to 32.

### How was this patch tested?
TPCDS 1T and passes GA.

Closes #1876 from waitinfuture/943.

Lead-authored-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>
Co-authored-by: Keyong Zhou <waitinfuture@gmail.com>
Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>
2023-09-04 21:46:11 +08:00
..
client.md [CELEBORN-943][PERF] Pre-create CelebornInputStreams in CelebornShuffleReader 2023-09-04 21:46:11 +08:00
columnar-shuffle.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
ha.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
index.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
master.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
metrics.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
network.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
quota.md [CELEBORN-877][DOC] Document on SBT 2023-08-11 12:17:55 +08:00
worker.md [CELEBORN-920] Worker sends its load to Master through heartbeat 2023-08-26 13:58:37 +08:00