celeborn/tests
mingji 42d5d426a1 [CELEBORN-1071] Support stage rerun for shuffle data lost
### What changes were proposed in this pull request?
If shuffle data is lost and enabled throw fetch failures, triggered stage rerun.

### Why are the changes needed?
Rerun stage for shuffle lost scenarios.

### Does this PR introduce _any_ user-facing change?
NO.

### How was this patch tested?
GA.

Closes #2894 from FMX/b1701.

Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com>
Signed-off-by: Shuang <lvshuang.xjs@alibaba-inc.com>
2024-11-12 10:07:26 +08:00
..
flink-it [CELEBORN-1490][CIP-6] Support process large buffer in flink hybrid shuffle 2024-11-04 16:57:43 +08:00
kubernetes-it [CELEBORN-1565] Introduce warn-unused-import in Scala 2024-08-29 13:43:17 +08:00
mr-it [CELEBORN-1434] Support MRAppMasterWithCeleborn to disable job recovery and job reduce slow start by default 2024-05-22 15:32:41 +08:00
spark-it [CELEBORN-1071] Support stage rerun for shuffle data lost 2024-11-12 10:07:26 +08:00