celeborn/assets
Xianming Lei 999510b265 [CELEBORN-1444] Introduce worker decommission metrics and corresponding REST API
### What changes were proposed in this pull request?

Introduce worker decommission metrics and corresponding REST API.

### Why are the changes needed?

In a production environment, due to certain hardware or environmental reasons, our script will automatically decommission the node. At this time, we need to distinguish between graceful shutdown nodes and decommissioned nodes.

If we distinguish shutdown worker and decommission worker metrics, we can achieve better operation and maintenance.

### Does this PR introduce _any_ user-facing change?

Yes.

### How was this patch tested?

- `DefaultMetaSystemSuiteJ#testHandleReportWorkerDecommission`
- `RatisMasterStatusSystemSuiteJ#testHandleReportWorkerDecommission`
- `ApiMasterResourceSuite#decommissionWorkers`
- `ApiWorkerResourceSuite#isDecommissioning`

Closes #2535 from leixm/issue_1444.

Lead-authored-by: Xianming Lei <jerrylei@apache.org>
Co-authored-by: Xianming Lei <31424839+leixm@users.noreply.github.com>
Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>
2024-06-08 11:10:31 +08:00
..
diagram [CELEBORN-746][BUILD] Rename project files from rss-xx to celeborn-xx 2023-06-29 16:30:02 +08:00
grafana [CELEBORN-1444] Introduce worker decommission metrics and corresponding REST API 2024-06-08 11:10:31 +08:00
img [CELEBORN-749] Update grafana dashboard to remove "RSS" 2023-06-29 20:44:09 +08:00
spark-patch [CELEBORN-858] Generate patch to each Spark 3.x minor version 2023-11-10 15:35:54 +08:00