celeborn/assets
Sanskar Modi 80bdb46801 [CELEBORN-1892] Adding register with master fail count metric for worker
### What changes were proposed in this pull request?

Add a worker metric that counts failed attempts to register with the master.

### Why are the changes needed?

This makes it possible to monitor cases where workers cannot register with the master, for example when wrong endpoints are passed or the master becomes unavailable.

### Does this PR introduce _any_ user-facing change?

No

### How was this patch tested?
Tested on a local setup.

<img width="724" alt="Screenshot 2025-06-04 at 10 44 56 AM" src="https://github.com/user-attachments/assets/1f84557b-5df8-422f-b602-bb5316a72a0e" />

Closes #3308 from s0nskar/worker_register_metric.

Authored-by: Sanskar Modi <sanskarmodi97@gmail.com>
Signed-off-by: Wang, Fei <fwang12@ebay.com>
2025-06-11 11:04:59 -07:00
| Name | Last commit | Date |
| --- | --- | --- |
| diagram | [CELEBORN-746][BUILD] Rename project files from rss-xx to celeborn-xx | 2023-06-29 16:30:02 +08:00 |
| grafana | [CELEBORN-1892] Adding register with master fail count metric for worker | 2025-06-11 11:04:59 -07:00 |
| spark-patch | [CELEBORN-1719][FOLLOWUP] Rename throwsFetchFailure to stageRerunEnabled | 2025-06-11 19:33:19 +08:00 |