celeborn

Author	SHA1	Message	Date
SteNicholas	52eddc59f3	[CELEBORN-448] Support exclude worker manually ### What changes were proposed in this pull request? Support manually excluding a worker by its worker id. The worker is added to the excluded workers. ### Why are the changes needed? At present, the Celeborn shuffle client excludes workers when a fetch or push fails. Manual exclusion is also needed for maintaining the Celeborn cluster. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? - `HttpUtilsSuite` - `DefaultMetaSystemSuiteJ#testHandleWorkerExclude` - `RatisMasterStatusSystemSuiteJ#testHandleWorkerExclude` - `MasterStateMachineSuiteJ#testObjSerde` Closes #1997 from SteNicholas/CELEBORN-448. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-11-07 16:25:24 +08:00
sychen	efa22a4936	[CELEBORN-1105][FLINK] Support Flink 1.18 ### What changes were proposed in this pull request? ### Why are the changes needed? ```bash flink-1.18.0 ./bin/start-cluster.sh ./bin/flink run examples/streaming/WordCount.jar --execution-mode BATCH ``` ```java Caused by: java.lang.NoSuchMethodError: org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate.<init>(Ljava/lang/String;ILorg/apache/flink/runtime/jobgraph/IntermediateDataSetID;Lorg/apache/flink/runtime/io/network/partition/ResultPartitionType;Lorg/apache/flink/runtime/executiongraph/IndexRange;ILorg/apache/flink/runtime/io/network/partition/PartitionProducerStateProvider;Lorg/apache/flink/util/function/SupplierWithException;Lorg/apache/flink/runtime/io/network/buffer/BufferDecompressor;Lorg/apache/flink/core/memory/MemorySegmentProvider;ILorg/apache/flink/runtime/throughput/ThroughputCalculator;Lorg/apache/flink/runtime/throughput/BufferDebloater;)V at org.apache.celeborn.plugin.flink.RemoteShuffleInputGate$FakedRemoteInputChannel.<init>(RemoteShuffleInputGate.java:225) at org.apache.celeborn.plugin.flink.RemoteShuffleInputGate.getChannel(RemoteShuffleInputGate.java:179) at org.apache.flink.runtime.io.network.partition.consumer.InputGate.setChannelStateWriter(InputGate.java:90) at org.apache.flink.runtime.taskmanager.InputGateWithMetrics.setChannelStateWriter(InputGateWithMetrics.java:120) at org.apache.flink.streaming.runtime.tasks.StreamTask.injectChannelStateWriterIntoChannels(StreamTask.java:524) at org.apache.flink.streaming.runtime.tasks.StreamTask.<init>(StreamTask.java:496) ``` Flink 1.18.0 release https://flink.apache.org/2023/10/24/announcing-the-release-of-apache-flink-1.18/ Interface `org.apache.flink.runtime.io.network.buffer.Buffer` adds `setRecycler` method. 
[[FLINK-32549](https://issues.apache.org/jira/browse/FLINK-32549)][network] Tiered storage memory manager supports ownership transfer for buffers `org.apache.flink.runtime.io.network.partition.consumer.SingleInputGate` constructor adds parameters. [[FLINK-31638](https://issues.apache.org/jira/browse/FLINK-31638)][network] Introduce the TieredStorageConsumerClient to SingleInputGate [[FLINK-31642](https://issues.apache.org/jira/browse/FLINK-31642)][network] Introduce the MemoryTierConsumerAgent to TieredStorageConsumerClient ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ```bash flink-1.18.0 ./bin/flink run examples/streaming/WordCount.jar --execution-mode BATCH Executing example with default input data. Use --input to specify file input. Printing result to stdout. Use --output to specify output path. Job has been submitted with JobID d7fc5f0ca018a54e9453c4d35f7c598a Program execution finished Job with JobID d7fc5f0ca018a54e9453c4d35f7c598a has finished. Job Runtime: 1635 ms ``` Closes #2063 from cxzl25/CELEBORN-1105. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: Shuang <lvshuang.tb@gmail.com>	2023-11-06 15:53:39 +08:00
joey.ljy	455cd40137	[CELEBORN-1111] Supporting connection to HDFS with Kerberos authentication enabled ### What changes were proposed in this pull request? Adding Kerberos support for HDFS storage type. The following five parameters need to be configured (key = value): `celeborn.storage.hdfs.kerberos.enabled` = `true`; `celeborn.storage.hdfs.kerberos.principal` = `user@REALM`; `celeborn.storage.hdfs.kerberos.keytab` = `/path/test.keytab`; `celeborn.hadoop.hadoop.security.authorization` = `kerberos`; `celeborn.hadoop.dfs.namenode.kerberos.principal` = `hdfs/_HOST@REALM`. ### Why are the changes needed? Connecting to HDFS with Kerberos enabled requires support for keytab login. ### Does this PR introduce _any_ user-facing change? Add 3 configurations: `celeborn.storage.hdfs.kerberos.enabled`, `celeborn.storage.hdfs.kerberos.principal`, `celeborn.storage.hdfs.kerberos.keytab`. ### How was this patch tested? Test in Kerberos enabled HDFS cluster. Closes #2072 from liujiayi771/hdfs-kerberos. Authored-by: joey.ljy <joey.ljy@alibaba-inc.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-11-04 17:21:41 +08:00
mingji	5e77b851c9	[CELEBORN-1081] Client support `celeborn.storage.activeTypes` config ### What changes were proposed in this pull request? 1. Support `celeborn.storage.activeTypes` in the client. 2. Master will ignore slots for "UNKNOWN_DISK". ### Why are the changes needed? Enable client applications to select which storage types to use. ### Does this PR introduce _any_ user-facing change? Yes. ### How was this patch tested? GA and cluster. Closes #2045 from FMX/B1081. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: Shuang <lvshuang.tb@gmail.com>	2023-11-03 20:03:11 +08:00
Chandni Singh	c8b5384baf	[CELEBORN-1107] Make the max default number of netty threads configurable ### What changes were proposed in this pull request? This change makes the maximum default number of Netty threads configurable. Previously, this value was hardcoded to 64, which could be small for certain environments. While it's possible to configure the number of Netty server and client threads individually for each module, providing an option to increase the default value offers greater convenience. ### Why are the changes needed? The change offers convenience. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Added a UT Closes #2065 from otterc/CELEBORN-1107. Authored-by: Chandni Singh <singh.chandni@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-11-03 13:18:44 +08:00
onebox-li	7b185a2562	[CELEBORN-1058] Support specifying the number of dispatcher threads for each role ### What changes were proposed in this pull request? Support specifying the number of dispatcher threads for each role, especially on the shuffle client side. For the shuffle client, there is only the RpcEndpointVerifier endpoint, which handles few requests, so one thread is enough. The rpc env of the other roles has at most two endpoints, so using a shared event loop is reasonable. I am not sure whether rpc requests will need to be added to the shuffle client, so this adds specific parameters to specify the dispatcher threads. It also changes the dispatcher thread pool name to distinguish it from Spark's. ### Why are the changes needed? Ditto ### Does this PR introduce _any_ user-facing change? Yes, add params celeborn.\<role>.rpc.dispatcher.threads ### How was this patch tested? Manual test and UT Closes #2003 from onebox-li/my_dev. Authored-by: onebox-li <lyh-36@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-11-03 10:35:54 +08:00
SteNicholas	4e8e8c2310	[CELEBORN-1094] Optimize mechanism of ChunkManager expired shuffle key cleanup to avoid memory leak ### What changes were proposed in this pull request? The `cleaner` of `Worker` executes `StorageManager#cleanupExpiredShuffleKey` on a daemon cached thread pool to clean expired shuffle keys. This speeds up cleanup, including that of the expired shuffle keys of ChunkManager, to avoid a memory leak. ### Why are the changes needed? `ChunkManager#streams` can leak memory when a worker cleans up expired shuffles more slowly than they expire. The cleanup of expired shuffle keys in `ChunkStreamManager` should be optimized to avoid this memory leak, which drives the worker's VM thread to 100%. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? `WorkerSuite#clean up`. Closes #2053 from SteNicholas/CELEBORN-1094. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-11-02 15:46:07 +08:00
sychen	e437228dc8	[CELEBORN-1104][DOC] Fix SBT documentation incorrect command ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #2062 from cxzl25/CELEBORN-1104. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-11-01 17:00:09 +08:00
SteNicholas	b45b63f9a5	[CELEBORN-247][FOLLOWUP] Add metrics for each user's quota usage of Celeborn Worker ### What changes were proposed in this pull request? Add the metric `ResourceConsumption` to monitor each user's quota usage of Celeborn Worker. ### Why are the changes needed? At present, the metric `ResourceConsumption` monitors each user's quota usage on the Celeborn Master. The usage on the Celeborn Worker also needs to be monitored. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Internal tests. Closes #2059 from SteNicholas/CELEBORN-247. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-11-01 15:48:31 +08:00
onebox-li	320714bf24	[CELEBORN-1089] Seperate overHighWatermark check to a dedicated thread ### What changes were proposed in this pull request? Separate the `overHighWatermark` check into a dedicated thread, so that this value can be shared more easily and the `CongestionController#isUserCongested` logic becomes lighter. ### Why are the changes needed? Ditto. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Manual test and UT. Closes #2041 from onebox-li/congest-check. Authored-by: onebox-li <lyh-36@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-11-01 09:51:24 +08:00
SteNicholas	3092644168	[CELEBORN-1095] Support configuration of fastest available XXHashFactory instance for checksum of Lz4Decompressor ### What changes were proposed in this pull request? `CelebornConf` adds `celeborn.client.shuffle.decompression.lz4.xxhash.instance` to configure the fastest available `XXHashFactory` instance for the checksum of `Lz4Decompressor`. Fix #2043. ### Why are the changes needed? `Lz4Decompressor` creates the checksum with `XXHashFactory#fastestInstance`, which returns the fastest available `XXHashFactory` instance and uses the native instance by default. Making the `XXHashFactory` instance used for the checksum of `Lz4Decompressor` configurable removes the dependency on whether the class loader is the system class loader. The method is as follows: ``` /** * Returns the fastest available {@link XXHashFactory} instance. If the class * loader is the system class loader and if the * {@link #nativeInstance() native instance} loads successfully, then the * {@link #nativeInstance() native instance} is returned, otherwise the * {@link #fastestJavaInstance() fastest Java instance} is returned. * <p> * Please read {@link #nativeInstance() javadocs of nativeInstance()} before * using this method. * * @return the fastest available {@link XXHashFactory} instance. */ public static XXHashFactory fastestInstance() { if (Native.isLoaded() || Native.class.getClassLoader() == ClassLoader.getSystemClassLoader()) { try { return nativeInstance(); } catch (Throwable t) { return fastestJavaInstance(); } } else { return fastestJavaInstance(); } } ``` ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? - `CelebornConfSuite` - `ConfigurationSuite` Closes #2050 from SteNicholas/CELEBORN-1095. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: xiyu.zk <xiyu.zk@alibaba-inc.com>	2023-10-31 14:57:31 +08:00
Fu Chen	349ee8b1cb	Revert "[CELEBORN-255] Add counter of outstandingFetches, outstanding… …Rpcs and outstandingPushes to metrics" This reverts commit `bfa341c32f`. ### What changes were proposed in this pull request? ### Why are the changes needed? https://github.com/apache/incubator-celeborn/pull/1992#issuecomment-1776760369 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #2032 from cfmcgrady/revert-pr-1992. Authored-by: Fu Chen <cfmcgrady@gmail.com> Signed-off-by: Fu Chen <cfmcgrady@gmail.com>	2023-10-24 17:18:54 +08:00
SteNicholas	11c90d8e72	[CELEBORN-916] Add new metric about active shuffle file count in worker ### What changes were proposed in this pull request? Adds a new metric `ActiveShuffleFileCount` for the active shuffle file count of Celeborn Worker. ### Why are the changes needed? At present, the `ActiveShuffleSize` metric reports the active shuffle size of the peer worker. Therefore, it's better to introduce `ActiveShuffleFileCount` to report the active shuffle file count of Celeborn Worker. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Internal tests. Closes #2009 from SteNicholas/CELEBORN-916. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-10-23 11:15:18 +08:00
SteNicholas	7276dd024c	[CELEBORN-1035] Expose RunningApplicationCount, PartitionWritten and PartitionFileCount metric by Celeborn master ### What changes were proposed in this pull request? Meta manager records `appHeartbeatTime`, `partitionTotalWritten` and `partitionTotalFileCount`, which are useful to monitor the application heartbeat and shuffle partition. `RunningApplicationCount`, `PartitionWritten` and `PartitionFileCount` metrics are exposed by Celeborn master to monitor the application and shuffle partition. ### Why are the changes needed? `Master` exposes `RunningApplicationCount`, `PartitionWritten` and `PartitionFileCount` metrics. ### Does this PR introduce _any_ user-facing change? None. ### How was this patch tested? Internal tests. Closes #1976 from SteNicholas/CELEBORN-1035. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-10-19 22:07:17 +08:00
mingji	69defcad7f	[CELEBORN-1021] Celeborn support arbitary Ratis configs and client rpc timeout ### What changes were proposed in this pull request? 1. To support arbitrary Ratis configs 2. To support Ratis client rpc timeout ### Why are the changes needed? After some digging, I found that Celeborn never changed the default config of the Ratis client's timeout. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? GA and cluster. Closes #1969 from FMX/CELEBORN-1021. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: Shuang <lvshuang.tb@gmail.com>	2023-10-18 10:26:11 +08:00
sunjunjie	03498ce46b	[CELEBORN-1046] Add an expiration time configuration for app directory to clean up ### What changes were proposed in this pull request? Add a configuration "celeborn.worker.storage.expireDirs.timeout" with a default value of 6h in rsswork. This configuration is used to set the expiration time for app local directories. https://issues.apache.org/jira/browse/CELEBORN-1046 ### Why are the changes needed? When Celeborn periodically deletes the directories of apps, it determines whether the app needs to be deleted based on the shuffleKeySet in memory. However, this method may not accurately indicate the completion of the app and could potentially lead to the unintentional deletion of shuffle data. ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1998 from wilsonjie/CELEBORN-1046. Authored-by: sunjunjie <sunjunjie@zto.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-17 19:23:49 +08:00
SteNicholas	bfa341c32f	[CELEBORN-255] Add counter of outstandingFetches, outstandingRpcs and outstandingPushes to metrics ### What changes were proposed in this pull request? Add counter of `outstandingFetches`, `outstandingRpcs` and `outstandingPushes` of `TransportResponseHandler` to metrics of Celeborn Worker. ### Why are the changes needed? The counter of `outstandingFetches`, `outstandingRpcs` and `outstandingPushes` of `TransportResponseHandler` could be added to metrics to monitor `outstandingFetches`, `outstandingRpcs` and `outstandingPushes`. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? `TransportResponseHandlerSuiteJ` Closes #1992 from SteNicholas/CELEBORN-255. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-16 21:16:57 +08:00
sychen	a8ac18f2e8	[CELEBORN-299] Deprecate `celeborn.worker.storage.baseDir.prefix` and `celeborn.worker.storage.baseDir.number` ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? ``` 23/10/16 03:31:13,399 WARN [pool-1-thread-1-ScalaTest-running-CelebornConfSuite] CelebornConf: The configuration key 'celeborn.worker.storage.baseDir.prefix' has been deprecated in v0.4.0 and may be removed in the future. Please use celeborn.worker.storage.dirs 23/10/16 03:31:13,399 WARN [pool-1-thread-1-ScalaTest-running-CelebornConfSuite] CelebornConf: The configuration key 'celeborn.worker.storage.baseDir.number' has been deprecated in v0.4.0 and may be removed in the future. Please use celeborn.worker.storage.dirs ``` Closes #1993 from cxzl25/CELEBORN-299. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-16 19:10:13 +08:00
SteNicholas	f2d6cc7525	[CELEBORN-829] Improve response message of invalid HTTP request ### What changes were proposed in this pull request? Improve response message of invalid HTTP request, which lists available API providers like as below: - master ``` Invalid uri of the master. Available API providers include: /applications List all running application's ids of the cluster. /conf List the conf setting of the master. /excludedWorkers List all excluded workers of the master. /help List the available API providers of the master. /hostnames List all running application's LifecycleManager's hostnames of the cluster. /listTopDiskUsedApps List the top disk usage application ids. It will return the top disk usage application ids for the cluster. /lostWorkers List all lost workers of the master. /masterGroupInfo List master group information of the service. It will list all master's LEADER, FOLLOWER information. /shuffles List all running shuffle keys of the service. It will return all running shuffle's key of the cluster. /shutdownWorkers List all shutdown workers of the master. /threadDump List the current thread dump of the master. /workerInfo List worker information of the service. It will list all registered workers 's information. ``` - worker ``` Invalid uri of the worker. Available API providers include: /conf List the conf setting of the worker. /exit Trigger this worker to exit. Legal types are 'DECOMMISSION‘, 'GRACEFUL' and 'IMMEDIATELY' /help List the available API providers of the worker. /isRegistered Show if the worker is registered to the master success. /isShutdown Show if the worker is during the process of shutdown. /listPartitionLocationInfo List all the living PartitionLocation information in that worker. /listTopDiskUsedApps List the top disk usage application ids. It only return application ids running in that worker. /shuffles List all the running shuffle keys of the worker. It only return keys of shuffles running in that worker. 
/threadDump List the current thread dump of the worker. /unavailablePeers List the unavailable peers of the worker, this always means the worker connect to the peer failed. /workerInfo List the worker information of the worker. ``` ### Why are the changes needed? Response message of invalid HTTP request could not help users with correct HTTP path. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? `HttpUtilsSuite#CELEBORN-829: Improve response message of invalid HTTP request` Closes #1986 from SteNicholas/CELEBORN-829. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-16 16:37:51 +08:00
SteNicholas	f61fe17551	[CELEBORN-987][FOLLOWUP][DOC] README#Build and sbt#System Requirements should extend to Scala 2.13 and Spark 3.5 ### What changes were proposed in this pull request? `README#Build` and `sbt#System Requirements` extend to Scala 2.13. ### Why are the changes needed? `README#Build` and `sbt#System Requirements` should extend to Scala 2.13 to align with the SBT CI test results. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? SBT CI tests. Closes #1987 from SteNicholas/CELEBORN-987. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: Fu Chen <cfmcgrady@gmail.com>	2023-10-14 09:54:22 +08:00
sychen	dd65e74f99	[CELEBORN-983] Rename PrometheusMetric configuration ### What changes were proposed in this pull request? Replace ```properties celeborn.metrics.master.prometheus.host celeborn.metrics.master.prometheus.port celeborn.metrics.worker.prometheus.host celeborn.metrics.worker.prometheus.port ``` With ```properties celeborn.master.http.host celeborn.master.http.port celeborn.worker.http.host celeborn.worker.http.port ``` ### Why are the changes needed? The ports bound by `celeborn.metrics.master.prometheus.port` and `celeborn.metrics.worker.prometheus.port` not only serve Prometheus metrics but also provide some useful API services. https://celeborn.apache.org/docs/latest/monitoring/#rest-api ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1919 from cxzl25/CELEBORN-983. Lead-authored-by: sychen <sychen@ctrip.com> Co-authored-by: Keyong Zhou <zhouky@apache.org> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-13 13:28:58 +08:00
onebox-li	a47f6169d8	[MINOR] Fix some typos ### What changes were proposed in this pull request? Fix some typos ### Why are the changes needed? Ditto ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? - Closes #1983 from onebox-li/fix-typo. Authored-by: onebox-li <lyh-36@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-12 20:34:07 +08:00
sychen	9c07ceddb0	[CELEBORN-1028][FOLLOWUP][DOCS] Make prometheus path configurable ### What changes were proposed in this pull request? ### Why are the changes needed? https://github.com/apache/incubator-celeborn/pull/1965#issuecomment-1755345813 ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1974 from cxzl25/CELEBORN-1028-FOLLOWUP. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-10 22:59:22 +08:00
sychen	bcf89da7dd	[MINOR] Fix typo in CelebornConf ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1971 from cxzl25/typo. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-10 20:04:16 +08:00
sychen	f6d27609b8	[CELEBORN-1028] Make prometheus path configurable ### What changes were proposed in this pull request? `celeborn.metrics.prometheus.path` ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1965 from cxzl25/CELEBORN-1028. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-10 18:37:44 +08:00
mingji	95c9ccfc3e	[CELEBORN-1010] Update docs about `spark.shuffle.service.enabled` ### What changes were proposed in this pull request? To clarify a spark config to work with Celeborn. ### Why are the changes needed? After some tests, I found that Spark 3.1 and newer can work with Celeborn with `spark.shuffle.service.enabled=true`. ExternalShuffleBlockResolver won't check the shuffle manager's type since Spark 3.1 and newer. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? I tested two scenarios about this PR. 1. Check whether Spark can release the executors in time. 2. Check data correctness by running TPC-DS. All checks are good. Closes #1955 from FMX/CELEBORN-1010. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-10-08 09:15:42 +08:00
Cheng Pan	84ef527181	[CELEBORN-1007][FOLLOWUP][DOCS] Update Migration Guide ### What changes were proposed in this pull request? Mention metrics name change in Migration Guide ### Why are the changes needed? https://github.com/apache/incubator-celeborn/pull/1939 ### Does this PR introduce _any_ user-facing change? Yes, docs updated. ### How was this patch tested? Review. Closes #1950 from pan3793/CELEBORN-1007-followup. Authored-by: Cheng Pan <chengpan@apache.org> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-28 21:08:11 +08:00
sychen	5310bcaf6b	[CELEBORN-313] Add rest endpoint to show master group info ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? ```bash ./bin/celeborn-ratis sh -Draft.rpc.type=NETTY group info -peers clb-1:9872,clb-2:9873,clb-3:9874 ``` ``` group id: c5196f6d-2c34-3ed3-8b8a-47bede733167 leader info: 1(clb-1:9872) [server { id: "1" address: "clb-1:9872" clientAddress: "clb-1:9097" startupRole: FOLLOWER } commitIndex: 316 , server { id: "2" address: "clb-2:9873" clientAddress: "clb-2:9098" startupRole: FOLLOWER } commitIndex: 316 , server { id: "3" address: "clb-3:9874" clientAddress: "clb-3:9099" startupRole: FOLLOWER } commitIndex: 316 ] ``` ```bash curl http://clb-3:9983/masterGroupInfo ``` ``` ====================== Master Group INFO ============================== group id: c5196f6d-2c34-3ed3-8b8a-47bede733167 leader info: 1(clb-1:9872) [server { id: "3" address: "clb-3:9874" clientAddress: "clb-3:9099" startupRole: FOLLOWER } commitIndex: 316 , server { id: "1" address: "clb-1:9872" clientAddress: "clb-1:9097" startupRole: FOLLOWER } commitIndex: 316 , server { id: "2" address: "clb-2:9873" clientAddress: "clb-2:9098" startupRole: FOLLOWER } commitIndex: 316 ] ``` Closes #1946 from cxzl25/CELEBORN-313. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-28 20:08:31 +08:00
Cheng Pan	e4a60d15e4	[CELEBORN-909][FOLLOWUP][DOCS] Restore titles in migration guide ### What changes were proposed in this pull request? Restore titles in migration guide ### Why are the changes needed? Make title in migration guide consistent. ### Does this PR introduce _any_ user-facing change? Yes, docs changed. ### How was this patch tested? Pass GA. Closes #1949 from pan3793/CELEBORN-909-followup. Authored-by: Cheng Pan <chengpan@apache.org> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-28 20:04:53 +08:00
Cheng Pan	ab68a4ae1b	[MINOR] Fix configuration version ### What changes were proposed in this pull request? Change the `.version("0.3.2")` to `.version("0.3.1")` ### Why are the changes needed? 0.3.1 is not released yet. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? Pass GA. Closes #1948 from pan3793/minor-version. Lead-authored-by: Cheng Pan <chengpan@apache.org> Co-authored-by: Cheng Pan <pan3793@gmail.com> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-28 19:58:06 +08:00
sychen	3e515c5d2e	[CELEBORN-1009][DOC] CELEBORN_PREFER_JEMALLOC ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1944 from cxzl25/CELEBORN-1009. Lead-authored-by: sychen <sychen@ctrip.com> Co-authored-by: Keyong Zhou <zhouky@apache.org> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-27 23:14:27 +08:00
sychen	42f08ca21a	[CELEBORN-985] Change default value of numConnectionsPerPeer to 1 ### What changes were proposed in this pull request? ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1943 from cxzl25/CELEBORN-985. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-27 22:50:23 +08:00
xleoken	83a92fd1f1	[MINOR] Remove unexpected $ symbol ### What changes were proposed in this pull request? Remove unexpected $ symbol in README doc ### Why are the changes needed? throw error ``` bash: export: `=/opt': not a valid identifier ``` ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1942 from xleoken/patch. Authored-by: xleoken <leo65535@163.com> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-27 19:51:26 +08:00
Fu Chen	c775089c4b	[CELEBORN-988][FOLLOWUP] Rename config key `celeborn.worker.sortPartition.lazyRemovalOfOriginalFiles.enabled` ### What changes were proposed in this pull request? 1. Rename the config key from `celeborn.worker.sortPartition.lazyRemovalOfOriginalFiles.enabled` to `celeborn.worker.sortPartition.eagerlyRemoveOriginalFiles.enabled` 2. Make this config an internal config ### Why are the changes needed? Make the config key clearer. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Pass GA Closes #1934 from cfmcgrady/celeborn-988-followup. Authored-by: Fu Chen <cfmcgrady@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-24 22:28:32 +08:00
jiaoqingbo	f1713dacaf	[MINOR] Fix incorrect default resume ratio in trafficcontrol doc ### What changes were proposed in this pull request? As Title ### Why are the changes needed? Since 0.3.1, Celeborn changed the default value of `celeborn.worker.directMemoryRatioToResume` from `0.5` to `0.7`. The doc should be updated. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? PASS GA Closes #1931 from jiaoqingbo/ratiofix. Authored-by: jiaoqingbo <1178404354@qq.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-21 11:18:48 +08:00
sychen	bb50618780	[CELEBORN-997][DOC] Fix Rolling upgrade broken link ### What changes were proposed in this pull request? https://celeborn.apache.org/docs/latest/developers/overview/ > For more details, please refer to Rolling upgrade ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1927 from cxzl25/CELEBORN-997. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-09-20 16:44:42 +08:00
sychen	b2b7c4d359	[CELEBORN-991][DOC] Remove incorrect `spark.metrics.conf` ### What changes were proposed in this pull request? 1. Replace `spark.metrics.conf` with `celeborn.metrics.conf`. 2. Fix broken links. https://celeborn.apache.org/docs/latest/monitoring/#metrics ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1925 from cxzl25/CELEBORN-991. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-20 09:03:27 +08:00
Fu Chen	1e49ff76f3	[CELEBORN-988] Add config option to control original unsorted file deletion in `PartitionFilesSorter` ### What changes were proposed in this pull request? This PR adds a new configuration option, `celeborn.worker.sortPartition.lazyRemovalOfOriginalFiles.enabled`, allowing users to control whether the `PartitionFilesSorter` deletes the original unsorted file. ### Why are the changes needed? https://github.com/apache/incubator-celeborn/pull/1907#issuecomment-1723420513 ### Does this PR introduce _any_ user-facing change? Users have the option to prevent the `PartitionSorter` from deleting the original unsorted file by configuring `celeborn.worker.sortPartition.lazyRemovalOfOriginalFiles.enabled = false`. ### How was this patch tested? Pass GA Closes #1922 from cfmcgrady/make-delete-configurable. Authored-by: Fu Chen <cfmcgrady@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-19 11:14:51 +08:00
sychen	beed2a85b0	[CELEBORN-977] Support RocksDB as recover DB backend ### What changes were proposed in this pull request? ### Why are the changes needed? LevelDB does not support mac arm version. ```java java.lang.UnsatisfiedLinkError: Could not load library. Reasons: [no leveldbjni64-1.8 in java.library.path, no leveldbjni-1.8 in java.library.path, no leveldbjni in java.library.path, /private/var/folders/tc/r2n_8g6j4731h7clfqwntg880000gn/T/libleveldbjni-64-1-4616234670453989010.8: dlopen(/private/var/folders/tc/r2n_8g6j4731h7clfqwntg880000gn/T/libleveldbjni-64-1-4616234670453989010.8, 0x0001): tried: '/private/var/folders/tc/r2n_8g6j4731h7clfqwntg880000gn/T/libleveldbjni-64-1-4616234670453989010.8' (fat file, but missing compatible architecture (have 'x86_64,i386', need 'arm64')), '/System/Volumes/Preboot/Cryptexes/OS/private/var/folders/tc/r2n_8g6j4731h7clfqwntg880000gn/T/libleveldbjni-64-1-4616234670453989010.8' (no such file), '/private/var/folders/tc/r2n_8g6j4731h7clfqwntg880000gn/T/libleveldbjni-64-1-4616234670453989010.8' (fat file, but missing compatible architecture (have 'x86_64,i386', need 'arm64'))] at org.fusesource.hawtjni.runtime.Library.doLoad(Library.java:182) at org.fusesource.hawtjni.runtime.Library.load(Library.java:140) at org.fusesource.leveldbjni.JniDBFactory.<clinit>(JniDBFactory.java:48) at org.apache.celeborn.service.deploy.worker.shuffledb.LevelDBProvider.initLevelDB(LevelDBProvider.java:49) at org.apache.celeborn.service.deploy.worker.shuffledb.DBProvider.initDB(DBProvider.java:30) at org.apache.celeborn.service.deploy.worker.storage.StorageManager.<init>(StorageManager.scala:197) at org.apache.celeborn.service.deploy.worker.Worker.<init>(Worker.scala:109) at org.apache.celeborn.service.deploy.worker.Worker$.main(Worker.scala:734) at org.apache.celeborn.service.deploy.worker.Worker.main(Worker.scala) ``` The released `leveldbjni-all` for `org.fusesource.leveldbjni` does not support AArch64 Linux, we need to use 
`org.openlabtesting.leveldbjni`. See https://issues.apache.org/jira/browse/HADOOP-16614 ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? local test Closes #1913 from cxzl25/CELEBORN-977. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: Cheng Pan <chengpan@apache.org>	2023-09-19 09:20:33 +08:00
sychen	4d35e501a3	[CELEBORN-984][DOC] shutdownWorkers API documentation ### What changes were proposed in this pull request? https://celeborn.apache.org/docs/latest/monitoring/#master_1 `07c1dc2568/service/src/main/scala/org/apache/celeborn/server/common/http/HttpRequestHandler.scala (L74-L75)` ### Why are the changes needed? ### Does this PR introduce _any_ user-facing change? ### How was this patch tested? Closes #1920 from cxzl25/CELEBORN-984. Authored-by: sychen <sychen@ctrip.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-18 19:58:11 +08:00
Shuang	615479c442	[CELEBORN-468] Timeout useless lostWorkers/shutdownWorkers meta ### What changes were proposed in this pull request? As title ### Why are the changes needed? If a Worker is lost, or is lost after a graceful shutdown, the Master retains the corresponding lostWorkers/shutdownWorkers meta permanently. This meta causes noisy messages in LifecycleManager, so it is better to delete it after a while. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? UT & E2E test Closes #1916 from RexXiong/CELEBORN-468. Authored-by: Shuang <lvshuang.tb@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-18 18:39:43 +08:00
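The expiry idea in CELEBORN-468 can be illustrated with a minimal sketch. This is not the Master's actual implementation: the class `LostWorkerExpiry`, its methods, and the timeout value are all hypothetical names chosen for illustration. It only shows the general technique of recording when a worker was marked lost and dropping entries older than a timeout.

```java
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical illustration only; not Celeborn's real metadata classes.
class LostWorkerExpiry {
    // workerId -> time (ms) at which the worker was recorded as lost
    private final Map<String, Long> lostWorkers = new ConcurrentHashMap<>();
    private final long timeoutMs;

    LostWorkerExpiry(long timeoutMs) {
        this.timeoutMs = timeoutMs;
    }

    void markLost(String workerId, long nowMs) {
        lostWorkers.put(workerId, nowMs);
    }

    boolean isTracked(String workerId) {
        return lostWorkers.containsKey(workerId);
    }

    // Removes entries that have been lost for longer than timeoutMs
    // and returns how many entries were removed.
    int expire(long nowMs) {
        int before = lostWorkers.size();
        lostWorkers.entrySet().removeIf(e -> nowMs - e.getValue() > timeoutMs);
        return before - lostWorkers.size();
    }

    public static void main(String[] args) {
        LostWorkerExpiry meta = new LostWorkerExpiry(1000L);
        meta.markLost("worker-1", 0L);
        meta.markLost("worker-2", 900L);
        int removed = meta.expire(1500L);
        System.out.println("removed=" + removed + ", worker-2 tracked=" + meta.isTracked("worker-2"));
    }
}
```

Passing the current time in as a parameter, rather than reading the clock inside `expire`, keeps the sketch deterministic and easy to test.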
jiaoqingbo	107f3df8ba	[CELEBORN-979] Reduce default disk Check Interval ### What changes were proposed in this pull request? Reduce default disk Check Interval ### Why are the changes needed? Since https://github.com/apache/incubator-celeborn/pull/1909 added check logic for the DiskInfo status in the PushDataHandler#checkDiskFull method, the default disk check interval should be reduced. ### Does this PR introduce _any_ user-facing change? NO ### How was this patch tested? PASS GA Closes #1915 from jiaoqingbo/979. Authored-by: jiaoqingbo <1178404354@qq.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-18 14:54:22 +08:00
mingji	cb9adfc511	[CELEBORN-974] Add quick start guide about using MapReduce with Celeborn ### What changes were proposed in this pull request? Add quick start guide about using MapReduce with Celeborn. ### Why are the changes needed? Celeborn supports MapReduce client recently. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? No need to test. Closes #1908 from FMX/CELEBORN-974. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-09-14 19:31:01 +08:00
mingji	e0c00ecd38	[CELEBORN-839][MR] Support Hadoop MapReduce ### What changes were proposed in this pull request? 1. Map side merge and push. 2. Support hadoop2 & 3. 3. Reduce in-memory merge. 4. Integrate LifecycleManager to RmApplicationMaster. ### Why are the changes needed? Ditto. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? Cluster. I tested this PR on a cluster with a 4x 16 CPU 64G Mem 4ESSD cluster. Hadoop 2.8.5 1TB Terasort, 8400 mappers, 1000 reducers Celeborn 81min vs MR shuffle 89min ![mr1](https://github.com/apache/incubator-celeborn/assets/4150993/a3cf6493-b6ff-4c03-9936-4558cf22761d) ![mr2](https://github.com/apache/incubator-celeborn/assets/4150993/9119ffb4-6996-4b77-bcdf-cbd6db5c096f) 1GB wordcount, 8 mappers, 8 reducers Celeborn 35s VS MR shuffle 38s ![mr3](https://github.com/apache/incubator-celeborn/assets/4150993/907dce24-16b7-4788-ab5d-5b784fd07d47) ![mr4](https://github.com/apache/incubator-celeborn/assets/4150993/8e8065b9-6c46-4c8d-9e71-45eed8e63877) Closes #1830 from FMX/CELEBORN-839. Lead-authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Co-authored-by: Ethan Feng <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-14 14:12:53 +08:00
zwangsheng	03a39819b5	[CELEBORN-882][WORKER][METRICS] Add `Pause Push Data Time Count` Metrics & Dashboard Panel ### What changes were proposed in this pull request? Add `PausePushDataTime ` Metrics ### Why are the changes needed? Count each celeborn worker pause time. ### Does this PR introduce _any_ user-facing change? No ### How was this patch tested? Cluster Test Closes #1800 from zwangsheng/CELEBORN-882. Lead-authored-by: zwangsheng <2213335496@qq.com> Co-authored-by: zwangsheng <binjieyang@apache.org> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-12 17:45:26 +08:00
mingji	17cfbd7dc7	[CELEBORN-948][DOC] fix quick start doc about failed to submit flink wordcount ### What changes were proposed in this pull request? Update the script to start word count demo. ### Why are the changes needed? A user reported that he could not run the demo while following the quick start docs. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? Cluster. Closes #1880 from FMX/CELEBORN-948. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-05 17:44:16 +08:00
zky.zhoukeyong	a42ec85a6e	[CELEBORN-943][PERF] Pre-create CelebornInputStreams in CelebornShuffleReader ### What changes were proposed in this pull request? This PR fixes performance degradation when Spark's coalescePartitions takes effect caused by RPC latency. ### Why are the changes needed? I encountered a performance degradation when testing tpcds 10T q10: \|\|Time\| \|---\|---\| \|ESS\|14s\| \|Celeborn\| 24s\| After digging into it I found out that q10 triggers partition coalescence: ![image](https://github.com/apache/incubator-celeborn/assets/948245/0b4745da-8d57-4661-a35d-683d97f56e1d) As I configured `spark.sql.adaptive.coalescePartitions.initialPartitionNum` to 1000, `CelebornShuffleReader` will call `shuffleClient.readPartition` sequentially 1000 times, causing the delay. This PR optimizes by calling `shuffleClient.readPartition` in parallel. After this PR q10 time becomes 14s. ### Does this PR introduce _any_ user-facing change? No, but introduced a new client side configuration `celeborn.client.streamCreatorPool.threads` which defaults to 32. ### How was this patch tested? TPCDS 1T and passes GA. Closes #1876 from waitinfuture/943. Lead-authored-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com> Co-authored-by: Keyong Zhou <waitinfuture@gmail.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-09-04 21:46:11 +08:00
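The change described in CELEBORN-943, replacing sequential per-partition calls with parallel ones, can be sketched roughly as follows. This is a simplified illustration and not the code from the PR: `ParallelStreamCreation`, `createAll`, and the `opener` function are hypothetical stand-ins, with `opener` playing the role of a per-partition call such as `shuffleClient.readPartition`. The pool size corresponds loosely to what `celeborn.client.streamCreatorPool.threads` configures.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.concurrent.Callable;
import java.util.concurrent.ExecutorService;
import java.util.concurrent.Executors;
import java.util.concurrent.Future;
import java.util.function.IntFunction;

// Hypothetical illustration only; not the actual CelebornShuffleReader code.
class ParallelStreamCreation {

    // Opens one "stream" per partition in [startPartition, endPartition)
    // using a fixed-size pool, and returns results in partition order.
    static <T> List<T> createAll(
            int startPartition, int endPartition, int threads, IntFunction<T> opener)
            throws Exception {
        ExecutorService pool = Executors.newFixedThreadPool(threads);
        try {
            List<Future<T>> futures = new ArrayList<>();
            for (int p = startPartition; p < endPartition; p++) {
                final int partitionId = p;
                Callable<T> task = () -> opener.apply(partitionId);
                futures.add(pool.submit(task));
            }
            // Collect in submission order so the output stays ordered by partition id.
            List<T> results = new ArrayList<>(futures.size());
            for (Future<T> f : futures) {
                results.add(f.get());
            }
            return results;
        } finally {
            pool.shutdown();
        }
    }

    public static void main(String[] args) throws Exception {
        List<String> streams = createAll(0, 8, 4, id -> "stream-" + id);
        System.out.println(streams);
    }
}
```

With sequential calls the total latency is roughly the sum of the per-call RPC latencies; with a pool of N threads it drops to roughly that sum divided by N, which is the effect the commit message reports for q10.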
zhongqiang.czq	b66eaff880	[CELEBORN-627][FLINK] Support split partitions ### What changes were proposed in this pull request? In MapPartition, data is split into regions. 1. Unlike ReducePartition, whose partition split can occur while pushing data, MapPartition must keep its data ordered, so a partition split can only be done when sending PushDataHandShake or RegionStart messages (as shown in the following image). That is, a partition split can only appear at the beginning of a region, never inside a region. > Notice: the client side may think that it failed to push a HandShake or RegionStart message even though the worker side received the message normally. After the client revives successfully, it does not push any more messages to the old partition, so the worker holding the old partition will create an empty file. After committing files, the worker will return empty commit IDs. That is, empty files are filtered out after committing files, and ReduceTask will not read any empty files. ![image](https://github.com/apache/incubator-celeborn/assets/96606293/468fd660-afbc-42c1-b111-6643f5c1e944) 2. PushData/RegionFinish do not react to the following cases: - Diskfull - ExceedPartitionSplitThreshold - Worker ShuttingDown. If one of these three conditions appears, PushData and RegionFinish can still proceed as normal. Workers should take the ShuttingDown case into account and do their best to wait for all regions to finish before shutting down. If PushData or RegionFinish fails, for example because of a network timeout, the MapTask will fail and another MapTask attempt will be started. ![image](https://github.com/apache/incubator-celeborn/assets/96606293/db9f9166-2085-4be1-b09e-cf73b469c55b) 3. How does shuffle read support partition split? ReduceTask should get the split partitions in order and open the streams in partition epoch order. ### Why are the changes needed? PartitionSplit is not yet supported by MapPartition. 
There is still a risk that a partition file is too large to be stored on the worker disk. To avoid this risk, this PR introduces partition split in shuffle read and shuffle write. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? UT and manual TPCDS test Closes #1550 from FMX/CELEBORN-627. Lead-authored-by: zhongqiang.czq <zhongqiang.czq@alibaba-inc.com> Co-authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Co-authored-by: Ethan Feng <ethanfeng@apache.org> Signed-off-by: zhongqiang.czq <zhongqiang.czq@alibaba-inc.com>	2023-09-01 19:25:51 +08:00
mingji	2ee6e305f1	[CELEBORN-941] fix incorrect deploy doc ### What changes were proposed in this pull request? Fix the incorrect deploy doc about using HDFS only. ### Why are the changes needed? Ditto. ### Does this PR introduce _any_ user-facing change? NO. ### How was this patch tested? Just docs. Closes #1874 from FMX/CELEBORN-941. Authored-by: mingji <fengmingxiao.fmx@alibaba-inc.com> Signed-off-by: mingji <fengmingxiao.fmx@alibaba-inc.com>	2023-08-31 18:54:27 +08:00
SteNicholas	baaddb8ee8	[CELEBORN-822][DOC] Introduce a quick start guide for running Apache Flink with Apache Celeborn ### What changes were proposed in this pull request? Introduce a quick start guide for running Apache Flink with Apache Celeborn to help Flink users to run with Celeborn. ### Why are the changes needed? There is no quick start guide for running Apache Flink with Apache Celeborn. ### Does this PR introduce _any_ user-facing change? No. ### How was this patch tested? None. Closes #1868 from SteNicholas/CELEBORN-822. Authored-by: SteNicholas <programgeek@163.com> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>	2023-08-30 21:38:03 +08:00


248 Commits