### What changes were proposed in this pull request? Change Celeborn Master URL from `rss://<host>:<port>` to `celeborn://<host>:<port>` ### Why are the changes needed? Respect the project name. ### Does this PR introduce _any_ user-facing change? Yes, migration guide is updated accordingly. ### How was this patch tested? Pass GA. Closes #1624 from pan3793/CELEBORN-715. Authored-by: Cheng Pan <chengpan@apache.org> Signed-off-by: zky.zhoukeyong <zky.zhoukeyong@alibaba-inc.com>
3.5 KiB
| hide | license | |
|---|---|---|
|
Licensed to the Apache Software Foundation (ASF) under one or more contributor license agreements. See the NOTICE file distributed with this work for additional information regarding copyright ownership. The ASF licenses this file to You under the Apache License, Version 2.0 (the "License"); you may not use this file except in compliance with the License. You may obtain a copy of the License at http://www.apache.org/licenses/LICENSE-2.0 Unless required by applicable law or agreed to in writing, software distributed under the License is distributed on an "AS IS" BASIS, WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied. See the License for the specific language governing permissions and limitations under the License. |
Migration Guide
Upgrading from 0.2.1 to 0.3.0
-
From 0.3.0 on the default value for
celeborn.client.push.replicate.enabledis changed fromtruetofalse, users who want replication on should explicitly enable replication. For example, to enable replication for Spark users should add the spark config when submitting job:spark.celeborn.client.push.replicate.enabled=true -
From 0.3.0 on the default value for
celeborn.worker.storage.workingDiris changed fromhadoop/rss-worker/shuffle_datatorss-worker/shuffle_data, users who want to use origin working dir path should set this configuration. -
Since 0.3.0, configuration namespace
celeborn.ha.masteris deprecated, and will be removed in the future versions. All configurationsceleborn.ha.master.*should migrate toceleborn.master.ha.*. -
Since 0.3.0, environment variables
CELEBORN_MASTER_HOSTandCELEBORN_MASTER_PORTare removed. InsteadCELEBORN_LOCAL_HOSTNAMEworks on both master and worker, which takes high priority than configurations defined in properties file. -
Since 0.3.0, the Celeborn Master URL schema is changed from
rss://toceleborn://, for users who start Worker bysbin/start-worker.sh rss://<master-host>:<master-port>, should migrate tosbin/start-worker.sh celeborn://<master-host>:<master-port>. -
When using 0.2.1 as client side and 0.3.0 as server side, you may see the following Exception in LifecycleManger's log. You can safely ignore the log, it's caused by the behavior change when Master receives heartbeat from Application.
??? warning "logs"
23/06/20 18:12:30 WARN TransportChannelHandler: Exception in connection from /192.168.1.16:9097 java.io.InvalidObjectException: enum constant HEARTBEAT_FROM_APPLICATION_RESPONSE does not exist in class org.apache.celeborn.common.protocol.MessageType at java.io.ObjectInputStream.readEnum(ObjectInputStream.java:2157) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1662) at java.io.ObjectInputStream.defaultReadFields(ObjectInputStream.java:2430) at java.io.ObjectInputStream.readSerialData(ObjectInputStream.java:2354) at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2212) at java.io.ObjectInputStream.readObject0(ObjectInputStream.java:1668) at java.io.ObjectInputStream.readObject(ObjectInputStream.java:502) at java.io.ObjectInputStream.readObject(ObjectInputStream.java:460) at org.apache.celeborn.common.serializer.JavaDeserializationStream.readObject(JavaSerializer.scala:76) at org.apache.celeborn.common.serializer.JavaSerializerInstance.deserialize(JavaSerializer.scala:110)