From ae85c245d2ea53ff4d7a155a9ca68aead39ef02b Mon Sep 17 00:00:00 2001
From: Kent Yao <11215016@zju.edu.cn>
Date: Mon, 25 Feb 2019 12:33:48 +0800
Subject: [PATCH] Prepare releasing v0.5.0 (#152)

* Prepare releasing 0.5.0

* update documentations
---
 docs/architecture.md            |  6 +++---
 docs/authentication.md          |  2 +-
 docs/authorization.md           |  2 +-
 docs/configurations.md          |  8 ++++----
 docs/high_availability_guide.md | 16 ++++++++--------
 kyuubi-server/pom.xml           |  2 +-
 pom.xml                         |  2 +-
 7 files changed, 19 insertions(+), 19 deletions(-)

diff --git a/docs/architecture.md b/docs/architecture.md
index c662c08bc..ee83b78de 100644
--- a/docs/architecture.md
+++ b/docs/architecture.md
@@ -105,9 +105,9 @@ a selected registered one. This feature offers:
 Name|Default|Description
 ---|---|---
-spark.kyuubi.ha.enabled|false|Whether KyuubiServer supports dynamic service discovery for its clients. To support this, each instance of KyuubiServer currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: spark.kyuubi.ha.zk.quorum in their connection string.
-spark.kyuubi.ha.zk.quorum|none|Comma separated list of ZooKeeper servers to talk to, when KyuubiServer supports service discovery via Zookeeper.
-spark.kyuubi.ha.zk.namespace|kyuubiserver|The parent node in ZooKeeper used by KyuubiServer when supporting dynamic service discovery.
+spark.kyuubi.ha.enabled|false|Whether KyuubiServer supports dynamic service discovery for its clients. To support this, each instance of KyuubiServer currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: spark.kyuubi.ha.zk.quorum in their connection string.
+spark.kyuubi.ha.zk.quorum|none|Comma separated list of ZooKeeper servers to talk to, when KyuubiServer supports service discovery via Zookeeper.
+spark.kyuubi.ha.zk.namespace|kyuubiserver|The parent node in ZooKeeper used by KyuubiServer when supporting dynamic service discovery.
 # Kyuubi Internal
diff --git a/docs/authentication.md b/docs/authentication.md
index e60292ab7..3951d3457 100644
--- a/docs/authentication.md
+++ b/docs/authentication.md
@@ -5,7 +5,7 @@ Kyuubi supports Anonymous (no authentication) with and without SASL, Kerberos (G
 Name|Default|Description
 ---|---|---
-spark.kyuubi.authentication | NONE | Authentication mode, default NONE. Options are NONE (uses plain SASL), NOSASL, KERBEROS, LDAP.
+spark.kyuubi.authentication | NONE | Authentication mode, default NONE. Options are NONE (uses plain SASL), NOSASL, KERBEROS, LDAP.
 #### NONE
 ###### Server
diff --git a/docs/authorization.md b/docs/authorization.md
index b5053c6f8..691e071a6 100644
--- a/docs/authorization.md
+++ b/docs/authorization.md
@@ -12,7 +12,7 @@ Three primary modes for Kyuubi authorization are available by [spark-authorizer]

Storage-Based Authorization

-Enabling Storage Based Authorization in the `Hive Metastore Server` uses the HDFS permissions to act as the main source for verification and allows for consistent data and metadata authorization policy. This allows control over metadata access by verifying if the user has permission to access corresponding directories on the HDFS. Similar with `HiveServer2`, files and directories will be tanslated into hive metadata objects, such as dbs, tables, partitions, and be protected from end user's queries through Kyuubi.
+Enabling Storage Based Authorization in the `Hive Metastore Server` uses the HDFS permissions to act as the main source for verification and allows for consistent data and metadata authorization policy. This allows control over metadata access by verifying if the user has permission to access corresponding directories on the HDFS. Similar with `HiveServer2`, files and directories will be translated into hive metadata objects, such as dbs, tables, partitions, and be protected from end user's queries through Kyuubi.
 Storage-Based Authorization offers users with Database, Table and Partition-level coarse-gained access control.
diff --git a/docs/configurations.md b/docs/configurations.md
index 1c04218cc..3887c5be5 100644
--- a/docs/configurations.md
+++ b/docs/configurations.md
@@ -44,7 +44,7 @@ spark.kyuubi.ha.zk.connection.max.retries|3|Max retry times for connecting
 Name|Default|Description
 ---|---|---
 spark.kyuubi.logging.operation.enabled|true|When true, Kyuubi Server will save operation logs and make them available for clients
-spark.kyuubi.logging.operation.log.dir|`KYUUBI_LOG_DIR` -> `java.io.tmpdir`/operation_logs|Top level directory where operation logs are stored if logging functionality is enabled
+spark.kyuubi.logging.operation.log.dir|KYUUBI_LOG_DIR/operation_logs|Top level directory where operation logs are stored if logging functionality is enabled
 #### Frontend Service options
@@ -86,8 +86,8 @@ spark.kyuubi.backend.session.wait.other.interval|1s|The interval for check
 spark.kyuubi.backend.session.init.timeout|60s|How long we suggest the server to give up instantiating SparkContext.
 spark.kyuubi.backend.session.check.interval|5min|The check interval for backend session a.k.a SparkSession timeout.
 spark.kyuubi.backend.session.idle.timeout|30min|SparkSession timeout.
-spark.kyuubi.backend.session.local.dir|KYUUBI_HOME/local|Default value to set `spark.local.dir`. For YARN mode, this only affect the Kyuubi server side settings according to the rule of Spark treating `spark.local.dir`.
-spark.kyuubi.backend.session.long.cache|${UserGroupInformation.isSecurityEnabled}|Whether to update the tokens of Spark's executor to support long caching SparkSessions iff this is true && `spark.kyuubi.backend.token.update.class` is loadable. This is used towards kerberized hadoop clusters in case of `spark.kyuubi.backend.session.idle.timeout` is set longer than token expiration time limit or SparkSession never idles.
+spark.kyuubi.backend.session.local.dir|KYUUBI_HOME/local|Default value to set `spark.local.dir`. For YARN mode, this only affect the Kyuubi server side settings according to the rule of Spark treating `spark.local.dir`.
+spark.kyuubi.backend.session.long.cache|${UserGroupInformation.isSecurityEnabled}|Whether to update the tokens of Spark's executor to support long caching SparkSessions iff this is true && `spark.kyuubi.backend.token.update.class` is loadable. This is used towards kerberized hadoop clusters in case of `spark.kyuubi.backend.session.idle.timeout` is set longer than token expiration time limit or SparkSession never idles.
 spark.kyuubi.backend.session.token.update.class|org.apache.spark.scheduler.cluster.CoarseGrainedClusterMessages$UpdateDelegationTokens|`CoarseGrainedClusterMessages` for token update message from the driver of Spark to executors, it is loadable only by higher version Spark release(2.3 and later)
@@ -119,7 +119,7 @@ spark.driver.extraJavaOptions| (none) | A string of extra JVM options to pass to
 Spark use netty as RPC between driver and executor, Kyuubi Server may need much bigger directory memory size.
 ```properties
-spark.driver.extraJavaOptions -XX:PermSize=1024m -XX:MaxPermSize=1024m -XX:MaxDirectMemorySize=4096m
+spark.driver.extraJavaOptions -XX:+PrintFlagsFinal -XX:+UnlockDiagnosticVMOptions -XX:ParGCCardsPerStrideChunk=4096 -XX:+UseParNewGC -XX:+UseConcMarkSweepGC -XX:+CMSConcurrentMTEnabled -XX:CMSInitiatingOccupancyFraction=70 -XX:+UseCMSInitiatingOccupancyOnly -XX:+CMSClassUnloadingEnabled -XX:+CMSParallelRemarkEnabled -XX:+UseCondCardMark -XX:PermSize=1024m -XX:MaxPermSize=1024m -XX:MaxDirectMemorySize=8192m -XX:+HeapDumpOnOutOfMemoryError -XX:HeapDumpPath=./logs -XX:OnOutOfMemoryError="kill -9 %p" -verbose:gc -XX:+PrintGCDetails -XX:+PrintGCDateStamps -XX:+PrintTenuringDistribution -Xloggc:./logs/kyuubi-server-gc-%t.log -XX:+UseGCLogFileRotation -XX:NumberOfGCLogFiles=50 -XX:GCLogFileSize=5M -XX:NewRatio=3 -Dio.netty.noPreferDirect=true -Dio.netty.recycler.maxCapacity=0
 ```
 Spark properties for [Driver](http://spark.apache.org/docs/latest/configuration.html#runtime-environment) like those above controls Kyuubi Server's own behaviors, while other properties could be set in JDBC/ODBC connection strings.
diff --git a/docs/high_availability_guide.md b/docs/high_availability_guide.md
index a296fe1b9..32c1755c4 100644
--- a/docs/high_availability_guide.md
+++ b/docs/high_availability_guide.md
@@ -46,14 +46,14 @@ This section describes how to configure high availability. These configurations
 Name|Default|Description
 ---|---|---
-spark.kyuubi.ha.enabled|false|Whether KyuubiServer supports dynamic service discovery for its clients. To support this, each instance of KyuubiServer currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: spark.kyuubi.ha.zk.quorum in their connection string.
-spark.kyuubi.ha.mode|load-balance|High availability mode, one is load-balance which is used by default, another is failover as master-slave mode.
-spark.kyuubi.ha.zk.quorum|none|Comma separated list of ZooKeeper servers to talk to, when KyuubiServer supports service discovery via Zookeeper.
-spark.kyuubi.ha.zk.namespace|kyuubiserver|The parent node in ZooKeeper used by KyuubiServer when supporting dynamic service discovery.
-spark.kyuubi.ha.zk.client.port|2181|The port of ZooKeeper servers to talk to. If the list of Zookeeper servers specified in spark.kyuubi.zookeeper.quorum does not contain port numbers, this value is used.
-spark.kyuubi.ha.zk.session.timeout|1,200,000|ZooKeeper client's session timeout (in milliseconds). The client is disconnected, and as a result, all locks released, if a heartbeat is not sent in the timeout.
-spark.kyuubi.ha.zk.connection.basesleeptime|1,000|Initial amount of time (in milliseconds) to wait between retries when connecting to the ZooKeeper server when using ExponentialBackoffRetry policy.
-spark.kyuubi.ha.zk.connection.max.retries|3|Max retry times for connecting to the zk server
+spark.kyuubi.ha.enabled|false|Whether KyuubiServer supports dynamic service discovery for its clients. To support this, each instance of KyuubiServer currently uses ZooKeeper to register itself, when it is brought up. JDBC/ODBC clients should use the ZooKeeper ensemble: spark.kyuubi.ha.zk.quorum in their connection string.
+spark.kyuubi.ha.mode|load-balance|High availability mode, one is load-balance which is used by default, another is failover as master-slave mode.
+spark.kyuubi.ha.zk.quorum|none|Comma separated list of ZooKeeper servers to talk to, when KyuubiServer supports service discovery via Zookeeper.
+spark.kyuubi.ha.zk.namespace|kyuubiserver|The parent node in ZooKeeper used by KyuubiServer when supporting dynamic service discovery.
+spark.kyuubi.ha.zk.client.port|2181|The port of ZooKeeper servers to talk to. If the list of Zookeeper servers specified in spark.kyuubi.zookeeper.quorum does not contain port numbers, this value is used.
+spark.kyuubi.ha.zk.session.timeout|1,200,000|ZooKeeper client's session timeout (in milliseconds). The client is disconnected, and as a result, all locks released, if a heartbeat is not sent in the timeout.
+spark.kyuubi.ha.zk.connection.basesleeptime|1,000|Initial amount of time (in milliseconds) to wait between retries when connecting to the ZooKeeper server when using ExponentialBackoffRetry policy.
+spark.kyuubi.ha.zk.connection.max.retries|3|Max retry times for connecting to the zk server
 ## Additional Documentations
 [Building Kyuubi](https://yaooqinn.github.io/kyuubi/docs/building.html)
diff --git a/kyuubi-server/pom.xml b/kyuubi-server/pom.xml
index 1bacda502..21d820ed0 100644
--- a/kyuubi-server/pom.xml
+++ b/kyuubi-server/pom.xml
@@ -22,7 +22,7 @@
 kyuubi
 yaooqinn
- 0.5.0-SNAPSHOT
+ 0.5.0
 4.0.0
diff --git a/pom.xml b/pom.xml
index a6e8a092b..75e8bab41 100644
--- a/pom.xml
+++ b/pom.xml
@@ -23,7 +23,7 @@
 yaooqinn
 kyuubi
 Kyuubi Parent
- 0.5.0-SNAPSHOT
+ 0.5.0
 kyuubi-server