kyuubi/docs/deployment
Fei Wang e8445b7f28 [KYUUBI #2102] Support to retry the internal thrift request call and add engine liveness probe to enable fast fail before retry
### _Why are the changes needed?_

To close #2102

Retry all internal Thrift request calls (except RenewDelegationToken for now), and fail fast if the remote engine is unstable or not alive.

This PR also adds an engine liveness probe.

If the probe is enabled, a companion Thrift client is created and opens a liveness probe session when the remote engine session is opened.
The companion client sends a simple Thrift request (GetInfo) to check whether the remote engine is alive, and the call fails fast before retrying if the remote engine is not reachable.
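
The control flow described above can be sketched roughly as follows. This is a minimal illustration in Python, not Kyuubi's actual code: `call_with_retry`, `probe_alive`, `EngineNotAliveError`, and the parameter names are all hypothetical, and `probe_alive` stands in for the GetInfo request sent by the companion client.

```python
import time


class EngineNotAliveError(Exception):
    """Hypothetical error: the liveness probe could not reach the engine."""


def call_with_retry(request, probe_alive, max_attempts=3, wait_seconds=0.0):
    """Run `request`; after a failure, retry only if `probe_alive()` is true.

    `request` and `probe_alive` are plain callables in this sketch. In the
    setup described above, the probe would be a GetInfo call issued by a
    separate companion client.
    """
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return request()
        except Exception as error:
            last_error = error
            if attempt == max_attempts:
                break  # no retries left, surface the last failure
            # Probe only before a retry, never before the first attempt.
            if not probe_alive():
                raise EngineNotAliveError("engine is not alive") from error
            time.sleep(wait_seconds)
    raise last_error
```

With a probe that reports a dead engine, the first failure ends the call immediately instead of consuming the remaining attempts.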

#### Why not use the same thrift client to check engine liveness before retry?
I tried that, but hit an `out of resp sequence` error.
For example:
1. send getOperationStatus request
2. read times out
3. send GetInfoType request
4. receive getOperationStatus response (out of resp sequence)
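
The four steps above can be reproduced with a toy model. The sketch below is a simplified illustration, not Thrift's implementation: `Connection` and `Client` are hypothetical classes that assume each connection returns responses in request order and that the client checks a per-request sequence ID.

```python
from collections import deque


class Connection:
    """Toy ordered channel: responses come back in the order requests were sent."""

    def __init__(self):
        self.pending = deque()

    def send(self, seq_id, name):
        self.pending.append((seq_id, name))

    def receive(self):
        return self.pending.popleft()


class Client:
    """Toy client that tags each request with an increasing sequence ID."""

    def __init__(self, conn):
        self.conn = conn
        self.seq_id = 0

    def send(self, name):
        self.seq_id += 1
        self.conn.send(self.seq_id, name)
        return self.seq_id

    def receive(self, expected_seq_id):
        seq_id, name = self.conn.receive()
        if seq_id != expected_seq_id:
            raise RuntimeError(
                f"out of sequence: got {name} (seq {seq_id}), "
                f"expected seq {expected_seq_id}"
            )
        return name


def probe_on_shared_client():
    client = Client(Connection())
    client.send("GetOperationStatus")    # step 1; step 2: the read times out
    probe_seq = client.send("GetInfo")   # step 3: probe on the same connection
    return client.receive(probe_seq)     # step 4: stale response arrives first


def probe_on_companion_client():
    main = Client(Connection())
    main.send("GetOperationStatus")      # this read times out as before
    companion = Client(Connection())     # separate connection for the probe
    probe_seq = companion.send("GetInfo")
    return companion.receive(probe_seq)
```

In this model `probe_on_shared_client()` raises because the unread getOperationStatus response is still queued ahead of the probe's response, while `probe_on_companion_client()` succeeds because the probe has its own connection.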

### _How was this patch tested?_
- [x] Add some test cases that check the changes thoroughly including negative and positive cases if possible

- [ ] Add screenshots for manual tests if appropriate

- [x] [Run test](https://kyuubi.apache.org/docs/latest/develop_tools/testing.html#running-tests) locally before making a pull request

Closes #2122 from turboFei/retry_rpc.

Closes #2102

3926ba04 [Fei Wang] adress comments
ade4ede6 [Fei Wang] add timeout
1b7a64f9 [Fei Wang] Only check remote engine alive before retry
98e03f8e [Fei Wang] refactor
fac388cf [Fei Wang] remove unused import
9c6d8737 [Fei Wang] add ut
9b595650 [Fei Wang] Support to retry the thrift request and engine alive probe

Authored-by: Fei Wang <fwang12@ebay.com>
Signed-off-by: Fei Wang <fwang12@ebay.com>
2022-03-26 00:23:44 +08:00
| Name | Last commit | Date |
| --- | --- | --- |
| spark | [KYUUBI #1215][DOC] Document incremental collection | 2022-03-08 20:49:51 +08:00 |
| engine_lifecycle.md | [KYUUBI #1557] [DOC] The TTL Of Kyuubi Engines | 2021-12-15 10:02:02 +08:00 |
| engine_on_kubernetes.md | [KYUUBI #1454] Add Doc About Deploy Kyuubi On Kubernetes | 2021-11-29 14:15:19 +08:00 |
| engine_on_yarn.md | [KYUUBI #1866][FOLLOWUP] Add Deploy Kyuubi Flink engine on Yarn | 2022-03-14 14:45:44 +08:00 |
| engine_share_level.md | [KYUUBI #1399] [DOCS] Add doc for engine share level | 2021-11-16 19:58:25 +08:00 |
| high_availability_guide.md | [KYUUBI #1527] [DOC] Improve High Availability Guide | 2021-12-09 13:58:00 +08:00 |
| hive_metastore.md | [KYUUBI #1347] [DOC] Fix miscellaneous doc typos | 2021-11-08 09:27:37 +08:00 |
| index.rst | [KYUUBI #1557] [DOC] The TTL Of Kyuubi Engines | 2021-12-15 10:02:02 +08:00 |
| kyuubi_on_kubernetes.md | [KYUUBI #1876] Set Spark 3.2.1 as default | 2022-02-16 00:54:25 +08:00 |
| settings.md | [KYUUBI #2102] Support to retry the internal thrift request call and add engine liveness probe to enable fast fail before retry | 2022-03-26 00:23:44 +08:00 |