1099 Commits

Author SHA1 Message Date
liugs0213
e0aece9340
[CELEBORN-343] Add configuration for k8s cluster name (#1269) 2023-02-27 17:51:02 +08:00
Rex(Hui) An
798ff90bb7
[CELEBORN-342] Fix the wrong avg produce bytes in Congestion control (#1279) 2023-02-27 16:29:37 +08:00
Shuang
935806f036
[CELEBORN-341][Flink] cache file group for map partition in Flink plugin (#1277) 2023-02-26 20:31:20 +08:00
Keyong Zhou
3c8c58e09d
[CELEBORN-301] Refactor PartitionLocationInfo to use ConcurrentHashMap (#1278) 2023-02-26 16:46:30 +08:00
Ethan Feng
f0b9236ff2
[CELEBORN-340][FLINK] Reuse file channels in map partition read. (#1276) 2023-02-24 19:26:51 +08:00
Ethan Feng
da292fa369
[CELEBORN-339][FLINK] Should ignore readdata's backlog. (#1275) 2023-02-24 17:31:26 +08:00
Angerszhuuuu
a7587c3fe7
[CELEBORN-337] Remove unnecessary StatusCode.message (#1272)
2023-02-24 15:11:07 +08:00
Angerszhuuuu
81f7ffd767
[CELEBORN-332] Unify the log of ShuffleClientImpl (#1267)
2023-02-24 14:07:25 +08:00
zhongqiangchen
af9e8366c9
[CELEBORN-329] Add rpc address to exception message when failed to sendrpc (#1263) 2023-02-23 19:32:21 +08:00
zhongqiangchen
daca964f87
[CELEBORN-315][FLINK] Add UT for flink-plugin PartitionSortedBuffer (#1248) 2023-02-23 19:12:22 +08:00
Shuang
a963aa4b4f
[CELEBORN-333][FLINK] bypass unexpected backlog message when stream closed. (#1268) 2023-02-23 18:49:14 +08:00
Shuang
9754616d79
[CELEBORN-330] fix deadlock when use the same netty channel to receive data while other thread wait the response (#1265) 2023-02-23 17:57:43 +08:00
Angerszhuuuu
3067efcfd3
[CELEBORN-331] submitRetryPushData should throw PUSH_DATA_CREATE_CONNECTION_FAIL_MASTER too (#1266)
2023-02-23 14:57:11 +08:00
Angerszhuuuu
f7948190cf
[CELEBORN-316][FOLLOWUP] Should not wrap CelebornIOException with CelebornIOException (#1264) 2023-02-23 11:48:46 +08:00
Angerszhuuuu
1132cc25ab
[CELEBORN-328][MPROVEMENT] Too much noisy log when reserve slot failed (#1262) 2023-02-22 17:19:52 +08:00
Binjie Yang
decc7a2c3a
[DOC][METRIC] Fix simple spelling errors (#1242) 2023-02-22 16:34:10 +08:00
Angerszhuuuu
322f0d2b41
[CELEBORN-316] Wrap Celeborn exception with CelebornIOException (#1253) 2023-02-22 16:10:11 +08:00
Ethan Feng
1704aff95c
[CELEBORN-327][Flink] BufferStreamMananger should recycle buffer in reader thread. (#1261) 2023-02-22 16:02:58 +08:00
Shuang
3da615972e
[CELEBORN-326)] [Flink] lifecycleManager supports flink-yarn-session mode to handle multiple Flink jobs. (#1260) 2023-02-22 15:37:24 +08:00
Angerszhuuuu
fc8540a2e6
[CELEBORN-325] After worker restart, throw NPE when receive not found partition (#1259)
2023-02-22 15:19:34 +08:00
Ethan Feng
cb8df62ec5
[CELEBORN-324][FLINK] Flink plugin needs reuse connections. (#1257) 2023-02-21 18:32:00 +08:00
Angerszhuuuu
251b923b5b
[CELEBORN-321] When register shuffle failed, DataPushQueue should directly take the task queue to avoid NPE (#1258) 2023-02-21 17:02:37 +08:00
Shuang
1b1517c7b4
[CELEBORN-323] readBuffers need synchronized as recycle buffer will call readers in multiple threads (#1256) 2023-02-21 15:58:19 +08:00
Ethan Feng
5dd5e97225
[CELEBORN-322][Flink] Copy out message if it‘s readData only. (#1255) 2023-02-21 15:51:13 +08:00
Ethan Feng
c649655933
Revert "[CELEBORN-322][Flink] Copy out message if it‘s readData only."
This reverts commit 0aa37ed7d3.
2023-02-21 14:48:08 +08:00
Ethan Feng
0aa37ed7d3
[CELEBORN-322][Flink] Copy out message if it‘s readData only. 2023-02-21 14:45:39 +08:00
Ethan Feng
e9b33751d3
[CELEBORN-289] Add flink integration test module. (#1229) 2023-02-21 12:25:23 +08:00
Ethan Feng
0df08fbdf3
[CELEBORN-320][FLINK] fix handle wrong message type in FetchHandler. (#1254) 2023-02-21 11:51:01 +08:00
Ethan Feng
d7798127c9
[CELEBORN-319] FlinkTransportClient should not reuse connection. (#1252) 2023-02-21 11:16:30 +08:00
Shuang
cf833e568c
[CELEBORN-318] fix deadlock & bugs in bufferStreamManager (#1251) 2023-02-21 11:12:16 +08:00
Shuang
61065230bd
[CELEBORN-311] not retry when register for map partition occurs exception (#1246) 2023-02-21 10:16:10 +08:00
Shuang
a6103e4bf8
[CELEBORN-317] add REGISTER_MAP_PARTITION_TASK message type (#1250) 2023-02-20 22:01:35 +08:00
Ethan Feng
7e9ba19d58
[CELEBORN-302] Fix workers count out of sync in HA mode. (#1239) 2023-02-20 21:46:33 +08:00
Ethan Feng
26a3bb5e72
[CELEBORN-308] Fix flusher will exit unexpectedly if flush task write failed. (#1249) 2023-02-20 21:45:37 +08:00
zhongqiangchen
b09b85521a
[CELEBORN-314] Add ut for BufferPack and fix npe while logging (#1247) 2023-02-20 14:57:22 +08:00
Ethan Feng
bfb39632d9
[CELEBORN-235] Implement flink plugin. (#1244) 2023-02-17 19:31:12 +08:00
zhongqiangchen
b5dc106af8
[CELEBORN-291] optimize shuffleclientimpl creating client and pushdata for mappartition (#1224) 2023-02-17 19:07:19 +08:00
Ethan Feng
0c8bb83114
[CELEBORN-234] Implement buffer stream. (#1221) 2023-02-17 17:38:36 +08:00
zhongqiangchen
80c922c2d1
[CELEBORN-290] optimize flink-plugin RemoteShuffleOutputGate/RemoteShuffleResultPartition (#1223) 2023-02-17 16:52:48 +08:00
Shuang
4bac21982f
[CELEBORN-310] include roaringbitmap in flink plugin (#1245) 2023-02-17 15:26:23 +08:00
Ethan Feng
3aacede5f8
[CELEBORN-283] Derive network layer for flink plugin. (#1222) 2023-02-17 14:12:54 +08:00
zhongqiangchen
5236df68af
[CELEBORN-292] optimize mappartitionfilewriter flushing index and reading data header (#1225) 2023-02-17 13:42:28 +08:00
zhongqiangchen
79096d60d0
[CELEBORN-293] WorkerSource registers timer for mappartition message metrics (#1226) 2023-02-17 11:29:54 +08:00
Shuang
b7ef9cf216
[CELEBORN-297] don't cache file groups for map partition shuffle type (#1237) 2023-02-17 11:28:47 +08:00
Ethan Feng
1dcfdb0c8f
[CELEBORN-281] Add metrics about buffer stream read buffer. (#1216) 2023-02-17 11:20:07 +08:00
Keyong Zhou
89b4eab3b6
[CELEBORN-309] Fix some potential concurrent issues in InFlightRequestTracker (#1243) 2023-02-17 10:01:19 +08:00
Angerszhuuuu
57f775a7e9
[CELEBORN-273] Move push data timeout checker into TransportResponseHandler to keep callback status consistence (#1208) 2023-02-16 18:27:37 +08:00
Ethan Feng
a364fb27b2
[CELEBORN-282] Add BacklogAnnouncement RPC. (#1217) 2023-02-16 14:58:39 +08:00
Ethan Feng
534853bf8a
[CELEBORN-278] Add openStreamWithCredit RPC. (#1214) 2023-02-16 14:07:13 +08:00
zhongqiangchen
2c508dae0f
[CELEBORN-307] fix ArrayComparisonFailure while running lz4 ut (#1241) 2023-02-16 13:41:17 +08:00