tispark跨机房连tiflash抛异常, Call mpp isAlive fail with Exception机房之间有延迟几十ms,同机房无异常。

【 TiDB 使用环境】生产环境
【 TiDB 版本】
【复现路径】同机房无异常,tispark跨机房连异常
【遇到的问题:问题现象及影响】
tispart跨机房连tiflash抛异常,机房之间有延迟几十ms,同机房正常。

Executor.java:1149) ~[?:1.8.0_152] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_152] at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_152] 2024-05-25 08:02:47 [WARN] [storeStatus-thread-0] com.pingcap.tikv.region.RegionStoreClient#1223 - Call mpp isAlive fail with Exception shade.io.grpc.StatusRuntimeException: UNIMPLEMENTED at shade.io.grpc.stub.ClientCalls.toStatusRuntimeException(ClientCalls.java:244) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at shade.io.grpc.stub.ClientCalls.getUnchecked(ClientCalls.java:225) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at shade.io.grpc.stub.ClientCalls.blockingUnaryCall(ClientCalls.java:142) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at com.pingcap.tikv.region.RegionStoreClient.isMppAlive(RegionStoreClient.java:1219) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at com.pingcap.tikv.TiSession.lambda$null$0(TiSession.java:254) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at java.util.concurrent.ConcurrentHashMap.replaceAll(ConcurrentHashMap.java:1610) ~[?:1.8.0_152] at com.pingcap.tikv.TiSession.lambda$getStoreStatusCache$1(TiSession.java:252) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[?:1.8.0_152] at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) ~[?:1.8.0_152] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) ~[?:1.8.0_152] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) ~[?:1.8.0_152] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_152] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_152] at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_152] 2024-05-25 08:02:47 [WARN] [storeStatus-thread-0] com.pingcap.tikv.region.RegionStoreClient#1223 - Call mpp isAlive fail with Exception shade.io.grpc.StatusRuntimeException: UNIMPLEMENTED at shade.io.grpc.stub.ClientCalls.toStatusRuntimeException(ClientCalls.java:244) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at shade.io.grpc.stub.ClientCalls.getUnchecked(ClientCalls.java:225) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at shade.io.grpc.stub.ClientCalls.blockingUnaryCall(ClientCalls.java:142) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at com.pingcap.tikv.region.RegionStoreClient.isMppAlive(RegionStoreClient.java:1219) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at com.pingcap.tikv.TiSession.lambda$null$0(TiSession.java:254) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at java.util.concurrent.ConcurrentHashMap.replaceAll(ConcurrentHashMap.java:1610) ~[?:1.8.0_152] at com.pingcap.tikv.TiSession.lambda$getStoreStatusCache$1(TiSession.java:252) ~[tispark-assembly-3.3_2.12-3.1.5.jar:?] at java.util.concurrent.Executors$RunnableAdapter.call(Executors.java:511) ~[?:1.8.0_152] at java.util.concurrent.FutureTask.runAndReset(FutureTask.java:308) ~[?:1.8.0_152] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.access$301(ScheduledThreadPoolExecutor.java:180) ~[?:1.8.0_152] at java.util.concurrent.ScheduledThreadPoolExecutor$ScheduledFutureTask.run(ScheduledThreadPoolExecutor.java:294) ~[?:1.8.0_152] at java.util.concurrent.ThreadPoolExecutor.runWorker(ThreadPoolExecutor.java:1149) ~[?:1.8.0_152] at java.util.concurrent.ThreadPoolExecutor$Worker.run(ThreadPoolExecutor.java:624) ~[?:1.8.0_152] at java.lang.Thread.run(Thread.java:748) ~[?:1.8.0_152] 2024-05-25 08:02:49 [INFO] [dispatcher-Executor] org.apache.spark.executor.YarnCoarseGrainedExecutorBackend#61 - Driver commanded a shutdown 2024-05-25 08:02:49 [INFO] [CoarseGrainedExecutorBackend-stop-executor] org.apache.spark.storage.memory.MemoryStore#61 - MemoryStore cleared 2024-05-25 08:02:49 [INFO] [CoarseGrainedExecutorBackend-stop-executor] org.apache.spark.storage.BlockManager#61 - BlockManager stopped 2024-05-25 08:02:49 [INFO] [shutdown-hook-0] org.apache.spark.util.ShutdownHookManager#61 - Shutdown hook called

tidb跨机房上连tiflash无异常。

错误信息表明在尝试执行某些操作时,可能是由于网络通信问题或者服务端的问题导致的。具体来说,shade.io.grpc.StatusRuntimeException: UNIMPLEMENTED 异常通常意味着客户端尝试执行的 gRPC 方法在服务端尚未实现。这种情况可能是由于服务端的 API 变更或者配置错误导致的.

是配置的问题吗