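To help us respond faster, please include the following information when asking a question. Clearly described problems are answered first.
- [TiDB version]: 3.0.8
- [Problem description]:
The TiKV service was interrupted, with a large number of leader drops. At the same time, many leader election messages appeared in the log.
The log below suggests that the connections can be re-established, but the nodes still cannot communicate.
Could you help take a look at what is going on?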
[2020/04/22 03:23:00.161 +08:00] [WARN] [raft_client.rs:118] [“batch_raft RPC finished fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("OS Error") }))”]
[2020/04/22 03:23:00.161 +08:00] [WARN] [raft_client.rs:132] [“batch_raft/raft RPC finally fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("OS Error") }))”] [to_addr=10.9.99.96:20160]
[2020/04/22 03:23:00.161 +08:00] [INFO] [store.rs:1946] [“broadcasting unreachable”] [unreachable_store_id=5] [store_id=59697]
[2020/04/22 03:23:00.167 +08:00] [WARN] [raft_client.rs:118] [“batch_raft RPC finished fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("OS Error") }))”]
[2020/04/22 03:23:00.167 +08:00] [WARN] [raft_client.rs:132] [“batch_raft/raft RPC finally fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("OS Error") }))”] [to_addr=10.9.16.130:20160]
[2020/04/22 03:23:00.180 +08:00] [WARN] [raft_client.rs:207] [“send to 10.9.99.96:20160 fail, the gRPC connection could be broken”]
[2020/04/22 03:23:00.194 +08:00] [ERROR] [transport.rs:318] [“send raft msg err”] [err=“Other("[src/server/raft_client.rs:216]: RaftClient send fail")”]
[2020/04/22 03:23:00.194 +08:00] [WARN] [raft_client.rs:207] [“send to 10.9.16.130:20160 fail, the gRPC connection could be broken”]
[2020/04/22 03:23:00.202 +08:00] [INFO] [store.rs:1946] [“broadcasting unreachable”] [unreachable_store_id=2429] [store_id=59697]
[2020/04/22 03:23:00.209 +08:00] [ERROR] [transport.rs:318] [“send raft msg err”] [err=“Other("[src/server/raft_client.rs:216]: RaftClient send fail")”]
[2020/04/22 03:23:00.217 +08:00] [INFO] [transport.rs:299] [“resolve store address ok”] [addr=10.9.99.96:20160] [store_id=5]
[2020/04/22 03:23:00.217 +08:00] [INFO] [raft_client.rs:50] [“server: new connection with tikv endpoint”] [addr=10.9.99.96:20160]
[2020/04/22 03:23:00.219 +08:00] [INFO] [transport.rs:299] [“resolve store address ok”] [addr=10.9.16.130:20160] [store_id=2429]
[2020/04/22 03:23:00.219 +08:00] [INFO] [raft_client.rs:50] [“server: new connection with tikv endpoint”] [addr=10.9.16.130:20160]
[2020/04/22 03:23:00.719 +08:00] [WARN] [endpoint.rs:454] [error-response] [err=“region message: "peer is not leader for region 135761, leader may Some(id: 135763 store_id: 2429)" not_leader { region_id: 135761 leader { id: 135763 store_id: 2429 } }”]
[2020/04/22 03:23:01.779 +08:00] [WARN] [endpoint.rs:454] [error-response] [err=“region message: "peer is not leader for region 164706, leader may Some(id: 164708 store_id: 2429)" not_leader { region_id: 164706 leader { id: 164708 store_id: 2429 } }”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:878] [“Connect failed: {"created":"@1587496983.223140374","description":"Failed to connect to remote host: OS Error","errno":110,"file":"/rust/registry/src/github.com-1ecc6299db9ec823/grpcio-sys-0.4.7/grpc/src/core/lib/iomgr/tcp_client_posix.cc","file_line":212,"os_error":"Connection timed out","syscall":"getsockopt(SO_ERROR)","target_address":"ipv4:10.9.16.130:20160"}”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:878] [“Connect failed: {"created":"@1587496983.223120970","description":"Failed to connect to remote host: OS Error","errno":110,"file":"/rust/registry/src/github.com-1ecc6299db9ec823/grpcio-sys-0.4.7/grpc/src/core/lib/iomgr/tcp_client_posix.cc","file_line":212,"os_error":"Connection timed out","syscall":"getsockopt(SO_ERROR)","target_address":"ipv4:10.9.99.96:20160"}”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:758] [“Subchannel 0x7fa874396600: Retry immediately”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:758] [“Subchannel 0x7fa874396a00: Retry immediately”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:719] [“Failed to connect to channel, retrying”]
[2020/04/22 03:23:03.226 +08:00] [INFO] [subchannel.cc:719] [“Failed to connect to channel, retrying”]
[2020/04/22 03:23:03.226 +08:00] [WARN] [raft_client.rs:118] [“batch_raft RPC finished fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("Connect Failed") }))”]
[2020/04/22 03:23:03.226 +08:00] [WARN] [raft_client.rs:132] [“batch_raft/raft RPC finally fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("Connect Failed") }))”] [to_addr=10.9.16.130:20160]
[2020/04/22 03:23:03.226 +08:00] [WARN] [raft_client.rs:118] [“batch_raft RPC finished fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("Connect Failed") }))”]
[2020/04/22 03:23:03.226 +08:00] [WARN] [raft_client.rs:132] [“batch_raft/raft RPC finally fail”] [err=“RpcFinished(Some(RpcStatus { status: Unavailable, details: Some("Connect Failed") }))”] [to_addr=10.9.99.96:20160]
[2020/04/22 03:23:03.235 +08:00] [WARN] [raft_client.rs:207] [“send to 10.9.99.96:20160 fail, the gRPC connection could be broken”]
[2020/04/22 03:23:03.239 +08:00] [ERROR] [transport.rs:318] [“send raft msg err”] [err=“Other("[src/server/raft_client.rs:216]: RaftClient send fail")”]
[2020/04/22 03:23:03.239 +08:00] [WARN] [raft_client.rs:207] [“send to 10.9.16.130:20160 fail, the gRPC connection could be broken”]
[2020/04/22 03:23:03.244 +08:00] [ERROR] [transport.rs:318] [“send raft msg err”] [err=“Other("[src/server/raft_client.rs:216]: RaftClient send fail")”]
[2020/04/22 03:23:03.244 +08:00] [INFO] [transport.rs:299] [“resolve store address ok”] [addr=10.9.16.130:20160] [store_id=2429]
[2020/04/22 03:23:03.244 +08:00] [INFO] [raft_client.rs:50] [“server: new connection with tikv endpoint”] [addr=10.9.16.130:20160]
[2020/04/22 03:23:03.244 +08:00] [INFO] [transport.rs:299] [“resolve store address ok”] [addr=10.9.99.96:20160] [store_id=5]
[2020/04/22 03:23:03.244 +08:00] [INFO] [raft_client.rs:50] [“server: new connection with tikv endpoint”] [addr=10.9.99.96:20160]
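
In case it helps whoever picks this up: a rough Python sketch for summarizing which peers the failures point at. It is not a TiKV tool; the function names `join_wrapped` and `summarize_failures` are made up for illustration. It assumes each log record starts with a `[YYYY/MM/DD hh:mm:ss.mmm` timestamp, glues hard-wrapped continuation lines back together, and counts only records carrying a `to_addr=` field or a gRPC `"target_address":"ipv4:..."` entry, as seen in the log above.

```python
import re
from collections import Counter

# A record is assumed to begin with a timestamp like "[2020/04/22 03:23:00.161".
TS_RE = re.compile(r'^\[(\d{4}/\d{2}/\d{2} \d{2}:\d{2}:\d{2}\.\d{3})')
# Peer address as it appears in the failure records of the log above.
ADDR_RE = re.compile(r'(?:to_addr=|target_address":"ipv4:)(\d+\.\d+\.\d+\.\d+:\d+)')


def join_wrapped(lines):
    """Rebuild one string per record by appending lines without a timestamp
    to the preceding record (undoes hard wrapping from a terminal paste)."""
    records = []
    for line in lines:
        line = line.rstrip("\n")
        if TS_RE.match(line) or not records:
            records.append(line)
        else:
            records[-1] += line
    return records


def summarize_failures(records):
    """Return (failure count per peer address, first timestamp per address)."""
    counts = Counter()
    first_seen = {}
    for record in records:
        m = ADDR_RE.search(record)
        if not m:
            continue
        addr = m.group(1)
        counts[addr] += 1
        ts = TS_RE.match(record)
        if ts and addr not in first_seen:
            first_seen[addr] = ts.group(1)
    return counts, first_seen
```

To use it, read the TiKV log file with `open(path).readlines()` (the path depends on your deployment), pass the lines through `join_wrapped`, and then through `summarize_failures`. If every failure lands on the same few addresses, as it seems to here with 10.9.99.96:20160 and 10.9.16.130:20160, that may point toward a network problem between those hosts rather than something inside TiKV, though the log alone cannot confirm that.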