致命错，tikv不能启动

奋斗的大象 · 2024 年5 月 22 日 03:28

[FATAL] [server.rs:428] ["panic_mark_file /data12/tidb/data/tikv-20162/panic_mark_file exists, there must be something wrong with the db. Do not remove the panic_mark_file and force the TiKV node to restart. Please contact TiKV maintainers to investigate the issue. If needed, use scale in and scale out to replace the TiKV node
【附件：截图/日志/监控】

tidb菜鸟一只 · 2024 年5 月 22 日 03:33

直接通过扩容后再缩容处理吧，最保险处理也很快。

Billdi表弟 · 2024 年5 月 22 日 03:41

use scale in and scale out to replace the TiKV node

Billdi表弟 · 2024 年5 月 22 日 03:42

说的是先缩容后扩容，可以试试

奋斗的大象 · 2024 年5 月 22 日 04:07

[2024/05/22 12:07:06.165 +08:00] [INFO] [region_cache.go:2377] [“[health check] check health error”] [store=10.114.26.112:20162] [error=“rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing dial tcp 10.114.26.112:20162: connect: connection refused"”]
[2024/05/22 12:07:06.165 +08:00] [INFO] [region_request.go:785] [“mark store’s regions need be refill”] [id=183060412] [addr=10.114.26.112:20162] [error=“context deadline exceeded”]

zhaokede · 2024 年5 月 22 日 04:15

第一个出错日志里，提了建议建议通过缩容和扩容替换tikv节点

TiDBer_QYr0vohO · 2024 年5 月 22 日 04:40

扩缩容处理吧

奋斗的大象 · 2024 年5 月 22 日 04:43

region_cache.go:2377] [“[health check] check health error”] [store=10.114.26.112:20161] [error=“rpc error: code = Unavailable desc = connection error: desc = "transport: Error while dialing dial tcp 10.114.26.112:20161: connect: connection refused"”]

奋斗的大象 · 2024 年5 月 22 日 04:47

port conflict for ‘20162’ between ‘tikv_servers:10.114.26.112.port’

Billmay表妹 · 2024 年5 月 22 日 05:07

问题出现之前你的部署是怎么样子的？

截个图看看你的机器配置情况？几个 tikv？

奋斗的大象 · 2024 年5 月 22 日 05:10

之前是好的，7台，昨天有台机器被搞挂了，今天重启不行了，扩容还报端口冲突：“code”: 1, “error”: “port conflict for ‘20162’ between ‘tikv_servers:10.114.26.112.port’ and ‘tikv_servers:10.114.26.112.port’”}

tony5413 · 2024 年5 月 22 日 05:18

use scale in and scale out to replace the TiKV node

奋斗的大象 · 2024 年5 月 22 日 05:46

tikv 下线失败 linux/x86_64 Pending Offline

tidb菜鸟一只 · 2024 年5 月 22 日 06:03

如果在同一台机器上扩容的话，那得换下端口，如果是7台tikv，其中1台panic的话，可以先缩容再扩容试下

TiDBer_q2eTrp5h · 2024 年5 月 22 日 07:22

先缩容后扩容这个可以试一下，以前遇到过一次就是这么解决的。

TIDB-Learner · 2024 年5 月 22 日 10:14

我在想出问题的tikv 实例，在扩容后，能缩容成功吗》