【TiDB 使用环境】生产环境
【TiDB 版本】v8.1.2
【操作系统】银河麒麟V10-SP2
【部署方式】本地机房部署
【集群数据量】2TB
【集群节点数】3
TIDB 178/179/180(IPAddress)
TIKV 178/179/180
TIFlash 178
PD 179
【问题复现路径】周末公司断电,无人知晓此情况
【遇到的问题:问题现象及影响】PD无法启动
【资源配置】进入到 TiDB Dashboard -集群信息 (Cluster Info) -主机(Hosts) 截图此页面
集群已无法启动,无法截图
【复制黏贴 ERROR 报错的日志】
tiup start报错如下:
Error: failed to start tikv: failed to start: 192.168.15.180 tikv-20160.service, please check the instance’s log(/tidb/tidb_deploy/tikv-20160/log) for more detail.: timed out waiting for port 20160 to be started after 2m0s
查询tikv日志如下:
[2025/11/03 10:48:36.200 +08:00] [INFO] [util.rs:639] [“connecting to PD endpoint”] [endpoints=192.168.15.179:2379] [thread_id=1]
[2025/11/03 10:48:36.203 +08:00] [INFO] [util.rs:601] [“PD failed to respond”] [err=“Grpc(RpcFailure(RpcStatus { code: 14-UNAVAILABLE, message: "failed to connect to all addresses", details: }))”] [endpoints=192.168.15.179:2379] [thread_id=1]
以下为pd_strerr.log报错信息
goroutine 300 [running]:
go.etcd.io/bbolt.(*DB).freepages.func2()
/root/go/pkg/mod/go.etcd.io/bbolt@v1.3.9/db.go:1202 +0x8d
created by go.etcd.io/bbolt.(*DB).freepages in goroutine 299
/root/go/pkg/mod/go.etcd.io/bbolt@v1.3.9/db.go:1200 +0x1e5
panic: freepages: failed to get all reachable pages (key[0]=(hex)0000000000739a9e5f0000000000000000 on leaf page(65) needs to be < than key of the next element in ancestor (hex)0000000000739a845f0000000000000000. Pages stack: [84 79 65])
【其他附件:截图/日志/监控】




