4.0.13 升级时候切换 pd leader 一直提示超时失败

【 TiDB 使用环境】生产环境 /测试/ Poc
【 TiDB 版本】
【复现路径】做过哪些操作出现的问题
【遇到的问题:问题现象及影响】
【资源配置】进入到 TiDB Dashboard -集群信息 (Cluster Info) -主机(Hosts) 截图此页面
【附件:截图/日志/监控】
今天4.0.13 版本升级时,pd 切主一直提示超时10分失败,失败日志如下

2024-12-18T10:26:35.781+0800    INFO    Execute command finished        {"code": 1, "error": "failed to evict PD leader xx.xx.xx.xx: error evicting PD leader, operation timed out after 10m0s", "errorVerbose": "error evicting PD leader, operation timed out after 10m0s\nfailed to evict PD leader 10.74.45.167\ngithub.com/pingcap/tiup/pkg/cluster/spec.(*PDInstance).PreRestart\n\tgithub.com/pingcap/tiup/pkg/cluster/spec/pd.go:447\ngithub.com/pingcap/tiup/pkg/cluster/operation.upgradeInstance\n\tgithub.com/pingcap/tiup/pkg/cluster/operation/upgrade.go:269\ngithub.com/pingcap/tiup/pkg/cluster/operation.Upgrade\n\tgithub.com/pingcap/tiup/pkg/cluster/operation/upgrade.go:197\ngithub.com/pingcap/tiup/pkg/cluster/manager.(*Manager).Upgrade.func2\n\tgithub.com/pingcap/tiup/pkg/cluster/manager/upgrade.go:320\ngithub.com/pingcap/tiup/pkg/cluster/task.(*Func).Execute\n\tgithub.com/pingcap/tiup/pkg/cluster/task/func.go:34\ngithub.com/pingcap/tiup/pkg/cluster/task.(*Serial).Execute\n\tgithub.com/pingcap/tiup/pkg/cluster/task/task.go:86\ngithub.com/pingcap/tiup/pkg/cluster/manager.(*Manager).Upgrade\n\tgithub.com/pingcap/tiup/pkg/cluster/manager/upgrade.go:324\ngithub.com/pingcap/tiup/components/cluster/command.newUpgradeCmd.func1\n\tgithub.com/pingcap/tiup/components/cluster/command/upgrade.go:60\ngithub.com/spf13/cobra.(*Command).execute\n\tgithub.com/spf13/cobra@v1.6.1/command.go:916\ngithub.com/spf13/cobra.(*Command).ExecuteC\n\tgithub.com/spf13/cobra@v1.6.1/command.go:1044\ngithub.com/spf13/cobra.(*Command).Execute\n\tgithub.com/spf13/cobra@v1.6.1/command.go:968\ngithub.com/pingcap/tiup/components/cluster/command.init.0.newReplayCmd.func10\n\tgithub.com/pingcap/tiup/components/cluster/command/replay.go:59\ngithub.com/spf13/cobra.(*Command).execute\n\tgithub.com/spf13/cobra@v1.6.1/command.go:916\ngithub.com/spf13/cobra.(*Command).ExecuteC\n\tgithub.com/spf13/cobra@v1.6.1/command.go:1044\ngithub.com/spf13/cobra.(*Command).Execute\n\tgithub.com/spf13/cobra@v1.6.1/command.go:968\ngithub.com/pingcap/tiup/components/cluster/command.Execute\n\tgithub.com/pingcap/tiup/components/cluster/command/root.go:297\nmain.main\n\tgithub.com/pingcap/tiup/components/cluster/main.go:23\nruntime.main\n\truntime/proc.go:267\nruntime.goexit\n\truntime/asm_amd64.s:1650"

手动试试 https://docs.pingcap.com/zh/tidb/stable/pd-control#member-delete--leader_priority--leader-show--resign--transfer-member_name


手动迁移能迁移走么?看看 pd leader 日志里面有没有什么 failed 关键字

1 个赞

或者 https://docs.pingcap.com/zh/tidb/stable/tiup-component-cluster-replay
replay 续升级下试试。可能重试就好了。