tipd 启动失败。["run server failed"] [error="[PD:server:ErrCancelStartEtcd]etcd start canceled"]

【 TiDB 使用环境】生产环境
【 TiDB 版本】7.5.0
【复现路径】
【遇到的问题:问题现象及影响】新增tipd节点报错 无法启动
【资源配置】

【附件:截图/日志/监控】

有大佬帮忙看看是什么问题呀 感谢🙏

2024-07-27T15:14:59.000856822+08:00 [2024/07/27 15:14:59.000 +08:00] [FATAL] [main.go:279] [“run server failed”] [error=“[PD:server:ErrCancelStartEtcd]etcd start canceled”] [stack=“main.start\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:279\nmain.createServerWrapper\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:190\ngithub.com/spf13/cobra.(*Command).execute\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:846\ngithub.com/spf13/cobra.(*Command).ExecuteC\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:950\ngithub.com/spf13/cobra.(*Command).Execute\n\t/go/pkg/mod/github.com/spf13/cobra@v1.0.0/command.go:887\nmain.main\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/pd/cmd/pd-server/main.go:70\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:267”]

刚刚删掉这个节点重启一个 报了另一个错误


2024-07-27T15:41:59.837534057+08:00 2024-07-27 15:41:59.837438 W | etcdserver: could not get cluster response from http://tidb-pd-4.tidb-pd-peer.tidb.svc:2380: Get “http://tidb-pd-4.tidb-pd-peer.tidb.svc:2380/members”: context deadline exceeded (Client.Timeout exceeded while awaiting headers)

2024-07-27T15:41:59.837594310+08:00 [2024/07/27 15:41:59.837 +08:00] [ERROR] [etcdutil.go:89] [“failed to get cluster from remote”] [error=“[PD:etcd:ErrEtcdGetCluster]could not retrieve cluster information from the given URLs: could not retrieve cluster information from the given URLs”]

加个节点进去替换

为啥 PD 要偶数个实例?

k8s部署的?节点之间网络通信是不是有问题?

我也遇到了和楼主同一个问题,还没解决,我不是k8s部署的 pd节点到tidb正反向的网络都是通的