br 备份失败

【 TiDB 使用环境】生产环境 /测试/ Poc
【 TiDB 版本】
【复现路径】做过哪些操作出现的问题
【遇到的问题:问题现象及影响】
【资源配置】进入到 TiDB Dashboard -集群信息 (Cluster Info) -主机(Hosts) 截图此页面
【附件:截图/日志/监控】
版本v7.1.5 ,通过 br 备份报错,报错信息如下

[2024/10/23 09:51:23.854 +08:00] [INFO] [state_syncer.go:158] ["get key failed"] [key=/tidb/server/global_state] [error="context canceled"]
[2024/10/23 09:51:24.055 +08:00] [WARN] [job_table.go:306] ["[ddl] get global state failed"] [error="context canceled"] [errorVerbose="context canceled\ngithub.com/pingcap/errors.AddStack\n\t/root/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20231212100244-799fae176cfb/errors.go:174\ngithub.com/pingcap/errors.Trace\n\t/root/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20231212100244-799fae176cfb/juju_adaptor.go:15\ngithub.com/pingcap/tidb/ddl/syncer.(*serverStateSyncer).getKeyValue\n\t/workspace/source/tidb/ddl/syncer/state_syncer.go:149\ngithub.com/pingcap/tidb/ddl/syncer.(*serverStateSyncer).GetGlobalState\n\t/workspace/source/tidb/ddl/syncer/state_syncer.go:173\ngithub.com/pingcap/tidb/ddl.(*ddl).doCheckClusterState\n\t/workspace/source/tidb/ddl/job_table.go:304\ngithub.com/pingcap/tidb/ddl.(*ddl).startDispatchLoop\n\t/workspace/source/tidb/ddl/job_table.go:248\ngithub.com/pingcap/tidb/util.(*WaitGroupWrapper).Run.func1\n\t/workspace/source/tidb/util/wait_group_wrapper.go:154\nruntime.goexit\n\t/usr/local/go/src/runtime/asm_amd64.s:1598"]
[2024/10/23 09:51:24.055 +08:00] [FATAL] [job_table.go:249] ["dispatch loop get cluster state failed, it should not happen, please try restart TiDB"] [error="context canceled"] [errorVerbose="context canceled\ngithub.com/pingcap/errors.AddStack\n\t/root/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20231212100244-799fae176cfb/errors.go:174\ngithub.com/pingcap/errors.Trace\n\t/root/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20231212100244-799fae176cfb/juju_adaptor.go:15\ngithub.com/pingcap/tidb/ddl/syncer.(*serverStateSyncer).getKeyValue\n\t/workspace/source/tidb/ddl/syncer/state_syncer.go:149\ngithub.com/pingcap/tidb/ddl/syncer.(*serverStateSyncer).GetGlobalState\n\t/workspace/source/tidb/ddl/syncer/state_syncer.go:173\ngithub.com/pingcap/tidb/ddl.(*ddl).doCheckClusterState\n\t/workspace/source/tidb/ddl/job_table.go:304\ngithub.com/pingcap/tidb/ddl.(*ddl).startDispatchLoop\n\t/workspace/source/tidb/ddl/job_table.go:248\ngithub.com/pingcap/tidb/util.(*WaitGroupWrapper).Run.func1\n\t/workspace/source/tidb/util/wait_group_wrapper.go:154\nruntime.goexit\n\t/usr/local/go/src/runtime/asm_amd64.s:1598"] [stack="github.com/pingcap/tidb/ddl.(*ddl).startDispatchLoop\n\t/workspace/source/tidb/ddl/job_table.go:249\ngithub.com/pingcap/tidb/util.(*WaitGroupWrapper).Run.func1\n\t/workspace/source/tidb/util/wait_group_wrapper.go:154"]

这是重启整个 TiDB 集群还是只要重启 tidb node 节点就可以

tidb集群组件,服务都正常么,提示获取不到tidb的状态,是不是集群有问题,看看日志啥的

服务都正常

br备份机器需要连pd节点,你看看通不通

还有没有其他的日志信息? 重新发起br备份也是报这个错误?

先看看tiup cluster display tidb-name
看看组件是不是都是正常的。都连接一下,能不能脸上

重新发起也是这个报错

组件都是正常的

curl http://pd_ip:端口/pd/api/v1/status 随便跑个api,看可以执行成功吗