不能停止 node_exporter-9100

【 TiDB 使用环境】/测试/
【 TiDB 版本】5.4.3
【复现路径】:安装tidb 后销毁销毁不了
【遇到的问题:问题现象及影响】
【资源配置】进入到 TiDB Dashboard -集群信息 (Cluster Info) -主机(Hosts) 截图此页面
【附件:截图/日志/监控】

报错:
Error: failed to stop: xx.xx.xx.xx node_exporter-9100.service, please check the instance’s log() for more detail.: timed out waiting for port 9100 to be stopped after 1m0s

日志:
2024/11/25 17:34:34.279 +08:00] [INFO] [base_client.go:104] [“[pd] init cluster id”] [cluster-id=xxxxxxxxx]
[2024/11/25 17:34:34.279 +08:00] [INFO] [client.go:648] [“[pd] tso dispatcher created”] [dc-location=global]
[2024/11/25 17:34:34.279 +08:00] [ERROR] [client.go:845] [“[pd] update connection contexts failed”] [dc=global] [error=“rpc error: code = Canceled desc = context canceled”]
[2024/11/25 17:34:34.279 +08:00] [INFO] [client.go:666] [“[pd] exit tso dispatcher”] [dc-location=global]
[2024/11/25 17:34:34.280 +08:00] [FATAL] [terror.go:292] [“unexpected error”] [error=“no pump found in pd”]
[stack=“github.com/pingcap/tidb/parser/terror.MustNil\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/parser/terror/terror.go:292\nmain.setupBinlogClient\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/tidb-server/main.go:328\nmain.main\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/tidb-server/main.go:199\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:225”] [stack=“github.com/pingcap/tidb/parser/terror.MustNil\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/parser/terror/terror.go:292\nmain.setupBinlogClient\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/tidb-server/main.go:328\nmain.main\n\t/home/jenkins/agent/workspace/build-common/go/src/github.com/pingcap/tidb/tidb-server/main.go:199\nruntime.main\n\t/usr/local/go/src/runtime/proc.go:225”]

topology.yaml :
global:
user: “tidb”
ssh_port: 22
deploy_dir: “/data/tidb/tidb-deploy”
data_dir: “/data/tidb/tidb-data”

server_configs:
tidb:
performance.txn-total-size-limit: 10737418000
pd:
replication.enable-placement-rules: true

pd_servers:

  • host: a
  • host: b
  • host: c

tidb_servers:

  • host: a
  • host: b
  • host: c

tikv_servers:

  • host: a
  • host: b
  • host: c

monitoring_servers:

  • host: a

grafana_servers:

  • host: a

alertmanager_servers:

  • host: a

现在是启动启动不了,销毁销毁不了
:imp:

你这是出现了两个问题,第一个 node_exporter-9100 启动不起来,大概率那台机的9100端口被某个服务占用了,导致无法启动;第二个,no pump found in pd,你这是在配置文件中开了 binlog.enable: true 的选项,一步步排查下

这个9100端口一直开着,而且关闭不掉

第二个我更改一下配置文件,现在不知来的及不

增加 --force 可以卸载成功,更改配置文件重装就好了 :handshake:

ps -ef | grep node_expor ,然后kill掉,再重新安装

kill 不行,无限重启 :rofl: