failed to start: 10.142.100.84 blackbox_exporter-9115.service

【 TiDB 使用环境】生产环境 /测试/ Poc
【 TiDB 版本】
【复现路径】做过哪些操作出现的问题
【遇到的问题:问题现象及影响】
执行集群机器检查,出现如下异常并且通过修复无法修复
epoll-exclusive Fail epoll exclusive is not supported

根据以往的帖子说这个不影响,可以忽略,说是操作系统内核问题
查看了操作系统版本如下
Linux 3.10.0-327.el7.x86_64

启动集群时出现如下异常
Error: failed to start: 10.142.100.84 blackbox_exporter-9115.service, please check the instance’s log() for more detail.: timed out waiting for port 9115 to be started after 2m0s

查看tiup这台机器的debug日志如下,没有太多信息
2022-11-30T17:31:02.584+0800 DEBUG retry error {“error”: “operation timed out after 2m0s”}
2022-11-30T17:31:02.584+0800 DEBUG TaskFinish {“task”: “StartCluster”, “error”: “failed to start: 10.142.100.84 blackbox_exporter-9115.service, please check the instance’s log() for more detail.: timed out waiting for port 9115 to be started after 2m0s”, “errorVerbose”: “timed out waiting for port 9115 to be started after 2m0s\ngithub.com/pingcap/tiup/pkg/cluster/module.(*WaitFor).Execute\n\tgithub.com/pingcap/tiup/pkg/cluster/module/wait_for.go:91\ngithub.com/pingcap/tiup/pkg/cluster/spec.PortStarted\n\tgithub.com/pingcap/tiup/pkg/cluster/spec/instance.go:119\ngithub.com/pingcap/tiup/pkg/cluster/operation.systemctlMonitor.func1\n\tgithub.com/pingcap/tiup/pkg/cluster/operation/action.go:336\ngolang.org/x/sync/errgroup.(*Group).Go.func1\n\tgolang.org/x/sync@v0.0.0-20220819030929-7fc1605a5dde/errgroup/errgroup.go:75\nruntime.goexit\n\truntime/asm_amd64.s:1594\nfailed to start: 10.142.100.84 blackbox_exporter-9115.service, please check the instance’s log() for more detail.”}
2022-11-30T17:31:02.584+0800 INFO Execute command finished {“code”: 1, “error”: “failed to start: 10.142.100.84 blackbox_exporter-9115.service, please check the instance’s log() for more detail.: timed out waiting for port 9115 to be started after 2m0s”, “errorVerbose”: “timed out waiting for port 9115 to be started after 2m0s\ngithub.com/pingcap/tiup/pkg/cluster/module.(*WaitFor).Execute\n\tgithub.com/pingcap/tiup/pkg/cluster/module/wait_for.go:91\ngithub.com/pingcap/tiup/pkg/cluster/spec.PortStarted\n\tgithub.com/pingcap/tiup/pkg/cluster/spec/instance.go:119\ngithub.com/pingcap/tiup/pkg/cluster/operation.systemctlMonitor.func1\n\tgithub.com/pingcap/tiup/pkg/cluster/operation/action.go:336\ngolang.org/x/sync/errgroup.(*Group).Go.func1\n\tgolang.org/x/sync@v0.0.0-20220819030929-7fc1605a5dde/errgroup/errgroup.go:75\nruntime.goexit\n\truntime/asm_amd64.s:1594\nfailed to start: 10.142.100.84 blackbox_exporter-9115.service, please check the instance’s log() for more detail.”}

通过sudo journalctl -u blackbox_exporter-9115命令看到如下异常
Failed at step CAPABILITIES spawning /bin/bash: Invalid argument

【资源配置】
【附件:截图/日志/监控】

忽略掉这个问题吧,这个无所谓,只要有node_exporter-9100 进程存在就可以了

tiup cluster start tidb-cluster-test --init
可是这样启动没有密码输出

tiup cluster display tidb-cluster-test

看看主要节点正常吗?

没看出异常,节点信息都看到了

所以你的问题是没有root登陆密码吗?

不确定集群是否正常?没密码倒可以修改


这是刚刚display

这个问题忽略吧,不用管了,我升级了tidb 6.1.2最后忽略了这些问题


看你操作系统还是Centos 7.2,可能某些内核特性没办法使用,方便升级到7.3或者8.X么?

同意这位大佬的说法。一些底层环境如果不按照推荐的来,有的问题很难排查。

集群是升级上来的还是直接安装的目前的版本?

此话题已在最后回复的 60 天后被自动关闭。不再允许新回复。