tiup安装tiflash节点报错

tidb版本:v4.0.1
安装过程:在现有tidb集群安装一个tiflash节点,最后阶段报错如下
问题:遇到这个报错后续如何处理?我使用tiup cluster display test-cluster观察,实际上tiflash已经起起来了,但我从监控里看不到tiflash相关的信息,通过ps查看该节点上也只起了tiflash这个进程,相关监控没有起

报错日志:

  • [Parallel] - UserSSH: user=tidb, host=10.200.45.180
  • [ Serial ] - save meta
  • [ Serial ] - ClusterOperate: operation=StartOperation, options={Roles:[] Nodes:[] Force:false SSHTimeout:0 OptTimeout:60 APITimeout:0}
    Starting component tiflash
    Starting instance tiflash 10.200.45.180:9000
    retry error: operation timed out after 1m0s
    tiflash 10.200.45.180:9000 failed to start: timed out waiting for port 9000 to be started after 1m0s, please check the log of the instance

Error: failed to start: failed to start tiflash: tiflash 10.200.45.180:9000 failed to start: timed out waiting for port 9000 to be started after 1m0s, please check the log of the instance: timed out waiting for port 9000 to be started after 1m0s

Verbose debug logs has been written to /home/tidb/logs/tiup-cluster-debug-2020-06-18-16-34-16.log.
Error: run /home/tidb/.tiup/components/cluster/v1.0.5/tiup-cluster (wd:/home/tidb/.tiup/data/S2FaBW8) failed: exit status 1

你好,

请提供以下信息,以便排查问题

  1. 请提供下 edit-config 看下原集群 topo,并附赠 display 截图
  2. scale-out 的 topo 文件烦请也上传下
  3. 看下 tiflash/log 下的所有 log 文件请打包上传下。这边看下是否有更详细的报错

这个问题结贴。我通过重做tiflash节点,运行扩容命令时增加–wait-timeout 300参数,问题解决

你好,

通过 scale-in ,scale-oout 解决了?

是的,先scale-in缩容,清理节点上的安装信息,再scale-out,scale-out的时候因为tiflash启动慢,把wait-timeout由默认的60s改成300s

:ok_hand:感谢反馈

此话题已在最后回复的 1 分钟后被自动关闭。不再允许新回复。