TiDB启动Tispark报错

retry error: operation timed out after 2m0s
tispark 10.20.116.74:7077 failed to start: timed out waiting for port 7077 to be started after 2m0s, please check the log of the instance

Error: failed to start tispark: tispark 10.20.116.74:7077 failed to start: timed out waiting for port 7077 to be started after 2m0s, please check the log of the instance: timed out waiting for port 7077 to be started after 2m0s

Verbose debug logs has been written to /home/tidb/logs/tiup-cluster-debug-2020-09-07-16-08-31.log.
Error: run /home/tidb/.tiup/components/cluster/v1.1.1/tiup-cluster (wd:/home/tidb/.tiup/data/S9t6A81) failed: exit status 1

tiup-cluster-debug-2020-09-07-16-57-44.log (138.9 KB)

麻烦反馈下 tispark 的日志,多谢。

[tidb@dtaspkuat01v logs]$ less spark-tidb-org.apache.spark.deploy.master.Master-1-dtaspkuat01v.out

目前集群还没启动起来
[tidb@dtadbtuat01v logs]$ tiup cluster start tidb-OA
到tispark遇到这个问题
tiup-cluster-debug-2020-09-07-17-32-35.log (65.0 KB)

看日志时间应该tispark在启动,还有其他报错信息吗?

暂时没有了,就这些;Spark感觉起来了,但是tiup启动集群到这里就停止了
image

2020-09-07T17:53:39.407+0800 INFO SSHCommand {“host”: “10.20.116.74”, “port”: “22”, “cmd”: “export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c "systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service"”, “error”: “ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain”, “stdout”: “”, “stderr”: “”}
2020-09-07T17:53:39.407+0800 DEBUG TaskFinish {“task”: “StartCluster”, “error”: “failed to start tispark: failed to start: tispark 10.20.116.74:7077: executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidb@10.20.116.74:22’ {ssh_stderr: , ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c "systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service"}, cause: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain”, “errorVerbose”: "executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidb@10.20.116.74:22’ {ssh_stderr: , ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c "systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service"}, cause: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain\
at github.com/pingcap/tiup/pkg/cluster/executor.(*EasySSHExecutor).Execute()\
\tgithub.com/pingcap/tiup@/pkg/cluster/executor/ssh.go:195\
at github.com/pingcap/tiup/pkg/cluster/module.(*SystemdModule).Execute()\
\tgithub.com/pingcap/tiup@/pkg/cluster/module/systemd.go:98\
at github.com/pingcap/tiup/pkg/cluster/operation.startInstance()\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:460\
at github.com/pingcap/tiup/pkg/cluster/operation.StartComponent.func1()\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:513\
at golang.org/x/sync/errgroup.(*Group).Go.func1()\
\tgolang.org/x/sync@v0.0.0-20190911185100-cd5d95a43a6e/errgroup/errgroup.go:57\
at runtime.goexit()\
\truntime/asm_amd64.s:1357\
failed to start: tispark 10.20.116.74:7077\
github.com/pingcap/tiup/pkg/cluster/operation.startInstance\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:470\
github.com/pingcap/tiup/pkg/cluster/operation.StartComponent.func1\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:513\
golang.org/x/sync/errgroup.(*Group).Go.func1\
\tgolang.org/x/sync@v0.0.0-20190911185100-cd5d95a43a6e/errgroup/errgroup.go:57\
runtime.goexit\
\truntime/asm_amd64.s:1357\
failed to start tispark"}
2020-09-07T17:53:39.407+0800 INFO Execute command finished {“code”: 1, “error”: “failed to start tispark: failed to start: tispark 10.20.116.74:7077: executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidb@10.20.116.74:22’ {ssh_stderr: , ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c "systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service"}, cause: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain”, “errorVerbose”: "executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidb@10.20.116.74:22’ {ssh_stderr: , ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c "systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service"}, cause: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain\
at github.com/pingcap/tiup/pkg/cluster/executor.(*EasySSHExecutor).Execute()\
\tgithub.com/pingcap/tiup@/pkg/cluster/executor/ssh.go:195\
at github.com/pingcap/tiup/pkg/cluster/module.(*SystemdModule).Execute()\
\tgithub.com/pingcap/tiup@/pkg/cluster/module/systemd.go:98\
at github.com/pingcap/tiup/pkg/cluster/operation.startInstance()\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:460\
at github.com/pingcap/tiup/pkg/cluster/operation.StartComponent.func1()\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:513\
at golang.org/x/sync/errgroup.(*Group).Go.func1()\
\tgolang.org/x/sync@v0.0.0-20190911185100-cd5d95a43a6e/errgroup/errgroup.go:57\
at runtime.goexit()\
\truntime/asm_amd64.s:1357\
failed to start: tispark 10.20.116.74:7077\
github.com/pingcap/tiup/pkg/cluster/operation.startInstance\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:470\
github.com/pingcap/tiup/pkg/cluster/operation.StartComponent.func1\
\tgithub.com/pingcap/tiup@/pkg/cluster/operation/action.go:513\
golang.org/x/sync/errgroup.(*Group).Go.func1\
\tgolang.org/x/sync@v0.0.0-20190911185100-cd5d95a43a6e/errgroup/errgroup.go:57\
runtime.goexit\
\truntime/asm_amd64.s:1357\
failed to start tispark"}
[tiup-cluster-debug-2020-09-07-17-53-39.log attachment](upload://2KcPbvd0UgYmkL7SmVITXHP6BGh.log) (65.0 KB)

spark worker启动不了

现在现象是 tiup 状态是 down 状态,实际使用 Tispark 进行读操作可以执行么?

插入数据情况,现在这种情况正常么?

中控机器 执行报错:
Error: failed to start tispark: failed to start: tispark 10.20.116.74:7077: executor.ssh.execute_failed: Failed to execute command over SSH for ‘tidb@10.20.116.74:22’
{ssh_stderr: , ssh_stdout: , ssh_command: export LANG=C; PATH=$PATH:/usr/bin:/usr/sbin sudo -H bash -c
“systemctl daemon-reload && systemctl start tispark-master-7077.service && systemctl enable tispark-master-7077.service”},
cause: ssh: handshake failed: ssh: unable to authenticate, attempted methods [none publickey], no supported methods remain

直接远程执行命令: 难道需要root到spark节点免密登录?我这边只是做了tidb用户免密登录

给 tidb 用户在spark节点 sudo 权限试试

谢谢,已经解决;
ssh-copy-id 需要用到tidb集群里面的;/home/tidb/.tiup/storage/cluster/clusters/tidb-OA/ssh/id_rsa.pub
自己生产的ssh-keygen 不行,不晓得为啥

:+1:

此话题已在最后回复的 1 分钟后被自动关闭。不再允许新回复。