TiUP 部署集群在 CopyComponent 失败。

在CopyComponent这一步发生错误。

2020-06-03T09:12:24.968+0100    DEBUG   TaskBegin       {"task": "CopyComponent: component=blackbox_exporter, version=v0.12.0, remote=192.168.10.100:/root/tidb-deploy/monitor-9100 os=linux, arch=amd64"}
2020-06-03T09:12:25.265+0100    DEBUG   TaskFinish      {"task": "CopyComponent: component=node_exporter, version=v0.17.0, remote=192.168.10.100:/root/tidb-deploy/monitor-9100 os=linux, arch=amd64", "error": "Process exited with status 1", "errorVerbose": "Process exited with status 1\
github.com/pingcap/errors.AddStack\
\t/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20190809092503-95897b64e011/errors.go:174\
github.com/pingcap/errors.Trace\
\t/go/pkg/mod/github.com/pingcap/errors@v0.11.5-0.20190809092503-95897b64e011/juju_adaptor.go:15\
github.com/pingcap/tiup/pkg/cluster/task.(*InstallPackage).Execute\
\t/home/jenkins/agent/workspace/tiup-release/pkg/cluster/task/install_package.go:45\
github.com/pingcap/tiup/pkg/cluster/task.(*CopyComponent).Execute\
\t/home/jenkins/agent/workspace/tiup-release/pkg/cluster/task/copy_component.go:46\
github.com/pingcap/tiup/pkg/cluster/task.(*Serial).Execute\
\t/home/jenkins/agent/workspace/tiup-release/pkg/cluster/task/task.go:183\
github.com/pingcap/tiup/pkg/cluster/task.(*StepDisplay).Execute\
\t/home/jenkins/agent/workspace/tiup-release/pkg/cluster/task/step.go:85\
github.com/pingcap/tiup/pkg/cluster/task.(*Parallel).Execute.func1\
\t/home/jenkins/agent/workspace/tiup-release/pkg/cluster/task/task.go:236\
runtime.goexit\
\t/usr/local/go/src/runtime/asm_amd64.s:1357"}

以下是我的配置文件:


# # Global variables are applied to all deployments and used as the default value of
# # the deployments if a specific deployment value is missing.
global:
 user: "tidb"
 ssh_port: 22
 deploy_dir: "/root/tidb-deploy"
 data_dir: "/root/tidb-data"

# # Monitored variables are applied to all the machines.
monitored:
 node_exporter_port: 9100
 blackbox_exporter_port: 9115

server_configs:
 tidb:
   log.slow-threshold: 300
 tikv:
   readpool.storage.use-unified-pool: false
   readpool.coprocessor.use-unified-pool: true
 pd:
   replication.enable-placement-rules: true
 tiflash:
   logger.level: "info"

pd_servers:
 - host: 192.168.10.100

tidb_servers:
 - host: 192.168.10.100

tikv_servers:
 - host: 192.168.10.100
   port: 20160
   status_port: 20180

 - host: 192.168.10.100
   port: 20161
   status_port: 20181

 - host: 192.168.10.100
   port: 20162
   status_port: 20182

tiflash_servers:
 - host: 192.168.10.100

你好,

尝试重复执行看是否可以解决,如果不行希望上传下 debug 完成日志和报错截图,

重试很多遍也不行。这是日志和截图哈:
tiup-cluster-debug-2020-06-03-09-12-25.log (85.3 KB)

部署用户设置的 tidb, 但部署目录指定在 /root 所以没有权限写入

建议把 global 中的 deploy_dir 和 data_dir 改为相对路径(会自动补全到部署用户的家目录下)或者直接删去这两行设置,如果一定要放在 /root 中,需要把部署用户改为 root, 或者手工在部署机上将这两个目录的权限修改为 tidb 用户可写入

1 个赞

好的,谢谢

:ok_hand:

此话题已在最后回复的 1 分钟后被自动关闭。不再允许新回复。