- 【 OS version & kernel version 】CentOS 7
- 【 TiDB version 】
tidb_version = v2.1.8
- 【 Disk model 】
- 【 Cluster node distribution 】
[tidb_servers] 192.168.104.65 192.168.36.151
[tikv_servers] 192.168.104.65 192.168.36.151 192.168.36.152
[pd_servers] 192.168.104.65 192.168.36.151 192.168.36.152
[spark_master]
[spark_slaves]
[lightning_server]
[importer_server]
[monitoring_servers] 192.168.104.65
[grafana_servers] 192.168.104.65
[monitored_servers] 192.168.104.65 192.168.36.151 192.168.36.152
[alertmanager_servers] 192.168.104.65
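- 【 Data volume & number of regions & number of replicas 】
- 【 Problem description (what I did) 】
While running start.yml, the playbook failed when it reached the step that starts pd-server:
PLAY [pd_servers]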
TASK [start PD by supervise] **********************************************************************************************
TASK [start PD by systemd] ************************************************************************************************
changed: [192.168.36.152]
changed: [192.168.36.151]
changed: [192.168.104.65]
TASK [wait until the PD port is up] ***************************************************************************************
ok: [192.168.104.65]
fatal: [192.168.36.152]: FAILED! => {"changed": false, "elapsed": 300, "msg": "the PD port 2379 is not up"}
fatal: [192.168.36.151]: FAILED! => {"changed": false, "elapsed": 300, "msg": "the PD port 2379 is not up"}
TASK [wait until the PD health page is available] *************************************************************************
FAILED - RETRYING: wait until the PD health page is available (12 retries left).
FAILED - RETRYING: wait until the PD health page is available (11 retries left).
FAILED - RETRYING: wait until the PD health page is available (10 retries left).
FAILED - RETRYING: wait until the PD health page is available (9 retries left).
FAILED - RETRYING: wait until the PD health page is available (8 retries left).
FAILED - RETRYING: wait until the PD health page is available (7 retries left).
FAILED - RETRYING: wait until the PD health page is available (6 retries left).
FAILED - RETRYING: wait until the PD health page is available (5 retries left).
FAILED - RETRYING: wait until the PD health page is available (4 retries left).
FAILED - RETRYING: wait until the PD health page is available (3 retries left).
FAILED - RETRYING: wait until the PD health page is available (2 retries left).
FAILED - RETRYING: wait until the PD health page is available (1 retries left).
fatal: [192.168.104.65]: FAILED! => {"attempts": 12, "changed": false, "content": "", "msg": "Status code was -1 and not [200]: Request failed: <urlopen error [Errno 111] Connection refused>", "redirected": false, "status": -1, "url": "http://192.168.104.65:2379/health"}
to retry, use: --limit @/home/tidb/tidb-ansible/retry_files/start.retry
PLAY RECAP ****************************************************************************************************************
192.168.104.65 : ok=19 changed=6 unreachable=0 failed=1
192.168.36.151 : ok=10 changed=3 unreachable=0 failed=1
192.168.36.152 : ok=10 changed=3 unreachable=0 failed=1
localhost : ok=1 changed=0 unreachable=0 failed=0
ERROR MESSAGE SUMMARY *****************************************************************************************************
[192.168.36.152]: Ansible FAILED! => playbook: start.yml; TASK: wait until the PD port is up; message: {"changed": false, "elapsed": 300, "msg": "the PD port 2379 is not up"}
[192.168.36.151]: Ansible FAILED! => playbook: start.yml; TASK: wait until the PD port is up; message: {"changed": false, "elapsed": 300, "msg": "the PD port 2379 is not up"}
[192.168.104.65]: Ansible FAILED! => playbook: start.yml; TASK: wait until the PD health page is available; message: {"attempts": 12, "changed": false, "content": "", "msg": "Status code was -1 and not [200]: Request failed: <urlopen error [Errno 111] Connection refused>", "redirected": false, "status": -1, "url": "http://192.168.104.65:2379/health"}
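
To re-check the two failing conditions outside of Ansible, a small script along the following lines could be run from the control machine. It is only a sketch: the host list and port 2379 are taken from the inventory and error messages above, while the helper names (`port_is_open`, `health_status`, `check_hosts`) and the 3-second timeout are assumptions made for illustration. It reports whether the port accepts TCP connections and what the `/health` URL returns; it does not explain why PD failed to start.

```python
import socket
import urllib.error
import urllib.request

# Hosts from the [pd_servers] inventory group and the port named in the errors.
PD_HOSTS = ["192.168.104.65", "192.168.36.151", "192.168.36.152"]
PD_PORT = 2379


def port_is_open(host, port, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers connection refused, unreachable hosts, and timeouts.
        return False


def health_status(host, port, timeout=3.0):
    """Return the HTTP status code of http://host:port/health, or None if unreachable."""
    url = "http://{}:{}/health".format(host, port)
    try:
        with urllib.request.urlopen(url, timeout=timeout) as resp:
            return resp.status
    except urllib.error.HTTPError as exc:
        # The server answered, but with a non-2xx status.
        return exc.code
    except (urllib.error.URLError, OSError):
        return None


def check_hosts(hosts, port):
    """Map each host to (port_open, health_status); health is skipped if the port is closed."""
    results = {}
    for host in hosts:
        is_open = port_is_open(host, port)
        results[host] = (is_open, health_status(host, port) if is_open else None)
    return results
```

Calling `check_hosts(PD_HOSTS, PD_PORT)` returns one `(port_open, status)` pair per host. A result of `(False, None)` corresponds to the "PD port 2379 is not up" failure, and `(True, None)` or a non-200 status corresponds to the health page failure.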