Stopping TiDB with Ansible: one server cannot be stopped

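To help us respond efficiently, please provide as much background as you can when asking a question; clearly described problems are answered first. Please try to include the following: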

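  • [OS version & kernel version] CentOS Linux release 7.5.1804
  • [TiDB version] 3.0.4
  • [Disk type] SSD
  • [Cluster node distribution] 3 PD, 3 TiKV, 2 TiDB
  • [Problem description (what I did)] I used the Ansible control machine to stop the deployed servers. Two were stopped successfully; one could not be stopped.
  • [Keywords] Could not find the requested service tidb-4000.service: host

Attachments: ansible.log (364.3 KB), inventory.ini (1.8 KB)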

2019-10-23 06:18:17,227 p=29181 u=tidb | fatal: [192.168.1.281]: FAILED! => {"changed": false, "msg": "Could not find the requested service tidb-4000.service: host"}
2019-10-23 06:18:17,227 fail [192.168.1.281]: Ansible FAILED! => playbook: start.yml; TASK: start TiDB by systemd; message: {"changed": false, "msg": "Could not find the requested service tidb-4000.service: host"}

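It looks like the service never started successfully, so stopping it is bound to fail as well. You need to find out why the start failed first.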
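One way to track down the earlier failure is to pull every failure summary out of ansible.log and read them in order. The following is a minimal sketch, not part of tidb-ansible: the regular expression is modelled only on the "fail [host]: Ansible FAILED! => playbook: ...; TASK: ...; message: ..." lines quoted in this thread, and the names FAIL_RE, extract_failures and SAMPLE are made up for illustration.

```python
import re

# Pattern modelled on the failure summary lines quoted in this thread.
# It is an assumption that every failure in ansible.log uses this layout.
FAIL_RE = re.compile(
    r"fail \[(?P<host>[^\]]+)\]: Ansible FAILED! => "
    r"playbook: (?P<playbook>[^;]+); TASK: (?P<task>.+?); message: (?P<message>.*)$"
)


def extract_failures(log_text):
    """Return one dict (host, playbook, task, message) per failure summary line."""
    failures = []
    for line in log_text.splitlines():
        match = FAIL_RE.search(line)
        if match:
            failures.append(match.groupdict())
    return failures


# One summary line copied from this thread, used as sample input.
SAMPLE = (
    '2019-10-23 11:21:44,066 fail [192.168.1.281]: Ansible FAILED! => '
    'playbook: stop.yml; TASK: stop TiDB by systemd; message: '
    '{"changed": false, "msg": "Could not find the requested service '
    'tidb-4000.service: host"}'
)

for failure in extract_failures(SAMPLE):
    print(failure["host"], failure["playbook"], failure["task"], sep=" | ")
```

To run it against a real log, read the file contents and pass them to extract_failures in place of SAMPLE. The earliest deploy.yml or start.yml failure for the affected host is usually the one to look at first.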

2019-10-22 16:01:31,099 p=3602 u=tidb | fatal: [192.168.1.281]: FAILED! => {"changed": false, "msg": "You don't mount the file system of /fast_data with ext4 nodelalloc option. See https://github.com/pingcap/docs/blob/master/dev/how-to/deploy/orchestrated/ansible.md#step-8-mount-the-data-disk-ext4-filesystem-with-options-on-the-target-machines."}
2019-10-22 16:01:31,099 fail [192.168.1.281]: Ansible FAILED! => playbook: deploy.yml; TASK: tikv : Preflight check - Does tikv data dir meet ext4 file system requirement; message: {"changed": false, "msg": "You don't mount the file system of /fast_data with ext4 nodelalloc option. See https://github.com/pingcap/docs/blob/master/dev/how-to/deploy/orchestrated/ansible.md#step-8-mount-the-data-disk-ext4-filesystem-with-options-on-the-target-machines."}
2019-10-22 16:01:31,161 p=3602 u=tidb | TASK [tikv : Preflight check - Set fssystem_check_result fact]

The log shows that deploy.yml failed on this host: /fast_data is not mounted as ext4 with the nodelalloc option. Please fix that first.
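Before re-running deploy.yml, it can help to confirm on the target machine that the data directory really is mounted as ext4 with nodelalloc. The sketch below shows a similar check done by hand; it is not the playbook's own implementation. It assumes the text passed in has the /proc/mounts layout (device, mount point, file system type, comma-separated options), and the device name /dev/sdb1 in the sample is made up.

```python
def mount_entry(mounts_text, mount_point):
    """Return (fstype, options) for mount_point, or None if it is not mounted."""
    entry = None
    for line in mounts_text.splitlines():
        fields = line.split()
        # Fields: device, mount point, fstype, options, dump, pass
        if len(fields) >= 4 and fields[1] == mount_point:
            # Keep the last match: a later mount hides earlier ones.
            entry = (fields[2], fields[3].split(","))
    return entry


def meets_ext4_nodelalloc(mounts_text, mount_point):
    """True if mount_point is an ext4 mount that carries the nodelalloc option."""
    entry = mount_entry(mounts_text, mount_point)
    if entry is None:
        return False
    fstype, options = entry
    return fstype == "ext4" and "nodelalloc" in options


# Hypothetical sample in /proc/mounts format; the device name is invented.
SAMPLE_MOUNTS = (
    "/dev/sda1 / xfs rw,relatime 0 0\n"
    "/dev/sdb1 /fast_data ext4 rw,noatime,nodelalloc,data=ordered 0 0\n"
)

print(meets_ext4_nodelalloc(SAMPLE_MOUNTS, "/fast_data"))
```

On the real host, pass the contents of /proc/mounts instead of SAMPLE_MOUNTS. If the function returns False for /fast_data, remount the disk as described in the linked deployment guide before deploying again.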

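This is the Ansible log from after the disk was mounted. The pd and tikv processes are still running on 281; if I kill them manually they restart automatically, and ansible stop from the control machine still fails. ansible.log (15.8 KB)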

fatal: [192.168.1.281]: FAILED! => {"changed": false, "msg": "Could not find the requested service tidb-4000.service: host"}
2019-10-23 11:21:44,066 fail [192.168.1.281]: Ansible FAILED! => playbook: stop.yml; TASK: stop TiDB by systemd; message: {"changed": false, "msg": "Could not find the requested service tidb-4000.service: host"}

First check on this machine whether the tidb-server process is actually running and whether the deployment succeeded.
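Both checks can be scripted on the affected host. The sketch below rests on two assumptions that are not confirmed in this thread: that the deployment installs unit files under /etc/systemd/system, and that the server process reports its name as tidb-server in /proc/<pid>/comm. The function names are made up for illustration, and the /proc scan only works on Linux.

```python
import os


def pids_by_name(name, proc_root="/proc"):
    """Return the sorted PIDs whose /proc/<pid>/comm equals name."""
    pids = []
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue  # skip non-process entries such as "self" or "meminfo"
        try:
            with open(os.path.join(proc_root, entry, "comm")) as comm_file:
                if comm_file.read().strip() == name:
                    pids.append(int(entry))
        except OSError:
            continue  # the process exited or the entry is not readable
    return sorted(pids)


def unit_file_exists(unit, unit_dir="/etc/systemd/system"):
    """True if a unit file with this name exists in unit_dir (assumed location)."""
    return os.path.isfile(os.path.join(unit_dir, unit))
```

For example, calling pids_by_name("tidb-server") and unit_file_exists("tidb-4000.service") on 192.168.1.281 shows whether the process is running and whether the unit was ever installed. If the unit file is missing, that would be consistent with the "Could not find the requested service" message and suggests deploy.yml never completed on that host.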