【 TiDB 使用环境】测试
【 TiDB 版本】v6.5.2
【复现路径】刚部署了不超过24小时的新集群,只做过TPC-H测试,TPC-C的测试的数据生成部分。空闲一段时间(一晚上)后2个KV节点在早上8点49自动重启了。
经过调查分析,在操作系统日志中发现了如下日志:
Apr 25 08:49:04 tikv119 kernel: Out of memory: Kill process 4583 (tikv-server) score 917 or sacrifice child
Apr 25 08:49:04 tikv119 kernel: Killed process 4583 (tikv-server), UID 0, total-vm:44109576kB, anon-rss:30524348kB, file-rss:520kB, shmem-rss:0kB
Apr 25 08:49:08 tikv119 systemd: tikv-20160.service: main process exited, code=killed, status=9/KILL
Apr 25 08:49:08 tikv119 systemd: Unit tikv-20160.service entered failed state.
Apr 25 08:49:08 tikv119 systemd: tikv-20160.service failed.
Apr 25 08:49:23 tikv119 systemd: tikv-20160.service holdoff time over, scheduling restart.
Apr 25 08:49:23 tikv119 systemd: Stopped tikv service.
Apr 25 08:49:23 tikv119 systemd: Started tikv service.
Apr 25 08:49:23 tikv119 bash: sync ...
Apr 25 08:49:23 tikv119 bash: real#0110m0.003s
Apr 25 08:49:23 tikv119 bash: user#0110m0.000s
Apr 25 08:49:23 tikv119 bash: sys#0110m0.001s
Apr 25 08:49:23 tikv119 bash: ok
【遇到的问题:问题现象及影响】
【资源配置】
1个Server、1个PD节点、3个KV节点。
【附件:截图/日志/监控】
PD节点的日志
pd.log (3.2 KB)
Server节点的日志
tidb-server.log (13.6 KB)
其中一台KV节点的日志
tikv119.log (3.3 MB)