TiKV_GC_can_not_work alert

The logs are below (this is a production environment, so it is fairly urgent):

[2020/05/24 08:35:44.081 +08:00] [INFO] [gc_worker.go:277] ["[gc worker] starts the whole job"] [uuid=5be7daf755c0017] [safePoint=416881556597243904] [concurrency=3]
[2020/05/24 08:35:44.081 +08:00] [INFO] [gc_worker.go:773] ["[gc worker] start resolve locks"] [uuid=5be7daf755c0017] [safePoint=416881556597243904] [concurrency=3]
[2020/05/24 08:35:51.230 +08:00] [INFO] [gc_worker.go:794] ["[gc worker] finish resolve locks"] [uuid=5be7daf755c0017] [safePoint=416881556597243904] [regions=19800]
[2020/05/24 08:36:44.059 +08:00] [INFO] [gc_worker.go:246] ["[gc worker] there's already a gc job running, skipped"] ["leaderTick on"=5be7daf755c0017]
[2020/05/24 08:37:31.237 +08:00] [INFO] [gc_worker.go:582] ["[gc worker] start delete"] [uuid=5be7daf755c0017] [ranges=0]
[2020/05/24 08:37:31.237 +08:00] [INFO] [gc_worker.go:601] ["[gc worker] finish delete ranges"] [uuid=5be7daf755c0017] ["num of ranges"=0] ["cost time"=382ns]
[2020/05/24 08:37:31.239 +08:00] [INFO] [gc_worker.go:624] ["[gc worker] start redo-delete ranges"] [uuid=5be7daf755c0017] ["num of ranges"=0]
[2020/05/24 08:37:31.239 +08:00] [INFO] [gc_worker.go:643] ["[gc worker] finish redo-delete ranges"] [uuid=5be7daf755c0017] ["num of ranges"=0] ["cost time"=512ns]
[2020/05/24 08:37:31.244 +08:00] [INFO] [gc_worker.go:921] ["[gc worker] sent safe point to PD"] [uuid=5be7daf755c0017] ["safe point"=416881556597243904]
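
One way to read a GC round like this is to check that each stage message shows up in order, ending with `sent safe point to PD`. Below is a minimal Python sketch of that check. The stage list is copied from the log above, `missing_stages` is a hypothetical helper rather than anything shipped with TiDB, and other TiDB versions may log different messages.

```python
from __future__ import annotations

# Stage messages copied from the log above, in the order they appear there.
# Assumption: other TiDB versions may word these messages differently.
STAGES = [
    "starts the whole job",
    "start resolve locks",
    "finish resolve locks",
    "start delete",
    "finish delete ranges",
    "start redo-delete ranges",
    "finish redo-delete ranges",
    "sent safe point to PD",
]


def missing_stages(log_text: str) -> list[str]:
    """Return the stages that are not found, in order, in log_text.

    Hypothetical helper: it only does substring matching on the
    "[gc worker] <stage>" prefix and ignores every other log entry
    (for example the "already a gc job running, skipped" tick).
    """
    pos = 0
    missing = []
    for stage in STAGES:
        idx = log_text.find("[gc worker] " + stage, pos)
        if idx == -1:
            missing.append(stage)
        else:
            # Continue searching after this match so order is respected.
            pos = idx + 1
    return missing
```

Calling `missing_stages` on the text of a tidb.log excerpt returns an empty list when every stage of one round was logged in order; a non-empty list names the stages that never appeared.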

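Judging from the logs, GC is running normally. The TiKV_GC_can_not_work alert rule changed in the 3.0 series, so clusters on versions earlier than v3.0.5 need to update the alert rule: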

sum(increase(tikv_gcworker_gc_tasks_vec{task="gc"}[1d])) < 1

https://pingcap.com/docs-cn/stable/alert-rules/#tikv_gc_can_not_work
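
To make the rule's condition concrete: it fires when the `gc` task counter, summed over all TiKV instances, grew by less than 1 over the past day. The Python sketch below approximates that from the first and last sample of each instance in the window. It is a simplification: Prometheus's real `increase()` extrapolates to the window boundaries and handles counter resets between every pair of samples. The function name and the instance names are hypothetical.

```python
from __future__ import annotations


def gc_alert_fires(
    first: dict[str, float],
    last: dict[str, float],
    threshold: float = 1.0,
) -> bool:
    """Approximate `sum(increase(counter[1d])) < threshold`.

    `first` and `last` map a (hypothetical) TiKV instance name to the
    first and last counter sample seen inside the 1d window.
    """
    total = 0.0
    for instance, end_value in last.items():
        # An instance without a first sample contributes no increase.
        start_value = first.get(instance, end_value)
        if end_value >= start_value:
            total += end_value - start_value
        else:
            # Counter went backwards: treat it as a reset and count from zero.
            total += end_value
    return total < threshold


# No instance ran a GC task during the window, so the alert would fire.
idle = gc_alert_fires({"tikv-1": 10, "tikv-2": 5}, {"tikv-1": 10, "tikv-2": 5})

# One instance ran GC tasks during the window, so the alert stays silent.
active = gc_alert_fires({"tikv-1": 10, "tikv-2": 5}, {"tikv-1": 34, "tikv-2": 5})
```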

Thanks a lot! After I updated the alert rule, the alert went quiet right away.

:+1:
