tidb升级后grafana监控异常

为提高效率,提问时请提供以下信息,问题描述清晰可优先响应。

  • 【TiDB 版本】:5.7.25-TiDB-v3.0.9
  • 【问题描述】:从2.1.0升级到3.0.9后,发现grafana 监控异常,比如tidb监控数据正常,但是tikv,tipd升级后的监控数据就没的。
系统信息
+--------------+---------------------------+
|     Host     |          Release          |
+--------------+---------------------------+
| test-tidb002 | 3.10.0-693.2.2.el7.x86_64 |
| test-tidb003 | 3.10.0-693.2.2.el7.x86_64 |
| test-tidb001 | 3.10.0-693.2.2.el7.x86_64 |
+--------------+---------------------------+
TiDB 集群信息
+--------------------+--------------+------+----+------+
|    TiDB_version    | Clu_replicas | TiDB | PD | TiKV |
+--------------------+--------------+------+----+------+
| 5.7.25-TiDB-v3.0.9 |      3       |  2   | 3  |  3   |
+--------------------+--------------+------+----+------+
集群节点信息
+------------+--------------+
|  Node_IP   | Server_info  |
+------------+--------------+
| instance_0 |   tikv+pd    |
| instance_1 | tidb+pd+tikv |
| instance_2 | tidb+pd+tikv |
+------------+--------------+
容量 & region 数量
+---------------------+-----------------+--------------+
| Storage_capacity_GB | Storage_uesd_GB | Region_count |
+---------------------+-----------------+--------------+
|       2639.53       |       1.05      |      93      |
+---------------------+-----------------+--------------+
QPS
+---------+----------------+-----------------+
| Clu_QPS | Duration_99_MS | Duration_999_MS |
+---------+----------------+-----------------+
|  145.20 |    1874.54     |     2030.65     |
+---------+----------------+-----------------+
热点 region 信息
+---------+----------+-----------+
|  Store  | Hot_read | Hot_write |
+---------+----------+-----------+
| store-1 |    0     |     0     |
| store-5 |    0     |     0     |
| store-4 |    1     |     0     |
+---------+----------+-----------+
磁盘延迟信息
+--------+------------+-------------+--------------+
| Device |  Instance  | Read_lat_MS | Write_lat_MS |
+--------+------------+-------------+--------------+
|  vda   | instance_0 |     nan     |     1.00     |
|  vda   | instance_1 |     nan     |     1.00     |
|  vda   | instance_2 |    45.00    |     0.80     |
|  vdb   | instance_0 |     nan     |     0.00     |
|  vdb   | instance_1 |     nan     |     0.00     |
|  vdb   | instance_2 |     nan     |     0.00     |
+--------+------------+-------------+--------------+

若提问为性能优化、故障排查类问题,请下载脚本运行。终端输出的打印结果,请务必全选并复制粘贴上传。

  1. 请问是否有正确执行 ansible-playbook rolling_update_monitor.yml ,执行过程中是否有报错呢?
  2. 3.0 对监控做了调整 ,麻烦确认下 grafana 的 dashboard 有没有选择对。TiKV 的监控应该为 TiKV-Summary 以及 TiKV-Detail

多谢。确实dashboard选错了。

:blush:,感谢回复,如果问题已解决,请选择一个解决方案吧~

如有新的问题,请另开新帖提问哦~