TiDB should be deployed separately; otherwise, once PD OOMs, the data is gone as well.
Stop the application workload first, otherwise you will lose data. Your PD has already gone down.
Please post the complete execution plan of this SQL. Judging from the SQL itself, it should not be consuming that many resources.
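(A minimal sketch of one way to capture that plan with actual runtime numbers, assuming the statement can be re-run; <your query> is just a placeholder for the SQL, which isn't shown in this thread. Note that EXPLAIN ANALYZE really executes the statement.)

EXPLAIN ANALYZE <your query>;
-- estRows vs actRows in the output shows how far off the optimizer's estimates are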
Only this one TiDB node OOMed, and the PD data is fine. Not all TiDB nodes OOMed.
What does it mean when the statistics are inaccurate? What is the impact?
For example, if the table actually has 100,000 rows but the statistics say 1,000,000, the optimizer will build the execution plan as if there were 1,000,000 rows.
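(A quick sketch of how to see whether the statistics row count has drifted from reality, using the authors table and the spiders schema that appear later in this thread.)

SHOW STATS_META WHERE db_name = 'spiders' AND table_name = 'authors';
-- Row_count here is what the optimizer believes; Modify_count is how many rows changed since the last analyze
SELECT COUNT(*) FROM spiders.authors;
-- compare with the actual row count (this full scan can itself be slow on a table this size)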
I just ran analyze table, then ran explain on my SQL again, and the plan still shows stats:pseudo.
MySQL [spiders]> analyze table authors;
Query OK, 0 rows affected (2 min 17.07 sec)
MySQL [spiders]>
MySQL [spiders]> SHOW ANALYZE STATUS;
+--------------+------------+----------------+---------------------------------------------------+----------------+---------------------+---------------------+----------+
| Table_schema | Table_name | Partition_name | Job_info | Processed_rows | Start_time | End_time | State |
+--------------+------------+----------------+---------------------------------------------------+----------------+---------------------+---------------------+----------+
| spiders | authors | | analyze index author_created_time | 69070360 | 2023-09-20 18:37:15 | 2023-09-20 18:37:48 | finished |
| spiders | authors | | analyze index author_platform_id_user_id | 69070362 | 2023-09-20 18:37:15 | 2023-09-20 18:38:05 | finished |
| spiders | authors | | analyze index author_platform_id | 69070360 | 2023-09-20 18:37:48 | 2023-09-20 18:38:22 | finished |
| spiders | authors | | analyze index author_updated_time | 69070362 | 2023-09-20 18:37:15 | 2023-09-20 18:38:26 | finished |
| spiders | authors | | analyze columns | 69070362 | 2023-09-20 18:37:15 | 2023-09-20 18:38:52 | finished |
| spiders | authors | | analyze index author_crawl_level | 69070360 | 2023-09-20 18:38:22 | 2023-09-20 18:38:55 | finished |
| spiders | authors | | analyze index author_user_id | 69070360 | 2023-09-20 18:38:05 | 2023-09-20 18:38:58 | finished |
| spiders | authors | | analyze index author_platform_id_crawl_level | 69070361 | 2023-09-20 18:38:26 | 2023-09-20 18:39:03 | finished |
| spiders | authors | | analyze index author_platform_id_user_platform_id | 69070360 | 2023-09-20 18:38:49 | 2023-09-20 18:39:32 | finished |
+--------------+------------+----------------+---------------------------------------------------+----------------+---------------------+---------------------+----------+
9 rows in set (0.00 sec)
Inaccurate statistics make queries slow.
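(If explain still shows stats:pseudo after a successful ANALYZE, a couple of checks worth running. This is only a sketch; tidb_enable_pseudo_for_outdated_stats is a system variable in recent TiDB versions, so adjust to whatever your version actually supports.)

SHOW STATS_HEALTHY WHERE db_name = 'spiders' AND table_name = 'authors';
-- a low Healthy value means many rows have changed since the last analyze
SHOW VARIABLES LIKE 'tidb_enable_pseudo_for_outdated_stats';
-- when ON, the optimizer falls back to pseudo stats once the statistics are considered outdated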
For the analysis, you can refer to the approach here.
Could you first do the group by on author_tag and tag, and only then join with authors? From the execution plan, the group by is performed on the TiDB node over all of the joined data, and that data is what is eating the memory. For this SQL, authors does not actually need to participate in the group by. Ideally, push the group by down to TiKV, or use TiFlash.
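(A rough sketch of that rewrite idea. The column names author_id, tag_id, id and the tag_cnt output are purely assumptions, since the original SQL and schema aren't posted in this thread.)

SELECT a.*, agg.tag_cnt
FROM authors a
JOIN (
    -- aggregate author_tag/tag first, so only one row per author reaches the join
    SELECT atg.author_id, COUNT(*) AS tag_cnt
    FROM author_tag atg
    JOIN tag tg ON tg.id = atg.tag_id
    GROUP BY atg.author_id
) agg ON agg.author_id = a.id;

With the derived table, the aggregation runs over author_tag/tag only and authors no longer participates in the group by; whether the aggregation actually gets pushed down to TiKV or TiFlash still has to be confirmed from the new execution plan.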
Are they all deployed on a single machine?
Go to TiDB Dashboard -> Cluster Info -> Hosts and post a screenshot of that page.
Post your configuration so we can take a look.
Check the output of free -h.
Optimize the SQL.