mysql 的一次查询优化过程

有这样一条查询语句:mysql

select sum(index_count) as count, theday ,appVersion, channel, type
  from tableA 
 where project_id='qjp' and theday>='2019-05-17' and theday<='2019-05-23' 
 group by theday
复制代码

执行用时21s. 数据量55万左右,实在太慢了。sql

首先我须要肯定这21s 都用在什么地方了,即性能瓶颈在哪里。须要用到Msql的 Query Profiler 诊断分析工具。使用方法以下:bash

  1. 开启 profiling 参数
root@localhost : (none) 10:53:11> set profiling=1;
  Query OK, 0 rows affected (0.00 sec)
复制代码
  1. 执行你的查询 Query
mysql> select sum(index_count) as count, theday from tableA where project_id='qjp' and theday>='2019-05-17' and theday<='2019-05-23' group by theday
+-----------+-----------+------------+------------+------
| count     | theday     | appVersion | channel  | type | 
+-----------+-----------+------------+------------+------
| 180205137 | 2019-05-17 | 1.2.6      | 55550006 |      | 
| 168597045 | 2019-05-18 | 1.2.7      | 55550337 |      | 
| 153154098 | 2019-05-19 | 1.2.7      | 55550006 |      | 
+-----------+------------+------------+----------+------+
7 rows in set (21.03 sec)
复制代码

在开启 Query Profiler 功能以后,MySQL 就会自动记录全部执行的 Query 的 profile 信息了。app

  1. 获取系统中保存的全部 Query 的 profile 概要信息
mysql> show profiles;
+----------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------+
| Query_ID | Duration    | Query    |
+----------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------+
|        1 | 21.02640425 | select sum(index_count) as count, theday from tableA where project_id='qjp' and theday>='2019-05-17' and theday<='2019-05-23' group by theday|
+----------+-------------+----------------------------------------------------------------------------------------------------------------------------------------------+
1 row in set, 1 warning (0.00 sec)
复制代码

经过执行 “show profiles ” 命令获取当前系统中保存的多个 Query 的 profile 的概要信息。工具

  1. 针对单个 Query 获取详细的 profile 信息。
mysql> show profile for query 1;
+----------------------+-----------+
| Status               | Duration  |
+----------------------+-----------+
| starting             |  0.000117 |
| checking permissions |  0.000020 |
| Opening tables       |  0.000032 |
| init                 |  0.000039 |
| System lock          |  0.000021 |
| optimizing           |  0.000023 |
| statistics           |  0.000124 |
| preparing            |  0.000032 |
| Creating tmp table   |  0.000042 |
| Sorting result       |  0.000018 |
| executing            |  0.000016 |
| Sending data         | 21.021533 |
| Creating sort index  |  0.003272 |
| end                  |  0.000019 |
| query end            |  0.000022 |
| removing tmp table   |  0.000728 |
| query end            |  0.000019 |
| closing tables       |  0.000023 |
| freeing items        |  0.000064 |
| logging slow query   |  0.000020 |
| Opening tables       |  0.000025 |
| System lock          |  0.000146 |
| cleaning up          |  0.000051 |
+----------------------+-----------+
23 rows in set, 1 warning (0.00 sec)

复制代码

从上面的可看出时间主要用在 Sending data ,所谓的“Sending data” 并非单纯的发送数据,而是包括“收集 + 发送 数据”。后来我又尝试将查询的列减小,可是没有什么效果。后来无心间检查下表结构发现其中 有两列projet_id和 theday的列类型是text,好像知道什么了(不知道是谁设计的,可是其实没有必要),改这两列为varchar类型,再执行一次查询,只用了0.5秒性能

自此问题解决。总结一下,设计表的时候,列类型必定要考虑好,text类型尽可能少用,设置为varchar的时候,长度够用就好,越短性能越好。ui

=============我是分割线======下面是追加的内容===================spa

查看表结构:.net

mysql> desc tmp_table;
+-------------+--------------+------+-----+---------+-------+
| Field       | Type         | Null | Key | Default | Extra |
+-------------+--------------+------+-----+---------+-------+
| project_id  | text         | YES  | MUL | NULL    |       |
| index_count | varchar(100) | YES  |     | NULL    |       |
| theday      | text         | YES  | MUL | NULL    |       |
| app_version | varchar(32)  | NO   |     | NULL    |       |
| channel     | varchar(32)  | YES  |     | NULL    |       |
+-------------+--------------+------+-----+---------+-------+
8 rows in set (0.00 sec)
复制代码

修改列类型:设计

alter table 表名 MODIFY COLUMN 列名 VARCHAR(16)
复制代码

对了,在处理的过程当中,我还尝试了加索引,可是发现加索引后,查询时间不快反而变的更慢。这里有一篇关于索引的文章 blog.csdn.net/u014470581/…

相关文章
相关标签/搜索