pt-table-checksum校验与pt-table-sync,前者主要用于数据的校验,验证主从是否一致,后者主要用来修复数据,二者通常状况结合起来用能够修复数据不一致的问题。mysql
下载工具包 的最新地址以下:sql
https://www.percona.com/downloads/percona-toolkit/LATEST/
安装pt-table-checksum 和pt-table-sync命令。须要先安装percona-toolkit 工具集数据库
1.1percona-toolkit 工具集安装方式介绍:
percona-toolkit 工具安装有3种方式:
第一种是下载二进制包,直接解压包,就可使用。
第二种是下载源码包,进行编译安装
第三种是rpm包安装
不论是二进制安装仍是源码包编译安装,仍是rpm包安装,安装前要下载相关的依赖软件包ide
yum ×××tall perl perl-devel perl-CPAN perl-DBD-MySQL perl-Time-HiRes perl-DBI perl-Digest-MD5 -y
1.2下载源码包编译安装:工具
yum ×××tall perl perl-devel perl-CPAN perl-DBD-MySQL perl-Time-HiRes perl-DBI perl-Digest-MD5 -y tar xf percona-toolkit-3.0.13.tar.gz [root@localhost ~]# cd percona-toolkit-3.0.13 [root@localhost percona-toolkit-3.0.13]# perl Makefile.PL Checking if your kit is complete... Looks good Writing Makefile for percona-toolkit make && make ×××tall
1.3下载二进制包直接解压安装:学习
yum ×××tall perl perl-devel perl-CPAN perl-DBD-MySQL perl-Time-HiRes perl-DBI perl-Digest-MD5 -y tar xf percona-toolkit-3.0.11_x86_64.tar.gz -C /usr/local/ mv percona-toolkit-3.0.11_x86_64 /usr/local/percona-toolkit [root@ks-es11 local]# cat /etc/profile.d/percona-toolkit.sh export PATH=$PATH:/usr/local/percona-toolkit/bin [root@ks-es11 local]# cat /etc/profile.d/mysql.sh export PATH=$PATH:/usr/local/mysql7/bin
master 192.168.0.39
slave 192.168.0.11测试
2.一、master上操做:this
mysql -uroot -p'jiawu256789' -e "ggrant replication slave on *.* to rept@'192.168.0.11' identified by 'JuwoSdk21TbUser'; flush privileges;" mysqldump -uroot -p'jiawu256789' -B -A -F --master-data=2 --single-transaction --set-gtid-purged=OFF --events|gzip >/opt/juwo_$(date +%F).sql.gz scp -rp -P10239 /opt/juwo_$(date +%F).sql.gz root@192.168.0.11:/root
2.二、slave上操做:日志
gzip -d /root/juwo_$(date +%F).sql.gz mysql -uroot -p'jiawu256789' -e "source /root/juwo_$(date +%F).sql;" mysql -uroot -p'jiawu256789' -e "CHANGE MASTER TO MASTER_HOST='192.168.0.39',MASTER_PORT=3306,MASTER_USER='rept',MASTER_PASSWORD='JuwoSdk21TbUser',MASTER_AUTO_POSITION = 1;start slave;show slave status\G"
建立pt-table-checksum 和pt-table-sync所须要的操做mysql的帐户以及修复数据时,所须要的权限code
登录slave和master(两边都要执行以下SQL)
master(192.168.0.18)
slave(192.168.0.22)
grant update,×××ert,select,create,drop,delete,index,execute,super,process,replication slave on *.* to ptsum@'192.168.0.%' identified by 'ptchecksums'; flush privileges;
master库上新建测试库和测试表,SQL指令以下:
create database test01; use test01; CREATE TABLE `frame01` ( `id` int(11) NOT NULL AUTO_INCREMENT,`parent_id` int(11) DEFAULT NULL,`dsn` varchar(255) NOT NULL,PRIMARY KEY (`id`)); ×××ert into `frame01` values(1,1,'192.168.0.22,u=pt22,p=ptchecksums,P=3307'); ×××ert into `frame01` values(2,2,'192.168.0.33,u=pt33,p=ptchecksums,P=3308');
slave查看数据已经同步过来了
mysql> select * from frame01; +----+-----------+------------------------------------------+ | id | parent_id | dsn | +----+-----------+------------------------------------------+ | 1 | 1 | 192.168.0.22,u=pt22,p=ptchecksums,P=3307 | | 2 | 2 | 192.168.0.33,u=pt33,p=ptchecksums,P=3308 | +----+-----------+------------------------------------------+ 2 rows in set (0.00 sec)
slave库修改frame01表数,模拟数据不一致
update frame01 set dsn='192.168.0.55,u=umaaa,p=ptchsyeudew,P=3310' where id=1; update frame01 set dsn='192.168.0.66,u=umbbb,p=ptchsyeudew,P=3311' where id=2; mysql> select * from frame01; +----+-----------+-------------------------------------------+ | id | parent_id | dsn | +----+-----------+-------------------------------------------+ | 1 | 1 | 192.168.0.55,u=umaaa,p=ptchsyeudew,P=3310 | | 2 | 2 | 192.168.0.66,u=umbbb,p=ptchsyeudew,P=3311 | +----+-----------+-------------------------------------------+
在新版本的pt-table-ckecksum不用本身添加checksums表,在pt-table-checksum 表时,会在master库上默认生成percona库和表checksums
[root@kusou-es11 ~]# pt-table-checksum h=192.168.0.39,u=ptsum,p='ptchecksums',P=3306 --tables=test01.frame01 --no-check-binlog-format --nocheck-replication-filters --recursion-method="processlist" Checking if all tables can be checksummed ... Starting checksum ... TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T10:37:41 0 1 2 0 1 0 0.314 test01.frame01
默认会在master库上生成库percona和表checksums
mysql> select * from percona.checksums; +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ | db | tbl | chunk | chunk_time | chunk_index | lower_boundary | upper_boundary | this_crc | this_cnt | master_crc | master_cnt | ts | +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ | test01 | frame01 | 1 | 0.001818 | NULL | NULL | NULL | 2eedbb7c | 2 | 2eedbb7c | 2 | 2019-06-15 11:08:00 | +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ 1 row in set (0.00 sec)
[root@kusou-es11 ~]# pt-table-checksum h=192.168.0.39,u=ptsum,p='ptchecksums',P=3306 --databases=test01 --nocheck-replication-filters --replicate=test01.checksums --no-check-binlog-format --recursion-method="processlist" Checking if all tables can be checksummed ... Starting checksum ... TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T10:40:18 0 1 2 0 1 0 0.317 test01.frame01
--nocheck-replication-filters 表示不检查复制过滤器; --replicate=test01.checksums 把检查结果写到mysql中test01库中的checksums表; --databases=test01 检查的库为test01, 此处能够写多个库,采用逗号进行分割; --no-check-binlog-format 不检查binlog日志格式, pt-table-checksum在运行的时候是基于statement复制格式的,若是binlog日志格式为row的话会报错,经过上面--no-check-binlog-format来去除binlog日志检查,以防报错; --recursion-method="proccesslist" 这时工具会从show processlist中在主库中自动寻找从库,固然不加此参数,默认也是采用的这种方式,还有另一种方式那就是dns,这个后面会介绍演示。
会把checksums表写入到master 192.168.0.39 的test01库中。登陆主库查看
mysql> select * from test01.checksums; +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ | db | tbl | chunk | chunk_time | chunk_index | lower_boundary | upper_boundary | this_crc | this_cnt | master_crc | master_cnt | ts | +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ | test01 | frame01 | 1 | 0.001907 | NULL | NULL | NULL | 2eedbb7c | 2 | 2eedbb7c | 2 | 2019-06-15 11:28:02 | +--------+---------+-------+------------+-------------+----------------+----------------+----------+----------+------------+------------+---------------------+ 1 row in set (0.00 sec)
TS :完成检查的时间。
ERRORS :检查时候发生错误和警告的数量。
DIFFS :0表示一致,1表示不一致。当指定--no-replicate-check时,会一直为0,当指定--replicate-check-only会显示不一样的信息。
验证演示以下:
[root@kusou-es11 ~]# pt-table-checksum h=192.168.0.39,u=ptsum,p='ptchecksums',P=3306 --databases=test01 --no-check-binlog-format --nocheck-replication-filters --no-replicate-check --recursion-method="processlist" Checking if all tables can be checksummed ... Starting checksum ... TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T11:08:00 0 0 2 0 1 0 0.012 test01.frame01
本来master 上 test01库和slave上test01库上的数据是不一致的,可是 pt-table-checksum 检测时,加上参数-no-replicate-check后,DIFFS 会始终是0
当指定--replicate-check-only会显示不一样的信息:
[root@kusou-es11 ~]# pt-table-checksum h=192.168.0.39,u=ptsum,p='ptchecksums',P=3306 --databases=test01 --no-check-binlog-format --nocheck-replication-filters --replicate-check-only --recursion-method="processlist" Checking if all tables can be checksummed ... Starting checksum ... Differences on kusou-es11 TABLE CHUNK CNT_DIFF CRC_DIFF CHUNK_INDEX LOWER_BOUNDARY UPPER_BOUNDARY test01.frame01 1 0 1
ROWS :表的行数。
CHUNKS :被划分到表中的块的数目。
SKIPPED :因为错误或警告或过大,则跳过块的数目。
TIME :执行的时间。
TABLE :被检查的表名
pt-table-checksum +dsn方式指令参数说明以下:
--nocheck-replication-filters :不检查复制过滤器,建议启用。后面能够用--databases来指定须要检查的数据库。 --no-check-binlog-format : 不检查复制的binlog模式,要是binlog模式是ROW,则会报错。 --replicate-check-only :只显示不一样步的信息。 --replicate= :把checksums的信息写入到指定表中,建议直接写到被检查的数据库当中。 --databases= :指定须要被检查的数据库,多个则用逗号隔开。 --tables= :指定须要被检查的表,多个用逗号隔开 h=192.168.0.39 :Master的内网地址 u=ptsum :用户名 p=ptchecksums :密码 P=3306 :端口 dsn=D :表示用dsn方式进行检查,D为database t=dsns :表示用这张表
配置dsn方式校验数据:
登录192.168.0.39 的·master库建立连接slave库的路由表
mysql> use percona; mysql>CREATE TABLE `dsns` ( `id` int(11) NOT NULL AUTO_INCREMENT,`parent_id` int(11) DEFAULT NULL,`dsn` varchar(255) NOT NULL,PRIMARY KEY (`id`)); mysql>######×××ert into dsns values(1,1,'h=slave的IP,u=username,p=password,P=3306'); mysql>×××ert into dsns values(1,1,'h=192.168.0.11,u=ptsum,p=ptchecksums,P=3306');
[root@kusou-es11 ~]# pt-table-checksum --nocheck-replication-filters --replicate=percona.checksums --databases=test01 --tables=frame01 --no-check-binlog-format h=192.168.0.39,u=ptsum,p=ptchecksums,P=3306 --recursion-method dsn=D=percona,t=dsns,h=192.168.0.11,u=ptsum,p=ptchecksums,P=3306 Checking if all tables can be checksummed ... Starting checksum ... TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T11:45:39 0 1 2 0 1 0 0.317 test01.frame01
DIFFS =1 也是能够看到;master 和slave的test01库中frame01表的数据是不一致的
指令参数简单说明:
第一段的ip,帐户,密码和端口,指的是链接主库的帐户信息。
h=192.168.0.39,u=ptsum,p=ptchecksums,P=3306
--recursion-method dsn= 采用dsn的连接方式进行校验数据
D=percona,t=dsns, 从 percona库中读取 dsns表中的连接从库的帐户信息
第二段的ip,帐户,密码和端口,指的是链接从库的帐户信息。
h=192.168.0.11,u=ptsum,p=ptchecksums,P=3306
提示:此条pt-table-checksum指令是从 从库192.168.0.11上的percona.dsns表读取连接从库的帐户信息,和主库192.168.0.39中的test01.frame01表的数据进行比对的。
若是此时登陆192.168.0.11从库,删掉slave库percona.dsns表中的数据,而后再进行pt-table-checksum比对的话,会提示链接不上从库,致使检查失败
mysql> select * from dsns; +----+-----------+---------------------------------------------+ | id | parent_id | dsn | +----+-----------+---------------------------------------------+ | 1 | 1 | h=192.168.0.11,u=ptsum,p=ptchecksums,P=3306 | +----+-----------+---------------------------------------------+ 1 row in set (0.00 sec) mysql> delete from dsns where id=1; Query OK, 1 row affected (0.00 sec) mysql> select * from dsns; [root@kusou-es11 ~]# pt-table-checksum --nocheck-replication-filters --replicate=percona.checksums --databases=test01 --tables=frame01 --no-check-binlog-format h=192.168.0.39,u=ptsum,p=ptchecksums,P=3306 --recursion-method dsn=D=percona,t=dsns,h=192.168.0.11,u=ptsum,p=ptchecksums,P=3306 Checking if all tables can be checksummed ... Starting checksum ... Diffs cannot be detected because no slaves were found. Please read the --recursion-method documentation for information. TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T12:23:40 0 0 2 0 1 0 0.009 test01.frame01
报错提示:Diffs cannot be detected because no slaves were found. Please read the --recursion-method documentation for information.
DIFFS =0 ,然而实际上,master和slave库的test01.frame01表的数据是不一致的,检测比对失败。
[root@kusou-es11 ~]# pt-table-checksum --nocheck-replication-filters --replicate=percona.checksums --databases=test01 --tables=frame01 --no-check-binlog-format h=192.168.0.39,u=ptsum,p=ptchecksums,P=3306 --recursion-method dsn=D=percona,t=dsns Checking if all tables can be checksummed ... Starting checksum ... TS ERRORS DIFFS ROWS DIFF_ROWS CHUNKS SKIPPED TIME TABLE 06-15T12:53:54 0 1 2 0 1 0 0.316 test01.frame01
DIFFS =1 说明master和slave数据存在不一致,检测成功
执行如上的命令,是能够检测成功的,缘由是:上述的命令是直接从主库192.168.0.39的percona.dsns表中读取的连接slave库的帐户信息的
slave库上操做下面的指令:
报错1: [root@localhost local]# /usr/local/percona-toolkit/bin/pt-table-checksum h=192.168.0.39,u=ptsum,p='ptchecksums',P=3306 --databases=mysql --no-check-binlog-format --nocheck-replication-filters Can't locate Digest/MD5.pm in @INC (@INC conta×××: /usr/local/lib64/perl5 /usr/local/share/perl5 /usr/lib64/perl5/vendor_perl /usr/share/perl5/vendor_perl /usr/lib64/perl5 /usr/share/perl5 .) at /usr/local/percona-toolkit/bin/pt-table-checksum line 789. BEGIN failed--compilation aborted at /usr/local/percona-toolkit/bin/pt-table-checksum line 789.
解决办法:
缘由是:缺乏perl-Digest-MD5包, 安装perl-Digest-MD5便可解决。 [root@localhost ~]# yum -y ×××tall perl-Digest-MD5
pt-table-checksum的数据校验工具到此简单介绍完成。有疑问的,欢迎留言一块儿交流学习。