Using the Cassandra repair tool

Date: 2019-11-17


Introduction

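Cassandra is a decentralized distributed database. Each piece of data is stored on multiple peer nodes, that is, as multiple replicas. The replicas need to be checked regularly for inconsistencies. For example, a damaged disk can cause a replica to be lost, leaving the replicas of the same data inconsistent with one another.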

nodetool repair

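The nodetool utility that ships with Cassandra includes a repair command, which can be used for routine checks of data consistency. It also needs to be run after you change a keyspace's replication settings.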

Run nodetool help 'repair' to view the command's help, shown below:

NAME
        nodetool repair - Repair one or more tables

SYNOPSIS
        nodetool [(-h <host> | --host <host>)] [(-p <port> | --port <port>)]
                [(-pw <password> | --password <password>)]
                [(-pwf <passwordFilePath> | --password-file <passwordFilePath>)]
                [(-u <username> | --username <username>)] repair
                [(-dc <specific_dc> | --in-dc <specific_dc>)...]
                [(-dcpar | --dc-parallel)]
                [(-et <end_token> | --end-token <end_token>)]
                [(-full | --full)]
                [(-hosts <specific_host> | --in-hosts <specific_host>)...]
                [(-j <job_threads> | --job-threads <job_threads>)]
                [(-local | --in-local-dc)] [(-pl | --pull)]
                [(-pr | --partitioner-range)] [(-seq | --sequential)]
                [(-st <start_token> | --start-token <start_token>)]
                [(-tr | --trace)] [--] [<keyspace> <tables>...]

OPTIONS
        -dc <specific_dc>, --in-dc <specific_dc>
            Use -dc to repair specific datacenters

        -dcpar, --dc-parallel
            Use -dcpar to repair data centers in parallel.

        -et <end_token>, --end-token <end_token>
            Use -et to specify a token at which repair range ends

        -full, --full
            Use -full to issue a full repair.

        -h <host>, --host <host>
            Node hostname or ip address

        -hosts <specific_host>, --in-hosts <specific_host>
            Use -hosts to repair specific hosts

        -j <job_threads>, --job-threads <job_threads>
            Number of threads to run repair jobs. Usually this means number of
            CFs to repair concurrently. WARNING: increasing this puts more load
            on repairing nodes, so be careful. (default: 1, max: 4)

        -local, --in-local-dc
            Use -local to only repair against nodes in the same datacenter

        -p <port>, --port <port>
            Remote jmx agent port number

        -pl, --pull
            Use --pull to perform a one way repair where data is only streamed
            from a remote node to this node.

        -pr, --partitioner-range
            Use -pr to repair only the first range returned by the partitioner

        -pw <password>, --password <password>
            Remote jmx agent password

        -pwf <passwordFilePath>, --password-file <passwordFilePath>
            Path to the JMX password file

        -seq, --sequential
            Use -seq to carry out a sequential repair

        -st <start_token>, --start-token <start_token>
            Use -st to specify a token at which the repair range starts

        -tr, --trace
            Use -tr to trace the repair. Traces are logged to
            system_traces.events.

        -u <username>, --username <username>
            Remote jmx agent username

        --
            This option can be used to separate command-line options from the
            list of argument, (useful when arguments might be mistaken for
            command-line options

        [<keyspace> <tables>...]
            The keyspace followed by one or many tables

Main usage

nodetool repair mykeyspace mytable checks and repairs a specific table.
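If the repair is driven from a script rather than typed by hand, the same command can be assembled as an argument list. The following is only a minimal sketch: the helper name repair_table and its dry_run switch are made up for illustration, and actually running the command assumes nodetool is on the PATH and can reach the node over JMX.

```python
import shlex
import subprocess


def repair_table(keyspace, table, dry_run=True):
    """Return the nodetool argv for repairing one table; run it unless dry_run."""
    argv = ["nodetool", "repair", keyspace, table]
    if not dry_run:
        # Assumption: nodetool is on PATH and JMX access to the node works.
        subprocess.run(argv, check=True)
    return argv


# Dry run: only build and show the command line.
command = repair_table("mykeyspace", "mytable")
print(shlex.join(command))
```

Keeping dry_run=True as the default means the sketch never touches a cluster unless explicitly asked to.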

Commonly used options:

-j <job_threads>, --job-threads <job_threads>

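The number of repair jobs that run in parallel in the background; according to the help text above, this usually means the number of tables repaired concurrently (default 1, maximum 4). A RepairSession covers a group of nodes and the token ranges they jointly maintain. Adjust this with care, because raising it increases the load on the cluster.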

-full, --full

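Runs a full check and repair. Since Cassandra 2.2, repair is incremental by default, so a full repair has to be requested explicitly with this option. Incremental repair separates data that has already been repaired from the rest of an SSTable, splitting it into two SSTables: one holding repaired data and one holding unrepaired data (this process is called anticompaction). The next repair run then only examines the SSTable that has not been repaired, which reduces disk bandwidth and the cost of building Merkle trees and so avoids affecting online traffic. (During repair, each replica reads its data and builds a Merkle tree; the trees that the different nodes built for their replicas are then compared on one node.)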
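The Merkle tree comparison mentioned above can be illustrated with a toy model. This is a sketch of the general idea only, not Cassandra's implementation: the token bounds, the four-leaf tree, the SHA-256 hashing and all function names are assumptions chosen to keep the example small. Each replica hashes its rows into equal token buckets, builds a tree over the bucket hashes, and the comparison descends only into subtrees whose hashes differ.

```python
import hashlib


def leaf_hashes(rows, num_leaves, token_min, token_max):
    """Hash rows into num_leaves equal-width token buckets, one digest each."""
    width = (token_max - token_min) // num_leaves
    buckets = [hashlib.sha256() for _ in range(num_leaves)]
    for token, value in sorted(rows.items()):
        index = min((token - token_min) // width, num_leaves - 1)
        buckets[index].update(f"{token}={value};".encode())
    return [bucket.digest() for bucket in buckets]


def build_tree(leaves):
    """Return the tree as a list of levels, leaves first, root last.

    Assumes the number of leaves is a power of two.
    """
    levels = [leaves]
    while len(levels[-1]) > 1:
        prev = levels[-1]
        levels.append([
            hashlib.sha256(prev[i] + prev[i + 1]).digest()
            for i in range(0, len(prev), 2)
        ])
    return levels


def mismatched_leaves(tree_a, tree_b):
    """Walk down from the root, following only branches whose hashes differ."""
    suspects = [0]
    for level in range(len(tree_a) - 1, -1, -1):
        differing = [i for i in suspects if tree_a[level][i] != tree_b[level][i]]
        if level == 0:
            return differing
        suspects = [child for i in differing for child in (2 * i, 2 * i + 1)]
    return []


# Two hypothetical replicas of the same data; replica_b holds one stale row.
replica_a = {10: "a", 300: "b", 600: "c", 900: "d"}
replica_b = {10: "a", 300: "b", 600: "stale", 900: "d"}

tree_a = build_tree(leaf_hashes(replica_a, 4, 0, 1024))
tree_b = build_tree(leaf_hashes(replica_b, 4, 0, 1024))
out_of_sync = mismatched_leaves(tree_a, tree_b)
print(out_of_sync)
```

Only the bucket containing token 600 is reported, so only that slice of data would need to be exchanged between the two replicas in this model.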

-st <start_token>, --start-token <start_token>

-et <end_token>, --end-token <end_token>

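Specifies a custom token range, that is, a partition range. For example, (100,1000] means that only the data whose tokens fall in the interval from 100 (exclusive) to 1000 (inclusive) on the consistent-hash ring is checked. These options do not need to be set by default, in which case repair covers all tokens held by the node on which the command is run. Setting them amounts to a subrange repair, which skips anticompaction. If you want to avoid the impact of anticompaction, you can compute the token ranges yourself and run several subrange repairs.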
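Computing the token ranges for several subrange repairs can be sketched as follows. The helper names split_range and subrange_commands are hypothetical, the sketch assumes integer tokens, and it deliberately does not handle ranges that wrap around the ring; it only builds the command lines and does not run them.

```python
def split_range(start_token, end_token, pieces):
    """Split (start_token, end_token] into contiguous (start, end] subranges."""
    if start_token >= end_token:
        raise ValueError("wrap-around ranges are not handled in this sketch")
    span = end_token - start_token
    if not 1 <= pieces <= span:
        raise ValueError("pieces must be between 1 and the size of the range")
    bounds = [start_token + span * i // pieces for i in range(pieces + 1)]
    return list(zip(bounds[:-1], bounds[1:]))


def subrange_commands(keyspace, table, start_token, end_token, pieces):
    """Build one `nodetool repair -st ... -et ...` argv per subrange."""
    return [
        ["nodetool", "repair", "-st", str(start), "-et", str(end), keyspace, table]
        for start, end in split_range(start_token, end_token, pieces)
    ]


# Split the (100,1000] example range into three subrange repairs.
for argv in subrange_commands("mykeyspace", "mytable", 100, 1000, 3):
    print(" ".join(argv))
```

Because each subrange ends where the next one starts, the pieces together cover exactly the original (100,1000] interval.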

-pr, --partitioner-range

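Repairs only the primary ranges. What is a primary range? Suppose a row hashes into some range, that is, it maps to a token, and assume node A is responsible for that token. Because the keyspace keeps multiple replicas, further tokens (maintained by other nodes) are then chosen according to the keyspace's ReplicationStrategy to store the replicas. For node A, that range is a primary range.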

This option takes effect only when you are not doing a subrange repair.
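The notion of a primary range can be made concrete with a toy ring. Everything here is an assumption for illustration: the four-node ring, its tokens, and a placement rule that simply walks the ring clockwise to pick the next distinct nodes, which is a simplification of how a replication strategy chooses replicas.

```python
from bisect import bisect_left

# Hypothetical ring: token -> node. A node with token T owns (previous token, T].
RING = {100: "A", 400: "B", 700: "C", 1000: "D"}


def replicas_for(token, ring, replication_factor):
    """Return [primary owner, next replicas...] by walking the ring clockwise."""
    tokens = sorted(ring)
    index = bisect_left(tokens, token) % len(tokens)  # wrap past the last token
    wanted = min(replication_factor, len(set(ring.values())))
    owners = []
    while len(owners) < wanted:
        node = ring[tokens[index % len(tokens)]]
        if node not in owners:
            owners.append(node)
        index += 1
    return owners


def primary_ranges(node, ring):
    """Return the (start, end] ranges for which `node` is the primary owner."""
    tokens = sorted(ring)
    return [(tokens[i - 1], t) for i, t in enumerate(tokens) if ring[t] == node]


print(replicas_for(250, RING, 3))
print(primary_ranges("B", RING))
```

In this model a row with token 250 is stored on B, C and D, but the range (100,400] is a primary range only for B, so repairing primary ranges on every node covers each range once instead of once per replica.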

-dc <specific_dc>, --in-dc <specific_dc>

The repair involves only the nodes in the specified datacenter(s).

-hosts <specific_host>, --in-hosts <specific_host>

The repair involves only the nodes in the specified list of hosts.
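A scoped repair command can be assembled the same way from a script. In this sketch the helper name scoped_repair_command, the datacenter name dc1 and the host addresses are all made up; the flag is repeated once per value, following the "..." repetition shown for -dc and -hosts in the synopsis. Nothing is executed.

```python
def scoped_repair_command(keyspace, tables=(), datacenters=(), hosts=()):
    """Build a `nodetool repair` argv limited to given datacenters or hosts."""
    argv = ["nodetool", "repair"]
    for datacenter in datacenters:
        argv += ["-dc", datacenter]   # one -dc per datacenter
    for host in hosts:
        argv += ["-hosts", host]      # one -hosts per host
    argv.append(keyspace)
    argv += list(tables)
    return argv


# Hypothetical datacenter and host names, for illustration only.
by_dc = scoped_repair_command("mykeyspace", ["mytable"], datacenters=["dc1"])
by_hosts = scoped_repair_command("mykeyspace", ["mytable"],
                                 hosts=["10.0.0.1", "10.0.0.2"])
print(" ".join(by_dc))
print(" ".join(by_hosts))
```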

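Closing notes

To foster an open environment for technical discussion about Cassandra, the community has set up a WeChat official account and a DingTalk group. They offer technical talks and Q&A for users and host regular live sessions with experts; everyone is welcome to join. In addition, Cloud Cassandra is currently in free public beta, and you are welcome to try it: https://www.aliyun.com/product/cds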

This article is original content from the Yunqi Community (云栖社区) and may not be reproduced without permission.
