10-kafka集群扩容

第一步 配置新得broker

  1. 将现有的集群上任一个服务器上的kafka目录拷贝到新的服务器上
  2. 修改config/server.properties中的broker.id、log.dirs、listeners
  3. 建立logs.dirs指定的目录,并设定读写权限(chomd -R 777 XXX)
broker.id=3
log.dirs=kafka-logs
listeners=PLAINTEXT://172.16.49.174:9092

第二步 启动新的broker

bin/kafka-server-start.sh config/server.properties  &

第三步 迁移指定topic的数据到新的broker

虽然通过上面两个步骤后已经完成了集群的扩容;可是集群上原有的topic的数据不会自动迁移到新的broker上。能够在新的broker所在的服务器上经过 ls /home/lxh/kafka_2.11-0.10.0.0/kafka-logs 查看到并无一原有的topic名称的文件目录(由于建立topic后会在config/server.properties中的配置的log.dirs 目录中生产以topic名称+分区编号的文件目录);那么就须要手动的区迁移数据json

(一)、生成迁移分配规则json文件

建立编辑要迁移的topic的 json文件bash

vi topic-to-move.json

好比要将topic名称为test和paritioer_test的数据从新平衡到集群中,就能够新增如下内容服务器

{"topics": [{"topic": "test"},
            {"topic": "paritioer_test"}],
 "version":1
}

生成迁移分配规则json文件spa

bin/kafka-reassign-partitions.sh --zookeeper localhost:2181 --topics-to-move-json-file topic-to-move.json --broker-list "0,1,2" --generate

获得的结果为code

Current partition replica assignment 

{"version":1,"partitions":[{"topic":"test","partition":4,"replicas":[0,1]},{"topic":"test","partition":1,"replicas":[1,0]},{"topic":"paritioer_test","partition":0,"replicas":[0]},{"topic":"test","partition":2,"replicas":[0,1]},{"topic":"test","partition":0,"replicas":[0,1]},{"topic":"test","partition":3,"replicas":[1,0]}]}
Proposed partition reassignment configuration

{"version":1,"partitions":[{"topic":"test","partition":4,"replicas":[1,0]},{"topic":"test","partition":1,"replicas":[1,2]},{"topic":"test","partition":2,"replicas":[2,0]},{"topic":"paritioer_test","partition":0,"replicas":[0]},{"topic":"test","partition":0,"replicas":[0,1]},{"topic":"test","partition":3,"replicas":[0,2]}]}

其中的Current partition replica assignment指的是迁移前的partition replica;Proposed partition reassignment configuration 指的就是迁移分配规则json。须要将该json文件保存到json文件中(如expand-cluster-reassignment.json)server

(二)、执行迁移分配

bin/kafka-reassign-partitions.sh --zookeeper localhost:2181 --reassignment-json-file expand-cluster-reassignment.json --execute

注意:在迁移过程当中不能人为的结束或中止kafka服务,否则会有数据不一致的问题kafka

(三)、验证分配

在执行的过程当中,能够新开一个终端执行如下命令来查看执行是否正确完成it

bin/kafka-reassign-partitions.sh --zookeeper localhost:2181 --reassignment-json-file expand-cluster-reassignment.json --verify

输出io

Status of partition reassignment:
Reassignment of partition [test,4] completed successfully
Reassignment of partition [test,0] completed successfully
Reassignment of partition [paritioer_test,0] completed successfully
Reassignment of partition [test,3] completed successfully
Reassignment of partition [test,2] completed successfully
Reassignment of partition [test,1] completed successfully

在迁移完成过程后,可使用如下命令看下topic的每一个partitions的分布状况class

bin/kafka-topics.sh --zookeeper 172.16.49.173:2181 --describe --topic test
Topic:test  PartitionCount:5    ReplicationFactor:2 Configs:
    Topic: test Partition: 0    Leader: 0   Replicas: 0,1   Isr: 0,1
    Topic: test Partition: 1    Leader: 1   Replicas: 1,2   Isr: 1,2
    Topic: test Partition: 2    Leader: 2   Replicas: 2,0   Isr: 0,2
    Topic: test Partition: 3    Leader: 0   Replicas: 0,2   Isr: 0,2
    Topic: test Partition: 4    Leader: 0   Replicas: 1,0   Isr: 0,1

能够看到名为test的topic的有的数据以及存在于编号为2的新broker上了

相关文章
相关标签/搜索