Kafka/ZooKeeper study notes

 

1. Kafka depends on ZooKeeper. Downloads:

Kafka package kafka_2.10-0.10.0.0 and ZooKeeper 3.4.10.

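2. Configure and start ZooKeeper

    Set the environment variables ZOOKEEPER_HOME and PATH.

Edit the zoo.cfg file under zookeeper-3.4.10/conf (zkServer.sh reads conf/zoo.cfg by default; copy zoo_sample.cfg to create it if it does not exist):

Settings: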

dataDir=/home/t/source/zookeeper-3.4.10/dataDir
dataLogDir=/home/t/source/zookeeper-3.4.10/dataLogDir

Start ZooKeeper:

./zkServer.sh start

3. Configure and start Kafka

Edit the Kafka configuration (config/server.properties):

Start the Kafka server:

./kafka-server-start.sh  ../config/server.properties

Create a topic (a message category):

./kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test

Produce messages:

./kafka-console-producer.sh  --broker-list localhost:9092 --topic test

Consume messages:

./kafka-console-consumer.sh --zookeeper localhost:2181 --topic test --from-beginning

Result:

Whatever is typed into the producer console is printed by the consumer console.

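Partition

Partitions: if a topic has 3 partitions, the number of consumers in one consumer group should be <= 3. Each partition is read by at most one consumer in the group, so any extra consumers receive no data.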
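To make the "consumers <= partitions" rule concrete, here is a minimal sketch that hands partitions out to the consumers of one group in round-robin order. It is a simplified illustration, not Kafka's actual assignor code; the class name PartitionAssignmentSketch and the consumer names c1..c4 are made up for this example.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;

// Simplified illustration only: this is not Kafka's real partition assignor.
final class PartitionAssignmentSketch {

    // Hand out partitions 0..numPartitions-1 to the consumers in round-robin order.
    static Map<String, List<Integer>> assign(int numPartitions, List<String> consumers) {
        Map<String, List<Integer>> result = new LinkedHashMap<>();
        for (String consumer : consumers) {
            result.put(consumer, new ArrayList<>());
        }
        if (consumers.isEmpty()) {
            return result;
        }
        for (int p = 0; p < numPartitions; p++) {
            String owner = consumers.get(p % consumers.size());
            result.get(owner).add(p);
        }
        return result;
    }

    public static void main(String[] args) {
        // 3 partitions, 4 consumers in the same group (hypothetical names).
        List<String> group = Arrays.asList("c1", "c2", "c3", "c4");
        assign(3, group).forEach((consumer, partitions) ->
                System.out.println(consumer + " -> " + partitions));
    }
}
```

With 3 partitions and 4 consumers, c1, c2 and c3 each get one partition and c4 ends up with an empty list, which is the idle consumer described above.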

http://www.cnblogs.com/liuwei6/p/6900686.html

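Logically, a topic can be thought of as a queue. Every message must specify its topic, which simply means stating which queue the message goes into. To let Kafka's throughput scale linearly, a topic is physically split into one or more partitions. Each partition corresponds to a directory on disk, and that directory stores all of the partition's messages and index files.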
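The mapping from a message to a partition directory can be sketched as follows. This is only an illustration under stated assumptions: Kafka's own default partitioner hashes the key with murmur2, while this sketch uses String.hashCode() as a stand-in, and /tmp/kafka-logs is an assumed value for log.dirs. The directory name pattern "<topic>-<partition>" is how Kafka names partition directories.

```java
// Illustration of "topic -> partition -> directory"; not Kafka's real partitioner.
final class TopicLayoutSketch {

    // Stand-in hash: Kafka's default partitioner uses murmur2 on the serialized key,
    // String.hashCode() is used here only to show the "hash mod partitions" idea.
    static int partitionFor(String key, int numPartitions) {
        return Math.floorMod(key.hashCode(), numPartitions);
    }

    // Kafka names each partition's directory "<topic>-<partition>" under log.dirs.
    static String partitionDir(String logDir, String topic, int partition) {
        return logDir + "/" + topic + "-" + partition;
    }

    public static void main(String[] args) {
        String logDir = "/tmp/kafka-logs"; // assumed log.dirs value
        for (String key : new String[] {"aaa", "bbb", "ccc"}) {
            int p = partitionFor(key, 3);
            System.out.println(key + " -> " + partitionDir(logDir, "test", p));
        }
    }
}
```

Messages with the same key always land in the same partition, which is why ordering in Kafka is per partition rather than per topic.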

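Accessing Kafka from an external network: set advertised.listeners=PLAINTEXT://x.x.x.x:9092 in server.properties, where x.x.x.x is the address clients use to reach the broker.

Reading from and writing to Kafka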

import org.apache.kafka.clients.consumer.ConsumerRecord;  
import org.apache.kafka.clients.consumer.ConsumerRecords;  
import org.apache.kafka.clients.consumer.KafkaConsumer;
import org.apache.kafka.clients.producer.KafkaProducer;
import org.apache.kafka.clients.producer.ProducerConfig;
import org.apache.kafka.clients.producer.ProducerRecord;
import org.apache.kafka.common.TopicPartition;

import java.util.Arrays;
import java.util.Date;
import java.util.Properties;
  
public class KafkaConsumerExample {  
    public static void main(String[] args) throws InterruptedException {  
        Properties props = new Properties();  
        props.put("bootstrap.servers", "192.168.1.166:9092");  
        props.put("group.id", "test13");  
        props.put("enable.auto.commit", "true");  
        props.put("auto.commit.interval.ms", "1000");  
        props.put("session.timeout.ms", "30000");  
        props.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");  
        props.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer");  
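        // Consumer setting: start from the earliest offset when the group has no committed offset
        props.put("auto.offset.reset", "earliest");
        // Producer settings: only the commented-out writer at the end uses them.
        // The consumer ignores them (it may log a warning about unused configs).
        props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer");
        props.put(ProducerConfig.BATCH_SIZE_CONFIG, 1024*1024*5);
        // How long the producer waits before sending a batch to the Kafka server; 0 sends immediately
        props.put(ProducerConfig.LINGER_MS_CONFIG,0);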
        
        
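        // Kafka reader
        KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props);
        // seek() only works on partitions already assigned to this consumer, so the
        // partition is assigned explicitly instead of calling subscribe(). The topic
        // "test" was created with a single partition, so its only partition is number 0.
        TopicPartition partition = new TopicPartition("test", 0);
        consumer.assign(Arrays.asList(partition));
        consumer.seek(partition, 1); // start reading at offset 1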
        while (true) {  
            ConsumerRecords<String, String> records = consumer.poll(2000);
            System.out.println("-------------"+new Date());
            for (ConsumerRecord<String, String> record : records)  
                System.out.printf("offset = %d, key = %s, value = %s\n", record.offset(), record.key(), record.value());  
        }
        
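        /*
        // Kafka writer. It is unreachable after the infinite loop above,
        // so move it before the loop (or into its own class) to use it.
        KafkaProducer<String, String> producer = new KafkaProducer<>(props);
        producer.send(new ProducerRecord<String, String>("test", "aaa", "xiaoxiaoxiao2018"));
        producer.close(); // send() is asynchronous; close() flushes pending records
        */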
    }  
}
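The example above keeps consumer and producer settings in one Properties object. A possible alternative, sketched here with plain java.util.Properties so it runs without the Kafka client on the classpath, is to build the two sets separately. The class and method names (KafkaPropsSketch, consumerProps, producerProps) are hypothetical; the keys are the standard Kafka config names, with "batch.size" and "linger.ms" being the string values behind ProducerConfig.BATCH_SIZE_CONFIG and ProducerConfig.LINGER_MS_CONFIG.

```java
import java.util.Properties;

// Hypothetical helper: builds separate config sets for the consumer and the producer.
final class KafkaPropsSketch {

    private static final String STRING_DESERIALIZER =
            "org.apache.kafka.common.serialization.StringDeserializer";
    private static final String STRING_SERIALIZER =
            "org.apache.kafka.common.serialization.StringSerializer";

    // Settings that only a KafkaConsumer understands.
    static Properties consumerProps(String bootstrapServers, String groupId) {
        Properties props = new Properties();
        props.put("bootstrap.servers", bootstrapServers);
        props.put("group.id", groupId);
        props.put("enable.auto.commit", "true");
        props.put("auto.commit.interval.ms", "1000");
        props.put("session.timeout.ms", "30000");
        props.put("auto.offset.reset", "earliest");
        props.put("key.deserializer", STRING_DESERIALIZER);
        props.put("value.deserializer", STRING_DESERIALIZER);
        return props;
    }

    // Settings that only a KafkaProducer understands.
    static Properties producerProps(String bootstrapServers) {
        Properties props = new Properties();
        props.put("bootstrap.servers", bootstrapServers);
        props.put("key.serializer", STRING_SERIALIZER);
        props.put("value.serializer", STRING_SERIALIZER);
        props.put("batch.size", 1024 * 1024 * 5); // 5 MB batches, as in the example above
        props.put("linger.ms", 0);                // send immediately, do not wait
        return props;
    }

    public static void main(String[] args) {
        System.out.println(consumerProps("192.168.1.166:9092", "test13"));
        System.out.println(producerProps("192.168.1.166:9092"));
    }
}
```

Each object can then be passed to the matching constructor, new KafkaConsumer<>(...) or new KafkaProducer<>(...), which avoids the unused-config warnings that a shared object produces.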

Glossary:

bootstrap.servers: the Kafka cluster connection string. It may list several host:port pairs separated by commas, for example your.host.name:9092.
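As a rough illustration of the bootstrap.servers format, the sketch below splits such a string into host and port pairs using only the JDK. It is not the Kafka client's own parsing code; the class name BootstrapServersSketch and the second host other.host.name:9093 are assumptions made for the example.

```java
import java.net.InetSocketAddress;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper: splits a bootstrap.servers string into host/port pairs.
final class BootstrapServersSketch {

    static List<InetSocketAddress> parse(String bootstrapServers) {
        List<InetSocketAddress> result = new ArrayList<>();
        for (String entry : bootstrapServers.split(",")) {
            String trimmed = entry.trim();
            if (trimmed.isEmpty()) {
                continue; // tolerate stray commas
            }
            int colon = trimmed.lastIndexOf(':');
            if (colon <= 0) {
                throw new IllegalArgumentException("expected host:port, got: " + trimmed);
            }
            String host = trimmed.substring(0, colon);
            int port = Integer.parseInt(trimmed.substring(colon + 1));
            // createUnresolved avoids a DNS lookup; this sketch only checks the format.
            result.add(InetSocketAddress.createUnresolved(host, port));
        }
        return result;
    }

    public static void main(String[] args) {
        // The second entry is a made-up host used only to show the list form.
        for (InetSocketAddress address : parse("your.host.name:9092, other.host.name:9093")) {
            System.out.println(address.getHostString() + " port " + address.getPort());
        }
    }
}
```

The client only needs one reachable broker from this list to discover the rest of the cluster, so listing more than one mainly protects against a single broker being down at startup.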
