1,kafka依赖于zookeeper,下载:html
kafka2.10-0.10.00包下载,zookeeper3.4.10下载;java
2,配置启动ZOOKEEPERapache
配置项:ZOOKEEPER_HOME,和PATH;参考:bootstrap
修改zookeeper-3.4.10/conf下,zoo.conf文件:服务器
设置项:session
dataDir=/home/t/source/zookeeper-3.4.10/dataDir dataLogDir=/home/t/source/zookeeper-3.4.10/dataLogDir
zookeeper启动:spa
./zkServer.sh start
3,配置启动kafka3d
修改kafka配置项:code
启动kafkaserver
./kafka-server-start.sh ../config/server.properties
建立topic(消息类型)
./kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test
生产消息:
./kafka-console-producer.sh --broker-list localhost:9092 --topic test
消费消息:
./kafka-console-consumer.sh --zookeeper localhost:2181 --topic test --from-beginning
最终效果:
生产端输入什么,消费端输出什么。
分区,对于一个topic,3个分区,则同一组消费者数量应当<=3,不然有消费者接受不到数据;
http://www.cnblogs.com/liuwei6/p/6900686.html
Topic在逻辑上能够被认为是一个queue,每条消费都必须指定它的Topic,能够简单理解为必须指明把这条消息放进哪一个queue里。为了使得Kafka的吞吐率能够线性提升,物理上把Topic分红一个或多个Partition,每一个Partition在物理上对应一个文件夹,该文件夹下存储这个Partition的全部消息和索引文件。
kafka外网访问 advertised.listeners=PLAINTEXT://x.x.x.x:9092
kafka读写
import org.apache.kafka.clients.consumer.ConsumerRecord; import org.apache.kafka.clients.consumer.ConsumerRecords; import org.apache.kafka.clients.consumer.KafkaConsumer; import org.apache.kafka.clients.producer.KafkaProducer; import org.apache.kafka.clients.producer.ProducerConfig; import org.apache.kafka.clients.producer.ProducerRecord; import org.apache.kafka.common.TopicPartition; import java.util.Arrays; import java.util.Date; import java.util.Properties; import javax.print.attribute.standard.PrinterLocation; public class KafkaConsumerExample { public static void main(String[] args) throws InterruptedException { Properties props = new Properties(); props.put("bootstrap.servers", "192.168.1.166:9092"); props.put("group.id", "test13"); props.put("enable.auto.commit", "true"); props.put("auto.commit.interval.ms", "1000"); props.put("session.timeout.ms", "30000"); props.put("key.deserializer", "org.apache.kafka.common.serialization.StringDeserializer"); props.put("value.deserializer", "org.apache.kafka.common.serialization.StringDeserializer"); props.put("key.serializer", "org.apache.kafka.common.serialization.StringSerializer"); props.put("value.serializer", "org.apache.kafka.common.serialization.StringSerializer"); props.put("auto.offset.reset", "earliest"); props.put(ProducerConfig.BATCH_SIZE_CONFIG, 1024*1024*5); //往kafka服务器提交消息间隔时间,0则当即提交不等待 props.put(ProducerConfig.LINGER_MS_CONFIG,0); //Kafka Reader KafkaConsumer<String, String> consumer = new KafkaConsumer<>(props); consumer.subscribe(Arrays.asList("test")); consumer.seek(new TopicPartition("test", 1), 1); while (true) { ConsumerRecords<String, String> records = consumer.poll(2000); System.out.println("-------------"+new Date()); for (ConsumerRecord<String, String> record : records) System.out.printf("offset = %d, key = %s, value = %s\n", record.offset(), record.key(), record.value()); } /* //KafkaWriter KafkaProducer<String, String> productor = new KafkaProducer<>(props); productor.send(new ProducerRecord<String, String>("test", "aaa", "xiaoxiaoxiao2018")); */ } }
名词解释:
bootstrap.servers:Kafka集群链接串,能够由多个host:port组成【your.host.name:9092】