Topics in this installment:
1. Dynamic Batch Size
2. RateController analysis
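Spark Streaming adopts the following model. Data is sent to the Controller in batches, one every 100 ms. At first the Controller passes each batch straight to the JobGenerator, which generates a Job and hands it to the JobProcessor. The JobProcessor reports the job's statistics back to the Controller, and once the Controller has those statistics it dynamically adjusts the Batch Size of the data it sends on to subsequent jobs.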
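The feedback loop described above can be illustrated with a small Python model. This is only a sketch under stated assumptions, not Spark's Scala code: the class names `JobStats` and `Controller`, the initial batch size of 1000, and the proportional update rule (size the next batch from the observed throughput) are all hypothetical choices made for illustration.

```python
from dataclasses import dataclass


@dataclass
class JobStats:
    """Statistics reported back after a job finishes (hypothetical shape)."""
    num_records: int       # records contained in the finished job
    processing_ms: float   # time spent processing that job


class Controller:
    """Hypothetical controller that sizes the next batch from job statistics."""

    def __init__(self, interval_ms: float = 100.0, initial_batch_size: int = 1000):
        self.interval_ms = interval_ms
        self.batch_size = initial_batch_size

    def on_job_completed(self, stats: JobStats) -> int:
        # An empty or zero-duration job gives no throughput signal; keep the size.
        if stats.num_records <= 0 or stats.processing_ms <= 0:
            return self.batch_size
        # Observed throughput in records per millisecond.
        throughput = stats.num_records / stats.processing_ms
        # Assumed rule: pick a batch that can be processed within one interval.
        self.batch_size = max(1, int(throughput * self.interval_ms))
        return self.batch_size


controller = Controller()
# 1000 records took 200 ms, twice the 100 ms interval, so the batch shrinks.
print(controller.on_job_completed(JobStats(1000, 200.0)))  # 500
# 500 records took 50 ms, half the interval, so the batch can grow again.
print(controller.on_job_completed(JobStats(500, 50.0)))    # 1000
```

The point of the sketch is the direction of the adjustment: when a job takes longer than the batch interval the next batch gets smaller, and when it finishes early the next batch gets larger.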
RateController has two subclasses: ReceiverRateController and DirectKafkaRateController.
ReceiverRateController sends an UpdateReceiverRateLimit message to receiverTracker.
When ReceiverTracker receives the UpdateReceiverRateLimit message, it forwards an UpdateRateLimit message to ReceiverSupervisorImpl.
When ReceiverSupervisorImpl receives the UpdateRateLimit message, it calls BlockGenerator's updateRate method to control the rate at which data is received.
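The message chain above can be traced with a simplified Python model. The class and message names follow the text, but everything else is an assumption made for illustration: the message fields (`stream_id`, `new_rate`, `elements_per_second`), the `publish`, `register`, and `receive` methods, and the use of direct method calls in place of Spark's RPC messaging.

```python
from dataclasses import dataclass


@dataclass
class UpdateReceiverRateLimit:
    """Message to the tracker (field names are assumptions)."""
    stream_id: int
    new_rate: int


@dataclass
class UpdateRateLimit:
    """Message to the receiver supervisor (field name is an assumption)."""
    elements_per_second: int


class BlockGenerator:
    def __init__(self):
        self.rate = None

    def update_rate(self, new_rate: int) -> None:
        self.rate = new_rate


class ReceiverSupervisorImpl:
    def __init__(self, block_generators):
        self.block_generators = block_generators

    def receive(self, msg) -> None:
        # Step 3: apply the new limit to every registered block generator.
        if isinstance(msg, UpdateRateLimit):
            for generator in self.block_generators:
                generator.update_rate(msg.elements_per_second)


class ReceiverTracker:
    def __init__(self):
        self.supervisors = {}

    def register(self, stream_id: int, supervisor: ReceiverSupervisorImpl) -> None:
        self.supervisors[stream_id] = supervisor

    def receive(self, msg) -> None:
        # Step 2: forward the limit to the supervisor of the matching stream.
        if isinstance(msg, UpdateReceiverRateLimit):
            supervisor = self.supervisors.get(msg.stream_id)
            if supervisor is not None:
                supervisor.receive(UpdateRateLimit(msg.new_rate))


class ReceiverRateController:
    def __init__(self, stream_id: int, tracker: ReceiverTracker):
        self.stream_id = stream_id
        self.tracker = tracker

    def publish(self, rate: int) -> None:
        # Step 1: tell the tracker about the new rate for this stream.
        self.tracker.receive(UpdateReceiverRateLimit(self.stream_id, rate))


generator = BlockGenerator()
tracker = ReceiverTracker()
tracker.register(0, ReceiverSupervisorImpl([generator]))
ReceiverRateController(0, tracker).publish(500)
print(generator.rate)  # 500
```

In this model a rate published for a stream with no registered supervisor is silently dropped, which is a simplification chosen for the sketch.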
BlockGenerator extends RateLimiter.
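Because of this inheritance, calling updateRate on a BlockGenerator changes the limit that the same object enforces while buffering records. The Python sketch below illustrates the relationship only. The fixed one-second window, the `max_rate` ceiling, the `try_push` and `add_data` methods, and the choice to reject records over the limit (so the example runs without sleeping) are all assumptions of this sketch, not a description of Spark's implementation.

```python
class RateLimiter:
    """Hypothetical limiter: admits at most `rate` records per one-second window."""

    def __init__(self, rate: int, max_rate: int):
        self.max_rate = max_rate
        self.rate = min(rate, max_rate)
        self._window = None   # index of the current one-second window
        self._admitted = 0    # records admitted in the current window

    def update_rate(self, new_rate: int) -> None:
        # Ignore non-positive rates and never exceed the configured ceiling.
        if new_rate > 0:
            self.rate = min(new_rate, self.max_rate)

    def try_push(self, now_s: float) -> bool:
        window = int(now_s)
        if window != self._window:
            # A new second has started, so reset the counter.
            self._window, self._admitted = window, 0
        if self._admitted < self.rate:
            self._admitted += 1
            return True
        return False


class BlockGenerator(RateLimiter):
    """Buffers records, but only as fast as the inherited limiter allows."""

    def __init__(self, rate: int, max_rate: int):
        super().__init__(rate, max_rate)
        self.buffer = []

    def add_data(self, record, now_s: float) -> bool:
        if self.try_push(now_s):
            self.buffer.append(record)
            return True
        return False


generator = BlockGenerator(rate=2, max_rate=5)
print([generator.add_data(i, now_s=0.0) for i in range(3)])  # [True, True, False]
generator.update_rate(10)  # clamped to max_rate
print(generator.rate)      # 5
```

The timestamp is passed in explicitly so the behavior is deterministic; a real limiter would read the clock itself.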