转载请注明原创地址:http://www.cnblogs.com/dongxiao-yang/p/7200971.html

最近需要研究flink-connector-kafka的消费行为,发现flink使用了kafka consumer一个比较底层一点的assign接口而不是之前比较常用的subscirbe,于是研究下二者之间的差异。

首先看api文档:http://kafka.apache.org/0110/javadoc/index.html?org/apache/kafka/clients/consumer/KafkaConsumer.html

public void assign(Collection<TopicPartition> partitions)

Manually assign a list of partitions to this consumer. This interface does not allow for incremental assignment and will replace the previous assignment (if there is one). If the given list of topic partitions is empty, it is treated the same as unsubscribe().

Manual topic assignment through this method does not use the consumer's group management functionality. As such, there will be no rebalance operation triggered when group membership or cluster and topic metadata change. Note that it is not possible to use both manual partition assignment with assign(Collection) and group assignment with subscribe(Collection, ConsumerRebalanceListener).

与subscirbe方法不同,assign方法由用户直接手动consumer实例消费哪些具体分区,根据api上述描述,assign的consumer不会拥有kafka的group management机制,也就是当group内消费者数量变化的时候不会有reblance行为发生。assign的方法不能和subscribe方法同时使用。

然后看一下具体实现源码:

public class KafkaConsumer<K, V> implements Consumer<K, V>{
...........
private final SubscriptionState subscriptions; public void subscribe(Pattern pattern, ConsumerRebalanceListener listener) {
...........
this.subscriptions.subscribe(new HashSet<>(topics), listener);
...........
} ......
public void assign(Collection<TopicPartition> partitions) {
...........
this.subscriptions.assignFromUser(new HashSet<>(partitions));
...........
}

每一个KafkaConsumer实例内部都拥有一个SubscriptionState对象,subscribe内部调用了subscribe方法,assign内部调用了assignFromUser方法,看一下这两个方法的具体实现:

    private enum SubscriptionType {
NONE, AUTO_TOPICS, AUTO_PATTERN, USER_ASSIGNED
} private void setSubscriptionType(SubscriptionType type) {
if (this.subscriptionType == SubscriptionType.NONE)
this.subscriptionType = type;
else if (this.subscriptionType != type)
throw new IllegalStateException(SUBSCRIPTION_EXCEPTION_MESSAGE);
} public void subscribe(Set<String> topics, ConsumerRebalanceListener listener) {
if (listener == null)
throw new IllegalArgumentException("RebalanceListener cannot be null"); setSubscriptionType(SubscriptionType.AUTO_TOPICS); this.listener = listener; changeSubscription(topics);
} public void assignFromUser(Set<TopicPartition> partitions) {
setSubscriptionType(SubscriptionType.USER_ASSIGNED); if (!this.assignment.partitionSet().equals(partitions)) {
fireOnAssignment(partitions); Map<TopicPartition, TopicPartitionState> partitionToState = new HashMap<>();
for (TopicPartition partition : partitions) {
TopicPartitionState state = assignment.stateValue(partition);
if (state == null)
state = new TopicPartitionState();
partitionToState.put(partition, state);
}
this.assignment.set(partitionToState);
this.needsFetchCommittedOffsets = true;
}
}

由上述代码可知,SubscriptionState 内部拥有一个SubscriptionType类型的枚举变量subscriptionType,枚举共拥有NONE, AUTO_TOPICS, AUTO_PATTERN, USER_ASSIGNED四种状态类型,subscribe方法会把subscriptionType状态设置为AUTO_TOPICS,assignFromUser会设置为USER_ASSIGNED。尤其是setSubscriptionType设置枚举的方法内部:else if (this.subscriptionType != type) throw new IllegalStateException(SUBSCRIPTION_EXCEPTION_MESSAGE); 代码保证了,如果同一个consumer已经调用了某一种订阅模式,再次试图更改为另一种模式的时候程序会直接抛出错误。

poll方法调用情况下的不同实现

上述两种模式初始化的consumer在fetch数据的时候调用的是同样的poll方法,每次poll会调用pollOnce方法内的

coordinator.poll(time.milliseconds());

具体源码如下

public void poll(long now) {
invokeCompletedOffsetCommitCallbacks(); if (subscriptions.partitionsAutoAssigned() && coordinatorUnknown()) {
ensureCoordinatorReady();
now = time.milliseconds();
} if (needRejoin()) {
// due to a race condition between the initial metadata fetch and the initial rebalance,
// we need to ensure that the metadata is fresh before joining initially. This ensures
// that we have matched the pattern against the cluster's topics at least once before joining.
if (subscriptions.hasPatternSubscription())
client.ensureFreshMetadata(); ensureActiveGroup();
now = time.milliseconds();
} pollHeartbeat(now);
maybeAutoCommitOffsetsAsync(now);
} public boolean needRejoin() {
if (!subscriptions.partitionsAutoAssigned())
return false; // we need to rejoin if we performed the assignment and metadata has changed
if (assignmentSnapshot != null && !assignmentSnapshot.equals(metadataSnapshot))
return true; // we need to join if our subscription has changed since the last join
if (joinedSubscription != null && !joinedSubscription.equals(subscriptions.subscription()))
return true; return super.needRejoin();
}


public boolean partitionsAutoAssigned() {


return this.subscriptionType == SubscriptionType.AUTO_TOPICS || this.subscriptionType == SubscriptionType.AUTO_PATTERN;


}

 

ConsumerCoordinator的poll方法会判断consumer的前述SubscriptionType类型,只有类型是AUTO_TOPICS或者AUTO_PATTERN(subscribe方法的另一种参数重载),才会与ConsumerCoordinator进行交互,判断是否需要reblance等行为。

所以正如api文档描述,assign的consumer不会拥有kafka的group management机制,也就是当group内消费者数量变化的时候不会有reblance行为发生。








最新文章

  1. centos7查看系统版本,查看机器位数x86-64
  2. AFN中的PATCH 和 DELETE 请求方式
  3. GLSL Interface Block参考
  4. [转]MySQL5.6新特性之Multi-Range Read
  5. version `GLIBC_2.17&#39; not found
  6. oracle 启用归档日志
  7. excel 的一些操作
  8. BZOJ_3270_博物馆_(高斯消元+期望动态规划+矩阵)
  9. 【插件】WordPress缓存最佳组合:DB Cache Reloaded Fix + Hyper Cache
  10. [shell]Shell经常使用特殊符号
  11. RAC之RMAN恢复
  12. hdu 5954 -- Do not pour out(积分+二分)
  13. NOIP-火柴棒等式
  14. Asp.Net百度站长工具的主动推送功能
  15. href和src区别
  16. java int and Integer
  17. 78. Subsets(M) &amp; 90. Subsets II(M) &amp; 131. Palindrome Partitioning
  18. 动态计算UITableViewCell高度详解
  19. InFusion错误类型分析
  20. 七 使用list和tuple

热门文章

  1. 2014 非常好用的开源 Android 测试工具
  2. Scrum之成败——从自身案例说起,仅供参考
  3. POJ 2311 Cutting Game (Multi-Nim)
  4. 【数学期望】【高斯消元】bzoj3143 [Hnoi2013]游走
  5. 【最大权闭合子图/最小割】BZOJ3438-小M的作物【待填】
  6. 开发板无法ping通虚拟机的问题解决一例
  7. expdp impdp 数据库导入导出命令详解
  8. 自定义WebViewPage,实现Url.Action生成绝对地址
  9. 数组中的push()和pop()方法
  10. react项目如何调试?