0

我从不同的 Kafka 主题中读取了多个 OpaqueTridentKafkaSpout。我希望所有这些流中的数据通过同一组函数。实现这一目标的最佳方法是什么。我是否需要创建单独的流并将每个元组再次传递给同一组函数。像下面这样?

BrokerHosts zk = new ZkHosts(getZooKeeperHosts());
TridentKafkaConfig spoutConf = new TridentKafkaConfig(zk, "Test");
spoutConf.scheme = new SchemeAsMultiScheme(new StringScheme());
TridentKafkaConfig spoutConf1 = new TridentKafkaConfig(zk, "Test1");
spoutConf1.scheme = new SchemeAsMultiScheme(new StringScheme());
OpaqueTridentKafkaSpout kafkaSpout1 = new OpaqueTridentKafkaSpout(spoutConf1);

topology.newStream("event", kafkaSpout).each(new Fields("document"), new ExtractDocumentInfo(), new Fields("id", "index", "type"));
topology.newStream("event1", kafkaSpout1).each(new Fields("document"), new ExtractDocumentInfo(), new Fields("id", "index", "type"));
4

1 回答 1

0

您可以将流合并在一起,但任何失败都会导致两个 spout 重播批处理。

于 2015-08-09T15:45:52.660 回答