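I'm trying to set up ZooKeeper and Kafka as separate Kubernetes deployments/pods in a shared namespace. I've bootstrapped a local K8s 1.8 cluster with Calico via kubeadm on my Ubuntu sandbox...
For ZooKeeper, I'm using the zookeeper:3.4 image from hub.docker.com, and I created a Kubernetes deployment and service that expose ports 2181, 2888, and 3888. The service is named zookeeper, and I assume I should be able to use this hostname from pods in the same namespace.
For Kafka 1.0, I built my own container image that can be controlled through environment variables... I set zookeeper.connect to zookeeper:2181. I assumed Kubernetes DNS would resolve this name and open a connection to the service.
Unfortunately, I get: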
[2018-01-03 15:48:26,292] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
[2018-01-03 15:48:32,293] INFO Terminate ZkClient event thread. (org.I0Itec.zkclient.ZkEventThread)
[2018-01-03 15:48:46,286] INFO Opening socket connection to server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,299] INFO Socket connection established to zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,319] INFO Session establishment complete on server zookeeper.sandbox.svc.cluster.local/10.107.41.148:2181, sessionid = 0x10000603c560001, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:48:46,331] INFO Session: 0x10000603c560001 closed (org.apache.zookeeper.ZooKeeper)
[2018-01-03 15:48:46,333] FATAL Fatal error during KafkaServer startup. Prepare to shutdown (kafka.server.KafkaServer)
org.I0Itec.zkclient.exception.ZkTimeoutException: Unable to connect to zookeeper server 'zookeeper:2181' with timeout of 6000 ms
at org.I0Itec.zkclient.ZkClient.connect(ZkClient.java:1233)
at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:157)
at org.I0Itec.zkclient.ZkClient.<init>(ZkClient.java:131)
at kafka.utils.ZkUtils$.createZkClientAndConnection(ZkUtils.scala:115)
at kafka.utils.ZkUtils$.withMetrics(ZkUtils.scala:92)
at kafka.server.KafkaServer.initZk(KafkaServer.scala:346)
at kafka.server.KafkaServer.startup(KafkaServer.scala:194)
at kafka.server.KafkaServerStartable.startup(KafkaServerStartable.scala:38)
at kafka.Kafka$.main(Kafka.scala:92)
at kafka.Kafka.main(Kafka.scala)
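So I assumed there was a general networking problem in my cluster, but then I noticed something that confuses me even more... If I set zookeeper.connect to 10.107.41.148:2181 (the current address of the zookeeper service), the connection works (at least from Kafka to ZooKeeper).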
[2018-01-03 15:51:31,092] INFO Waiting for keeper state SyncConnected (org.I0Itec.zkclient.ZkClient)
[2018-01-03 15:51:31,094] INFO Opening socket connection to server 10.107.41.148/10.107.41.148:2181. Will not attempt to authenticate using SASL (unknown error) (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:51:31,105] INFO Socket connection established to 10.107.41.148/10.107.41.148:2181, initiating session (org.apache.zookeeper.ClientCnxn)
[2018-01-03 15:51:31,134] INFO Session establishment complete on server 10.107.41.148/10.107.41.148:2181, sessionid = 0x10000603c560005, negotiated timeout = 6000 (org.apache.zookeeper.ClientCnxn)
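With this setup, I can use the zookeeper service from the Kubernetes cluster host to run, for example, "bin/kafka-topics.sh --list --zookeeper 10.107.41.148:2181". Producing messages does not work... I assume that once the network is working properly, I will need to set the Kafka advertised address...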
kafka-console-producer.sh --broker-list 10.100.117.196:9092 --topic test1
>test-msg1
>[2018-01-03 17:05:35,689] WARN [Producer clientId=console-producer] Connection to node 0 could not be established. Broker may not be available. (org.apache.kafka.clients.NetworkClient)
Any hints on what is wrong with my Kubernetes network setup, or at least where to start troubleshooting?
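To narrow this down, one thing that could be run from inside the Kafka pod is a small probe that times name resolution and the TCP connect separately. In the first log, about 20 seconds pass between "Waiting for keeper state SyncConnected" and the socket being opened when the hostname is used, against a 6000 ms timeout, while the same step takes milliseconds with the IP, so slow DNS lookups are one possible suspect. This is only a minimal Python sketch, not part of my setup: the `probe` function and its result fields are hypothetical names for illustration, and it assumes Python 3 is available in the pod.

```python
import socket
import time


def probe(host, port, timeout=6.0):
    """Time DNS resolution and TCP connect separately for host:port.

    Hypothetical helper for illustration; returns a dict with timings
    in seconds and an error string (or None on success).
    """
    result = {
        "host": host,
        "port": port,
        "addresses": [],
        "dns_seconds": None,
        "connect_seconds": None,
        "error": None,
    }

    # Step 1: name resolution (goes through the pod's /etc/resolv.conf).
    start = time.monotonic()
    try:
        infos = socket.getaddrinfo(host, port, type=socket.SOCK_STREAM)
    except socket.gaierror as exc:
        result["dns_seconds"] = time.monotonic() - start
        result["error"] = f"DNS lookup failed: {exc}"
        return result
    result["dns_seconds"] = time.monotonic() - start
    result["addresses"] = sorted({info[4][0] for info in infos})

    # Step 2: plain TCP connect to the first resolved address.
    family, socktype, proto, _, sockaddr = infos[0]
    start = time.monotonic()
    try:
        with socket.socket(family, socktype, proto) as sock:
            sock.settimeout(timeout)
            sock.connect(sockaddr)
    except OSError as exc:
        result["error"] = f"TCP connect failed: {exc}"
    result["connect_seconds"] = time.monotonic() - start
    return result
```

Calling `probe("zookeeper", 2181)` and `probe("10.107.41.148", 2181)` from the Kafka pod and comparing `dns_seconds` between the two should show whether the delay sits in name resolution or in the connection itself.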
Thank you and best regards, Pavel