我在一个名为"a"的名称空间中有一个简单的荚,在命名空间"b"中有另一个荚.
我还有一个测试脚本,用于从"a"到"b"进行grpc调用。
我想是DNS解析中的一些错误,但我真的不能前进。
有什么帮助吗?
kubectl exec -n a somepod-f647b7d95-mrvfr cat /etc/resolv.conf
nameserver 10.12.0.10
search chimera.svc.cluster.local svc.cluster.local cluster.local c.<company-name>-staging.internal <provider>.internal
options ndots:5kubectl get pods -n kube-system
event-exporter-v0.2.4-6d4c69fbfb-f4xpf 1/1 Running 0 24d
fluentd-gcp-scaler-6965bb45c9-mzvw6 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-2m2bf 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-2v6bq 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-4xpbc 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-7g5hm 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-8mqvc 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-f9hrs 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-fr58c 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-hzrsb 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-kq8hc 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-kt6p5 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-nsztm 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-qcl4r 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-qggv9 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-qkkp5 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-rm9hn 1/1 Running 0 5d5h
fluentd-gcp-v3.2.0-sv52h 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-t75fp 1/1 Running 0 7d6h
fluentd-gcp-v3.2.0-v49fv 1/1 Running 0 7d6h
kube-dns-6cd7bbdf65-jnntn 4/4 Running 0 24d
kube-dns-6cd7bbdf65-txmlj 4/4 Running 0 24d
kube-dns-autoscaler-8687c64fc-29jgq 1/1 Running 0 7d6h
kube-proxy-gke-iceberg-api-v2-201908101259587443-01f0b55b-q0k3 1/1 Running 0 217d
kube-proxy-gke-iceberg-api-v2-201908101259587443-0d661dfb-3zhx 1/1 Running 0 217d
kube-proxy-gke-iceberg-api-v2-201908101259587443-92bbd393-w96w 1/1 Running 1 115d
kube-proxy-gke-iceberg-es-single-202003021919386-1b520a2e-sn9m 1/1 Running 0 5d6h
kube-proxy-gke-iceberg-es-single-202003021919386-bf6046bf-7wsp 1/1 Running 0 5d5h
kube-proxy-gke-iceberg-es-single-202003021919386-d64daa4e-1jqz 1/1 Running 0 5d5h
kube-proxy-gke-iceberg-general-20190810125958886-21ed2623-4m0p 1/1 Running 0 217d
kube-proxy-gke-iceberg-general-20190810125958886-8b185cf9-x1j2 1/1 Running 0 217d
kube-proxy-gke-iceberg-general-20190810125958886-eaf63d3c-k338 1/1 Running 0 217d
kube-proxy-gke-iceberg-kafka-2019081012595876540-429586da-m2qf 1/1 Running 0 217d
kube-proxy-gke-iceberg-kafka-2019081012595876540-76ebb654-z7xx 1/1 Running 0 217d
kube-proxy-gke-iceberg-kafka-2019081012595876540-c3abee6e-4q76 1/1 Running 0 217d
kube-proxy-gke-iceberg-rabbitmq-2019081012595876-552d6676-8z2k 1/1 Running 0 217d
kube-proxy-gke-iceberg-rabbitmq-2019081012595876-662980f7-76jc 1/1 Running 0 217d
kube-proxy-gke-iceberg-rabbitmq-2019081012595876-b269df22-6zqj 1/1 Running 0 217d
kube-proxy-gke-iceberg-redis-2019081012595877180-38264a5e-c0ch 1/1 Running 0 217d
kube-proxy-gke-iceberg-redis-2019081012595877180-9412d5f5-pt3w 1/1 Running 0 217d
kube-proxy-gke-iceberg-redis-2019081012595877180-947dc20b-c002 1/1 Running 0 217d
kube-state-metrics-67b67d8fdd-nkpt4 2/2 Running 0 24d
l7-default-backend-fd59995cd-cvqwb 1/1 Running 0 24d
metrics-server-v0.3.1-5c8f664b95-sthjz发布于 2020-03-28 00:31:58
从您的描述来看,您正在运行KubeDNS。我给您的第一条建议是迁移 to CoreDNS,因为KubeDNS在弃用路径上。
第二,有两件事突然出现在我身上。
首先,请您对A和B荚正在运行的两个命名空间进行kubectl get svc -n NAMESPACE,并确认已经创建了ClusterIP类型的服务并将IP地址与服务相关联?
其次,您能否尝试通过指定以下FQDN格式从应用程序A到应用程序B的服务进行连接尝试?
some-service.b.svc.cluster.local
注意svc部分。你在行动中提到了some-service.b.cluster.local。
最后,如果一切恢复正常,我们可以开始对kube-dns进行故障排除。看起来这三个吊舱都在运行。但是,您是否尝试过描述和/或抓起他们的原木?如果有什么看上去有趣的东西,你能试一试并分享一下总结吗?
kubectl describe pod -n kube-system kube-dns-6cd7bbdf65-jnntn
kubectl describe pod -n kube-system kube-dns-6cd7bbdf65-txmlj
kubectl describe pod -n kube-system kube-dns-autoscaler-8687c64fc-29jgqkubectl logs -n kube-system kube-dns-6cd7bbdf65-jnntn
kubectl logs -n kube-system kube-dns-6cd7bbdf65-txmlj
kubectl logs -n kube-system kube-dns-autoscaler-8687c64fc-29jgq我想logs命令将为您提供您要寻找的答案。如果你在这个问题上需要进一步的澄清或帮助,请告诉我。我很乐意帮忙。
https://stackoverflow.com/questions/60888402
复制相似问题