Question: Prometheus (node_exporter) problem after upgrading GKE from 1.15 to 1.16

Stack Overflow user

Asked 2020-09-04 21:14:52

Answers: 1 · Views: 519 · Followers: 0 · Votes: 1

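I have been running the Prometheus and Grafana applications on Kubernetes in Google GKE for several months. In Grafana, for example, I used to monitor container_cpu_usage_seconds_total.

But since I upgraded my GKE nodes from 1.15 to 1.16, I have lost the container_* metrics.

To test this, I created a new cluster on version 1.15. I installed Prometheus from the Google Marketplace and upgraded GKE step by step until the problem appeared. Again, the container_* metrics stopped at version 1.16.

Here you can see container_cpu_usage_seconds_total; it stopped when I upgraded the nodes. There are 3 nodes.

Am I the only one with this problem? Has anyone found a solution?

Thanks for your help :)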

Valentin

kubernetes

google-kubernetes-engine

prometheus-node-exporter

1 Answer

Stack Overflow user

Accepted answer

Posted 2020-09-23 17:42:03

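I found what was wrong. With Docker or Kubernetes, node-exporter does not export pod metrics (container_*). cAdvisor has to be installed (in the Google Marketplace deployment, cAdvisor is installed inside the node-exporter image). Since Kubernetes 1.16, that cAdvisor configuration is wrong. You need to edit the configuration to fix the problem.

All the details are in this post: Prometheus not receiving metrics from cadvisor in GKE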
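Before and after editing the configuration, it can help to confirm whether Prometheus is receiving any container_* series at all, and from which targets. The following is only a minimal sketch of such a check using the Prometheus HTTP API instant-query endpoint (`/api/v1/query`). The base URL `http://localhost:9090` (for example via a port-forward) and the helper names `build_query_url`, `series_per_label`, and `check` are assumptions made for illustration; they do not come from the original answer.

```python
import json
from urllib.parse import urlencode
from urllib.request import urlopen


def build_query_url(base_url: str, promql: str) -> str:
    """Build the instant-query URL for a Prometheus server."""
    return f"{base_url.rstrip('/')}/api/v1/query?{urlencode({'query': promql})}"


def series_per_label(payload: dict, label: str = "instance") -> dict:
    """Count the series in an instant-query response, grouped by one label."""
    if payload.get("status") != "success":
        raise ValueError(f"query failed: {payload.get('error', 'unknown error')}")
    counts = {}
    for sample in payload["data"]["result"]:
        key = sample["metric"].get(label, f"<no {label} label>")
        counts[key] = counts.get(key, 0) + 1
    return counts


def check(base_url: str, metric: str = "container_cpu_usage_seconds_total") -> dict:
    """Query a live server (hypothetical address) and summarise the result."""
    with urlopen(build_query_url(base_url, metric), timeout=10) as resp:
        return series_per_label(json.load(resp))
```

Calling `check("http://localhost:9090")` against a reachable server returns a dictionary keyed by the `instance` label. An empty dictionary means Prometheus currently holds no samples for that metric, which matches the symptom described in the question; a dictionary with fewer entries than nodes suggests only some targets are being scraped.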

Votes: 1

Original page content provided by Stack Overflow.

Original link:

https://stackoverflow.com/questions/63741959
