I'm seeing the following Kubernetes integration tests fail consistently, roughly 90% of the time, on RHEL 7.2, Fedora 24, and CentOS 7.1:
test/integration/garbagecollector
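test/integration/replicationcontroller

They appear to be caused by etcd failures. From what I've found online, an apiserver problem may also be involved. My setup is simple: I install and start docker, install go, clone the kubernetes repository from GitHub, run hack/install-etcd.sh from the repository and add it to my PATH, get ginkgo, gomega, and go-bindata, and then run 'make test-integration'. I don't manually change anything or add any custom files or configuration. Has anyone run into these issues and found a solution? The only mention of this problem I've seen online treated it as a flake, with no solution listed, but I hit it on almost every test run. Below is part of the errors; I can provide more if needed:

Garbage collector: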
\*many lines from garbagecollector.go that look good*
I0920 14:42:39.725768 11823 garbagecollector.go:479] create storage for resource { v1 secrets}
I0920 14:42:39.725786 11823 garbagecollector.go:479] create storage for resource { v1 serviceaccounts}
I0920 14:42:39.725803 11823 garbagecollector.go:479] create storage for resource { v1 services}
I0920 14:43:09.565529 11823 trace.go:61] Trace "List *rbac.ClusterRoleList" (started 2016-09-20 14:42:39.565113203 -0400 EDT):
[2.564µs] [2.564µs] About to list etcd node
[30.000353492s] [30.000350928s] Etcd node listed
[30.000361771s] [8.279µs] END
E0920 14:43:09.566770 11823 cacher.go:258] unexpected ListAndWatch error: pkg/storage/cacher.go:198: Failed to list *rbac.RoleBinding: client: etcd cluster is unavailable or misconfigured
\*repeats over and over with different thing failed to list*

Replication controller:
I0920 14:35:16.907283 10482 replication_controller.go:481] replication controller worker shutting down
I0920 14:35:16.907293 10482 replication_controller.go:481] replication controller worker shutting down
I0920 14:35:16.907298 10482 replication_controller.go:481] replication controller worker shutting down
I0920 14:35:16.907303 10482 replication_controller.go:481] replication controller worker shutting down
I0920 14:35:16.907307 10482 replication_controller.go:481] replication controller worker shutting down
E0920 14:35:16.948417 10482 util.go:45] Metric for replication_controller already registered
--- FAIL: TestUpdateLabelToBeAdopted (30.07s)
replicationcontroller_test.go:270: Failed to create replication controller rc: Timeout: request did not complete within allowed duration
E0920 14:44:06.820506 12053 storage_rbac.go:116] unable to initialize clusterroles: client: etcd cluster is unavailable or misconfigured

There aren't even any files in /var/log that start with kube.

Thanks in advance!
Posted on 2016-09-21 23:39:18

I raised the limit on the number of file descriptors and haven't run into this problem since. So I'd say this is resolved.
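
For anyone wanting to check the same thing, here is a minimal sketch of inspecting and raising the open-file limit from Python's standard `resource` module. The answer does not say how the limit was raised or to what value, so the helper name `raise_nofile_limit` and the target of 65536 are assumptions for illustration only, not the values actually used.

```python
import resource


def raise_nofile_limit(target=65536):
    """Raise the soft open-file limit toward `target` (a hypothetical value).

    The soft limit is never lowered and never pushed past the hard limit,
    which an unprivileged process cannot exceed.
    Returns the (soft, hard) pair in effect afterwards.
    """
    soft, hard = resource.getrlimit(resource.RLIMIT_NOFILE)

    # Cap the requested value at the hard limit unless the hard limit is unlimited.
    if hard == resource.RLIM_INFINITY:
        new_soft = target
    else:
        new_soft = min(target, hard)

    # Only ever raise; leave an already-unlimited soft limit alone.
    if soft != resource.RLIM_INFINITY and new_soft > soft:
        resource.setrlimit(resource.RLIMIT_NOFILE, (new_soft, hard))

    return resource.getrlimit(resource.RLIMIT_NOFILE)


if __name__ == "__main__":
    before = resource.getrlimit(resource.RLIMIT_NOFILE)
    after = raise_nofile_limit()
    print("open-file limit before:", before, "after:", after)
```

A limit changed this way applies only to the current process and the children it starts, so the test run would have to be launched from the same process (for example with `subprocess`) to be affected. Raising the limit in the shell or system configuration before running 'make test-integration' is the more usual route.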
https://stackoverflow.com/questions/39601735