I am trying to install OpenShift Origin with Ansible. Running deploy_cluster.yml fails with the following error:
TASK [openshift_control_plane : Report control plane errors] ***********************************************************************************************************
fatal: [masterserver.srv.com]: FAILED! => {"changed": false, "msg": "Control plane pods didn't come up"}
NO MORE HOSTS LEFT *****************************************************************************************************************************************************
PLAY RECAP *************************************************************************************************************************************************************
localhost : ok=11 changed=0 unreachable=0 failed=0 skipped=5 rescued=0 ignored=0
masterserver.srv.com : ok=295 changed=44 unreachable=0 failed=1 skipped=233 rescued=0 ignored=4
nodeserver.srv.com : ok=103 changed=16 unreachable=0 failed=0 skipped=88 rescued=0 ignored=0
INSTALLER STATUS *******************************************************************************************************************************************************
Initialization : Complete (0:02:49)
Health Check : Complete (0:00:36)
Node Bootstrap Preparation : Complete (0:09:55)
etcd Install : Complete (0:02:05)
Master Install : In Progress (0:42:42)
This phase can be restarted by running: playbooks/openshift-master/config.yml
Failure summary:
1. Hosts: masterserver.srv.com
Play: Configure masters
Task: Report control plane errors
Message: Control plane pods didn't come up
Description of my environment:
root@ ~# ansible --version
ansible 2.9.2
config file = /etc/ansible.cfg
configured module search path = [u'/root/.ansible/plugins/…', u'/usr/…/ansible/plugins/…']
ansible python module location = /usr/lib/python2.7/site-packages/ansible
executable location = /usr/bin/ansible
python version = 2.7.5 (default, Apr 11 2008, 07:36:10) [GCC 4.8.5 20150623 (Red Hat 4.8.5-28)]
root@ ~# cat /etc/os-release
NAME="CentOS Linux"
VERSION="7 (Core)"
ID="centos"
VERSION_ID="7"
PRETTY_NAME="CentOS Linux 7 (Core)"
ANSI_COLOR="0;31"
CPE_NAME="cpe:/o:centos:centos:7"
HOME_URL="https://www.centos.org/"
BUG_REPORT_URL="https://bugs.centos.org/"
CENTOS_MANTISBT_PROJECT_VERSION="7"
REDHAT_SUPPORT_PRODUCT="centos"
[root@Server ~]# docker version
Client:
Version: 1.13.1
API version: 1.26
Go version: go1.10.3
Git commit: 7f2769b/1.13.1
Built: Sun Sep 15 14:06:47 2019
OS/Arch: linux/amd64
Server:
Version: 1.13.1
API version: 1.26 (minimum version 1.12)
Package version: docker-1.13.1-103.git7f2769b.el7.centos.x86_64
Go version: go1.10.3
Git commit: 7f2769b/1.13.1
Built: Sun Sep 15 14:06:47 2019
OS/Arch: linux/amd64
Experimental: false
Steps I took:
Additional information:
[root@masterserver ~]# cat /etc/ansible/hosts
[OSEv3:children]
masters
nodes
etcd
[OSEv3:vars]
ansible_ssh_user=origin
ansible_become=true
openshift_deployment_type=origin
openshift_master_identity_providers=[{'name': 'htpasswd_auth', 'login': 'true', 'challenge': 'true', 'kind': 'HTPasswdPasswordIdentityProvider'}]
openshift_master_default_subdomain=apps-masterserver.srv.com
openshift_disable_check=disk_availability,docker_storage,memory_availability,docker_image_availability
openshift_master_api_port=8443
openshift_master_console_port=8443
osm_etcd_image=registry.access.redhat.com/rhel7/etcd:3.2.22
[masters]
masterserver.srv.com
[etcd]
masterserver.srv.com
[nodes]
masterserver.srv.com openshift_node_group_name='node-config-master-infra'
nodeserver.srv.com openshift_node_group_name='node-config-compute'
[root@masterserver ~]# hostname
masterserver.srv.com
[root@masterserver ~]# oc get nodes
The connection to the server masterserver.srv.com:8443 was refused - did you specify the right host or port?
[root@masterserver ~]# netstat -ntlp
Active Internet connections (only servers)
Proto Recv-Q Send-Q Local Address           Foreign Address         State       PID/Program name
tcp        0      0 0.0.0.0:8444            0.0.0.0:*               LISTEN      1700/openshift
tcp        0      0 127.0.0.1:44642         0.0.0.0:*               LISTEN      1407/hyperkube
tcp        0      0 192.168.43.50:2379      0.0.0.0:*               LISTEN      1647/etcd
tcp        0      0 192.168.43.50:2380      0.0.0.0:*               LISTEN      1647/etcd
tcp        0      0 0.0.0.0:111             0.0.0.0:*               LISTEN      1/systemd
tcp        0      0 172.17.0.1:53           0.0.0.0:*               LISTEN      1024/dnsmasq
tcp        0      0 192.168.43.50:53        0.0.0.0:*               LISTEN      1024/dnsmasq
tcp        0      0 0.0.0.0:22              0.0.0.0:*               LISTEN      1029/sshd
tcp        0      0 127.0.0.1:25            0.0.0.0:*               LISTEN      1166/master
tcp6       0      0 :::10250                :::*                    LISTEN      1407/hyperkube
tcp6       0      0 :::111                  :::*                    LISTEN      1/systemd
tcp6       0      0 fe80::a00:27ff:…:53     :::*                    LISTEN      1024/dnsmasq
tcp6       0      0 :::22                   :::*                    LISTEN      1029/sshd
tcp6       0      0 ::1:25                  :::*                    LISTEN      1166/master
[root@masterserver ~]# cat /etc/hosts
127.0.0.1   localhost localhost.localdomain localhost4 localhost4.localdomain4
::1         localhost localhost.localdomain localhost6 localhost6.localdomain6
192.168.43.51 nodeserver.srv.com
192.168.43.50 masterserver.srv.com
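One thing that stands out in the netstat output: there is a listener on 8444 but none on 8443, the port set as openshift_master_api_port in the inventory, which is consistent with the "connection refused" from oc get nodes. A minimal sketch for checking this from a script follows. The helper name port_listening is hypothetical, and it assumes `netstat -ntl` or `ss -ntl` style output, where the local address is the fourth column.

```shell
#!/usr/bin/env bash

# port_listening PORT
# Reads `netstat -ntl` / `ss -ntl` style output on stdin and returns 0
# if some LISTEN socket has a local address ending in ":PORT", else 1.
# (Hypothetical helper; assumes the local address is column 4.)
port_listening() {
  local port=$1
  awk -v port="$port" '
    /LISTEN/ && $4 ~ (":" port "$") { found = 1 }
    END { exit (found ? 0 : 1) }
  '
}
```

It could be used on the master as `netstat -ntl | port_listening 8443 || echo "nothing is listening on 8443"`. If the port is missing, `docker ps -a` on the master is a reasonable next place to look for exited control plane containers.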
The control plane pods did not come up, so the installation got stuck. This is one of the errors printed while Ansible was running:
```
WARNING: Error in atexit._run_exitfuncs:
Traceback (most recent call last):
  File "/usr/lib64/python2.7/atexit.py", line 24, in _run_exitfuncs
    func(*targs, **kargs)
  File "/tmp/ansible_oc_obj_payload_h6RqDy/ansible_oc_obj_payload.zip/ansible/modules/oc_obj.py", line 1257, in cleanup
AttributeError: 'NoneType' object has no attribute 'path'
Error in sys.exitfunc:
Traceback (most recent call last):
  File "/usr/lib64/python2.7/atexit.py", line 24, in _run_exitfuncs
    func(*targs, **kargs)
  File "/tmp/ansible_oc_obj_payload_h6RqDy/ansible_oc_obj_payload.zip/ansible/modules/oc_obj.py", line 1257, in cleanup
AttributeError: 'NoneType' object has no attribute 'path'
```
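Can anyone help me solve this? Thanks.
Answered 2020-02-18 07:44:40
Solved! I moved my environment to a higher specification. Log entries in /var/log/messages (Recording NodeHasSufficientResources) pointed at the resources I was using before: 1 vCPU and 2 GB RAM (master + infra 1, compute 1).
I now use 2 vCPUs and 8 GB RAM (master + infra 1, compute 1), and it works fine!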
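For anyone who wants to check sizing before rerunning the playbook, here is a minimal sketch. The thresholds (2 vCPUs, 8 GB) are simply the values reported to work in this answer, not official minimums, and the function names are made up for illustration. Note that the inventory in the question lists memory_availability in openshift_disable_check, so the installer's own health check would presumably not have flagged the low memory.

```shell
#!/usr/bin/env bash

# Assumption: thresholds mirror the sizing that worked in this answer,
# not the official OpenShift requirements.
MIN_CPUS=2
MIN_MEM_GIB=8

# Print MemTotal in KiB from a meminfo-style file (default: /proc/meminfo).
mem_total_kb() {
  awk '/^MemTotal:/ { print $2 }' "${1:-/proc/meminfo}"
}

# check_resources CPUS MEM_KB
# Prints one line per shortfall; returns 0 if both thresholds are met, 1 otherwise.
check_resources() {
  local cpus=$1 mem_kb=$2 status=0
  # MemTotal is a little below the installed RAM, so allow 10% slack.
  local required_kb=$(( MIN_MEM_GIB * 1024 * 1024 * 9 / 10 ))
  if [ "$cpus" -lt "$MIN_CPUS" ]; then
    echo "CPU: have $cpus, want at least $MIN_CPUS"
    status=1
  fi
  if [ "$mem_kb" -lt "$required_kb" ]; then
    echo "RAM: have $(( mem_kb / 1024 )) MiB, want about $MIN_MEM_GIB GiB"
    status=1
  fi
  return "$status"
}

# Example on a Linux host:
#   check_resources "$(nproc)" "$(mem_total_kb)" && echo "resources look sufficient"
```

Running it on each host before the playbook gives a quick yes/no; the 10% slack is a design choice so that a machine sold as "8 GB" is not rejected because the kernel reports slightly less.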
https://stackoverflow.com/questions/60180660