now deployment failed on first ansible job with:
<localhost> ssh_retry: attempt: 8, ssh return code is 255. cmd (['ssh', '-o', 'UserKnownHostsFile=/dev/null', '-o', 'StrictHostKeyChecking=no', '-o', 'ControlMaster=auto', '-o', 'ControlPersist=30m', '-o', 'ServerAliveInterval=5', '-o', 'ServerAliveCountMax=5', '-o', 'IdentityFile="/var/lib/mistral/.ssh/tripleo-admin-rsa"', '-o', 'KbdInteractiveAuthentication=no', '-o', 'PreferredAuthentications=gssapi-with-mic,gssapi-keyex,hostbased,publickey', '-o', 'PasswordAuthentication=no', '-o', 'User="tripleo-admin"', '-o', 'ConnectTimeout=30', '-o', 'ControlPath=/var/lib/mistral/C104/ansible-ssh/ff320dd376', 'localhost', "/bin/sh -c '/usr/bin/python2 && sleep 0'"]...), pausing for 30 seconds
fatal: [undercloud]: UNREACHABLE! => {"changed": false, "msg": "SSH Error: data could not be sent to remote host \"localhost\". Make sure this host can be reached over ssh", "unreachable": true}
interesting, that it fails to connect to localhost and fails, I have checked and
[stack@undercloud104 ~]$ ssh -l stack localhost
The authenticity of host 'localhost (::1)' can't be established.
ECDSA key fingerprint is SHA256:0/Axj7n0cQU9eKCFisOpI2HeaOZeI05RhNa/qT/2/2A.
ECDSA key fingerprint is MD5:fa:e4:df:f5:8b:63:41:ae:c3:a3:2d:7d:55:2d:7f:65.
Are you sure you want to continue connecting (yes/no)? yes
Warning: Permanently added 'localhost' (ECDSA) to the list of known hosts.
Last login: Thu Oct 3 09:04:45 2019 from 10.120.123.5
[stack@undercloud104 ~]$ logout
Connection to localhost closed.
[stack@undercloud104 ~]$
Also from mistral container:
)[mistral@undercloud104 /]$ ssh -l stack -i /var/lib/mistral/.ssh/tripleo-admin-rsa localhost
The authenticity of host 'localhost (::1)' can't be established.
<...>
Last login: Thu Oct 3 09:05:32 2019 from ::1
[stack@undercloud104 ~]$
any ideas?
If I should send this to a different group/page, please let me know.
Thank you
just now noticed, I do not have controller! Even I have 1 in node-info.yaml:
(undercloud) [stack@undercloud104 ~]$ less c104/node-info.yaml
parameter_defaults:
OvercloudControllerFlavor: control
OvercloudComputeFlavor: compute
ControllerCount: 1
ComputeCount: 3
(undercloud) [stack@undercloud104 ~]$ openstack baremetal node list -c UUID -c "Instance UUID" -c "Power State" -c "Provisioning State"
+--------------------------------------+---------------+-------------+--------------------+
| UUID | Instance UUID | Power State | Provisioning State |
+--------------------------------------+---------------+-------------+--------------------+
| 7162d96f-65c9-4f87-b9ef-982e75dc8abc | None | power off | available |
| db52c6da-67dd-4e21-baa0-455937822300 | None | power off | available |
| e5420f96-05c5-4ebb-b9a6-499b8b9b6841 | None | power off | available |
| dcb47213-f2fa-48a9-bf1a-2d2ebdf13784 | None | power off | available |
| 8bc1e8e4-9c51-44bf-81be-b1dd1d19dec4 | None | power off | available |
| c9687294-c336-481c-b0a3-1464d4209ba9 | None | power off | available |
| 2f398a0b-a73c-4bc9-affd-df62e1eaa262 | None | power off | available |
| b011dfb9-aa3c-4f9f-ade8-88ee96f8ae16 | None | power on | active |
| f2fb7e39-73f5-42da-a3d1-03f16fa6457e | None | power off | available |
| c75e7582-1f1f-424b-8614-2110cc0a7539 | None | power on | active |
| f4ed164a-56d3-4536-a665-aa626b9346b9 | None | power off | available |
| 44d9d25b-3e88-42d4-b17f-770b262584bb | None | power off | available |
| 3bccd4ae-4e1c-419f-b2b1-124af13e4fce | None | power off | available |
| 83857f28-8b2f-4d08-9354-0f9488867d62 | None | power off | available |
| 095da91b-bba9-4a49-a7b0-2af00f26f309 | None | power on | active |
| 9d3475c6-4cdf-4406-8df8-beaff1a1db45 | None | power off | available |
+--------------------------------------+---------------+-------------+--------------------+
I believe it is a cause, but not sure, how to cure it, try to redeploy it now.
Hi team.
I am using CentOS7 + Openstack-stein repo and " OpenStack stein Trunk Tested".
yum repolist:
!base/7/x86_64
!centos-ceph-nautilus/7/x86_64
!centos-nfs-ganesha28/7/x86_64
!centos-openstack-stein/7/x86_64
!centos-qemu-ev/7/x86_64
!extras/7/x86_64
!rdo-trunk-stein-tested
!updates/7/x86_64
When doing deployment everything looks promising but failing with the last steps, according to error, looks like ansible version error.
QUESTION:
What I can check more, to debug. I am stuck now.
files used for install:
(undercloud) [stack@undercloud104 ~]$ ls -ld *
drwxrwxr-x. 3 stack stack 4096 Oct 2 09:43 c104
-rw-r--r--. 1 stack stack 4683 Sep 30 12:19 cert.2019-09-06.pem
-rw-rw-r--. 1 stack stack 543 Oct 1 14:21 deploy.sh
drwxrwxr-x. 17 stack stack 4096 Sep 30 12:58 generated-openstack-tripleo-heat-templates
-rw-rw-r--. 1 stack stack 8632 Sep 30 09:30 hosts.json
-rw-rw-r--. 1 stack stack 0 Oct 1 14:16 install-undercloud.log
drwxr-xr-x. 2 stack stack 4096 Sep 27 15:25 repos
lrwxrwxrwx. 1 stack stack 58 Sep 30 14:11 scripts -> generated-openstack-tripleo-heat-templates/network/scripts
-rw-------. 1 stack stack 775 Sep 30 17:42 stackrc
drwxrwxr-x. 2 stack stack 40 Sep 27 11:17 tripleo-config-generated-env-files
-rw-------. 1 stack root 9697 Sep 30 17:18 tripleo-undercloud-passwords.yaml
-rw-------. 1 stack root 2250 Sep 30 17:18 undercloud-passwords.conf
-rw-r--r--. 1 stack stack 14405 Sep 30 16:57 undercloud.conf
(undercloud) [stack@undercloud104 ~]$ ls -ld repos/*
-rw-r--r--. 1 stack stack 1664 Sep 27 15:25 repos/CentOS-Base.repo
-rw-r--r--. 1 stack stack 1309 Sep 27 15:25 repos/CentOS-CR.repo
-rw-r--r--. 1 stack stack 956 Sep 27 15:25 repos/CentOS-Ceph-Nautilus.repo
-rw-r--r--. 1 stack stack 649 Sep 27 15:25 repos/CentOS-Debuginfo.repo
-rw-r--r--. 1 stack stack 630 Sep 27 15:25 repos/CentOS-Media.repo
-rw-r--r--. 1 stack stack 715 Sep 27 15:25 repos/CentOS-NFS-Ganesha-28.repo
-rw-r--r--. 1 stack stack 1290 Sep 27 15:25 repos/CentOS-OpenStack-stein.repo
-rw-r--r--. 1 stack stack 612 Sep 27 15:25 repos/CentOS-QEMU-EV.repo
-rw-r--r--. 1 stack stack 1331 Sep 27 15:25 repos/CentOS-Sources.repo
-rw-r--r--. 1 stack stack 353 Sep 27 15:25 repos/CentOS-Storage-common.repo
-rw-r--r--. 1 stack stack 6639 Sep 27 15:25 repos/CentOS-Vault.repo
-rw-r--r--. 1 stack stack 314 Sep 27 15:25 repos/CentOS-fasttrack.repo
(undercloud) [stack@undercloud104 ~]$
Last lines in output were:
Removing short term keys locally
Enabling ssh admin - COMPLETE.
Waiting for messages on queue 'tripleo' with no timeout.
Config downloaded at /var/lib/mistral/C104
The action raised an exception [action_ex_id=49561c17-928d-4d66-a2ad-466b57c13253, action_cls='<class 'mistral.actions.action_factory.AnsibleGenerateInventoryAction'>', attributes='{}', params='{u'work_dir': u'/var/lib/mistral/C104', u'ansible_python_interpreter': None, u'ansible_ssh_user': u'tripleo-admin', u'undercloud_key_file': u'/var/lib/mistral/.ssh/tripleo-admin-rsa', u'plan_name': u'C104', u'ssh_network': u'ctlplane'}']
list index out of range
Overcloud configuration failed.
(undercloud) [stack@undercloud104 ~]$
--
Ruslanas Gžibovskis
+370 6030 7030
--
Ruslanas Gžibovskis
+370 6030 7030
--
Ruslanas Gžibovskis
+370 6030 7030