[rdo-users] [TripleO] Undercloud unreachable

Phill. Whiteside phillwuk at gmail.com
Tue May 12 07:20:59 UTC 2020


Hi Ashish,

just my experiences... 16GB is not *really* enough, although if you
allocate 16GB swap it *should* install. As you're running on bare metal, if
it really complains, rather than chase it - just do a re-install. That way
you, and everyone else, knows you have a 'clean' installation. But, if the
extra RAM is due today, get the RAM in and do a re-install with 32GB swap.
The system will thank you for it by not being in the land of the "weird
errors".

Hope that helps,

Phill.

On Tue, 12 May 2020 at 07:09, Ashish Kurian <ashishbnv at gmail.com> wrote:

> Hello Yatin,
>
> I am deploying on a baremetal. The configuration is Centos 7. 16GB Ram. I
> hope this much of information about my machine configuration is sufficient.
> Let me know if you need more details like the kernel version or anything
> else.
>
> Following your suggestion, I tried with the Train release and the
> undercloud succeeded and the process went much further. However, the
> process failed (remained stuck) at deploying overcloud phase.
>
> Do you think this could be due to insufficient amount of memory? I have
> ordered for another 16GB of RAM and it will arrive today.
>
> Also, what is the correct procedure to rerun the deployment rather than
> starting from the scratch with a new Centos installation?
>
> Best Regards,
> Ashish Kurian
>
>
> On Mon, May 11, 2020 at 9:44 AM YATIN KAREL <yatinkarel at gmail.com> wrote:
>
>> Hi Ashish,
>>
>> On Sun, May 10, 2020 at 12:08 AM Ashish Kurian <ashishbnv at gmail.com>
>> wrote:
>>
>>>
>>> Helllo Folks,
>>>
>>> I am still waiting for some assistance from this group. Really cannot
>>> proceed without that.
>>>
>> Can u share more details wrt to your environment like which release you
>> are trying to deploy, you deploying on a baremetal or a vm, what's the
>> configuration of baremetal/vm, how you trying to deploy, etc so people on
>> list have more context.
>> Recently there was a bug wrt slow overcloud nodes
>> https://bugs.launchpad.net/tripleo/+bug/1873892 which is fixed in Train.
>>
>> In your case it's undercloud ssh failing, and i believe it's also due to
>> slow nodes(as you said you are able to SSH to undercloud manually). I
>> assume you are using tripleo-quickstart to deploy, if yes you can try
>> similar retry/pause in quickstart to see if it helps.
>>
>> Also you can join freenode channel #tripleo, #oooq to have quicker
>> feedback.
>>
>>
>>>
>>> Best Regards,
>>> Ashish Kurian
>>>
>>>
>>> On Mon, May 4, 2020 at 6:44 PM Ashish Kurian <ashishbnv at gmail.com>
>>> wrote:
>>>
>>>> Hello Folks,
>>>>
>>>> For my previous question to the mailing list, Arkady was able to figure
>>>> out the exact error message that was being generated in the logs. I am
>>>> forwarding my email conversation with Arkady so that all the information is
>>>> collected in the email.
>>>>
>>>> Additionally I am attaching the undercloud logs collected using the
>>>> LogTools utility with run mode 8, if required.
>>>>
>>>> Can anyone in this list, help me with identifying what is wrong with
>>>> the template and where can I locate this template to take a look into it?
>>>>
>>>> Best Regards,
>>>> Ashish Kurian
>>>>
>>>>
>>>> ---------- Forwarded message ---------
>>>> From: Arkady Shtempler <ashtempl at redhat.com>
>>>> Date: Mon, May 4, 2020 at 4:23 PM
>>>> Subject: Re: [rdo-users] [TripleO] Undercloud unreachable
>>>> To: Ashish Kurian <ashishbnv at gmail.com>
>>>>
>>>>
>>>> Hi Ashish!
>>>>
>>>> I was able to find these Errors (these are nor related to SSH problem
>>>> that you have, but indicates on some FATAL error in used templates) in
>>>> *builder-undercloud.log*
>>>>
>>>> ~~~~~~~~~~~~~~~~
>>>> /home/ashtempl/zahlabut/home/stack/builder-undercloud.log
>>>> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>>>>
>>>> 2058-  Updating   : linux-firmware-20191203-76.gite8a0f4c.el7.noarch
>>>>       273/629
>>>> 2059-  Installing : kernel-3.10.0-1127.el7.x86_64
>>>>      274/629
>>>>
>>>> *2060:grubby fatal error: unable to find a suitable template2061:grubby
>>>> fatal error: unable to find a suitable template*
>>>> 2062-  Updating   : libreport-filesystem-2.1.11-53.el7.centos.x86_64
>>>>       275/629
>>>> 2063-  Updating   : mdadm-4.1-4.el7.x86_64
>>>>       276/629
>>>> 2064-  Updating   : 1:libguestfs-1.40.2-9.el7.x86_64
>>>>       277/629
>>>> 2065-  Updating   : 1:python-libguestfs-1.40.2-9.el7.x86_64
>>>>      278/629
>>>> 2066-  Updating   : fence-agents-all-4.2.1-30.el7.x86_64
>>>>       279/629
>>>> 2067-  Updating   : 2:docker-1.13.1-161.git64e9980.el7_8.x86_64
>>>>      280/629
>>>> 2068-  Updating   : conntrack-tools-1.4.4-7.el7.x86_64
>>>>       281/629
>>>>
>>>>
>>>>
>>>> 2421-No '/dev/log' or 'logger' included for syslog logging
>>>> 2422-No '/dev/log' or 'logger' included for syslog logging
>>>>
>>>> *2423:grubby fatal error: unable to find a suitable template2424:grubby
>>>> fatal error: unable to find a suitable template*
>>>> 2425-  Verifying  : 10:qemu-kvm-common-ev-2.12.0-44.1.el7_8.1.x86_64
>>>>         1/629
>>>> 2426-  Verifying  : 1:grub2-tools-2.02-0.81.el7.centos.x86_64
>>>>        2/629
>>>> 2427-  Verifying  : certmonger-0.78.4-12.el7.x86_64
>>>>        3/629
>>>> 2428-  Verifying  : boost-program-options-1.53.0-28.el7.x86_64
>>>>         4/629
>>>> 2429-  Verifying  : dracut-config-rescue-033-568.el7.x86_64
>>>>        5/629
>>>> 2430-  Verifying  : libvirt-daemon-driver-qemu-4.5.0-33.el7.x86_64
>>>>         6/629
>>>> 2431-  Verifying  : 1:libguestfs-1.40.2-9.el7.x86_64
>>>>         7/629
>>>>
>>>>
>>>>
>>>> 1780-  Updating   : ipa-common-4.6.6-11.el7.centos.noarch
>>>>        3/629
>>>> 1781-  Updating   : setup-2.8.71-11.el7.noarch
>>>>         4/629newaliases: warning: valid_hostname: invalid character
>>>> 40(decimal)...<--LogTool-LINE IS TOO LONG!
>>>> *1782:newaliases: fatal: unable to use my own hostname*
>>>> 1783-
>>>> 1784-warning: /etc/shadow created as /etc/shadow.rpmnew
>>>> 1785-  Updating   : 32:bind-license-9.11.4-16.P2.el7_8.2.noarch
>>>>        5/629
>>>> 1786-  Updating   :
>>>> subscription-manager-rhsm-certificates-1.24.26-1.el7.c     6/629
>>>> 1787-  Updating   : ipa-client-common-4.6.6-11.el7.centos.noarch
>>>>         7/629
>>>> 1788-  Updating   : 1:grub2-pc-modules-2.02-0.81.el7.centos.noarch
>>>>         8/629
>>>> 1789-  Updating   : libvirt-bash-completion-4.5.0-33.el7.x86_64
>>>>        9/629
>>>>
>>>> BTW - maybe you can try to run LogTool
>>>> <https://github.com/zahlabut/LogTool> you need to run mode number 8, I
>>>> mean:
>>>> 8) - Export ERRORs/WARNINGs from Undercloud logs
>>>> Actually it's very simple to use this tool, you just have to clone it
>>>> to your Undercloud host and to start PyTool.py
>>>>
>>>> Thanks!
>>>>
>>>> On Mon, May 4, 2020 at 5:05 PM Ashish Kurian <ashishbnv at gmail.com>
>>>> wrote:
>>>>
>>>>> Hello Arkady,
>>>>>
>>>>> Please find the two set of log files.
>>>>>
>>>>> Just to make your analysis easier, the quickstart is failing at the
>>>>> playbook :
>>>>>
>>>>> TASK [Gathering Facts]
>>>>> ********************************************************************************************************************************************************************************
>>>>> task path: /home/ashish/.quickstart/playbooks/quickstart.yml:67
>>>>>
>>>>> Best Regards,
>>>>> Ashish Kurian
>>>>>
>>>>>
>>>>> On Mon, May 4, 2020 at 3:50 PM Arkady Shtempler <ashtempl at redhat.com>
>>>>> wrote:
>>>>>
>>>>>> Hi!
>>>>>>
>>>>>> I'm not sure that you'll be able to get them all in one zip file that
>>>>>> won't exceed max email attachment size.
>>>>>> Let's start with log files that you have under /home/stack and
>>>>>> /var/log
>>>>>>
>>>>>> Thanks!
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Mon, May 4, 2020 at 4:36 PM Ashish Kurian <ashishbnv at gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>>> Hello Arkady,
>>>>>>>
>>>>>>> Do you need all of them? How should I provide them to you? Over
>>>>>>> email or something else?
>>>>>>>
>>>>>>> Best Regards,
>>>>>>> Ashish Kurian
>>>>>>>
>>>>>>>
>>>>>>> On Mon, May 4, 2020 at 3:33 PM Arkady Shtempler <ashtempl at redhat.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> Hi Ashish!
>>>>>>>>
>>>>>>>> On your Undercloud host you have a bunch of logs under:
>>>>>>>> ['/var/log', '/home/stack', '/usr/share/', '/var/lib/']'
>>>>>>>>
>>>>>>>> Thanks!
>>>>>>>>
>>>>>>>>
>>>>>>>> On Mon, May 4, 2020 at 4:28 PM Ashish Kurian <ashishbnv at gmail.com>
>>>>>>>> wrote:
>>>>>>>>
>>>>>>>>> Hello Arkady,
>>>>>>>>>
>>>>>>>>> I appreciate your help. Ofcourse I can provide you the required
>>>>>>>>> log files. However, can you let me know what log file are you looking for
>>>>>>>>> and where they are located?
>>>>>>>>>
>>>>>>>>> Best Regards,
>>>>>>>>> Ashish Kurian
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> On Mon, May 4, 2020 at 3:17 PM Arkady Shtempler <
>>>>>>>>> ashtempl at redhat.com> wrote:
>>>>>>>>>
>>>>>>>>>> Hi Ashish!
>>>>>>>>>>
>>>>>>>>>> Is that possible to get the access to your log files?
>>>>>>>>>>
>>>>>>>>>> Thanks!
>>>>>>>>>>
>>>>>>>>>> On Mon, May 4, 2020 at 2:28 PM Ashish Kurian <ashishbnv at gmail.com>
>>>>>>>>>> wrote:
>>>>>>>>>>
>>>>>>>>>>> Hello Folks,
>>>>>>>>>>>
>>>>>>>>>>> For my TripleO installation, I am constantly getting failure to
>>>>>>>>>>> reach the undercloud with the message:
>>>>>>>>>>>
>>>>>>>>>>> MSG:
>>>>>>>>>>>
>>>>>>>>>>> Data could not be sent to remote host "undercloud". Make sure
>>>>>>>>>>> this host can be reached over ssh: Warning: Permanently added 'undercloud'
>>>>>>>>>>> (ECDSA) to the list of known hosts.
>>>>>>>>>>> System is booting up. See pam_nologin(8)
>>>>>>>>>>> Authentication failed.
>>>>>>>>>>>
>>>>>>>>>>> When I actually try to manually ssh into the undercloud using
>>>>>>>>>>> the actual commands, I am able to login into the undercloud.
>>>>>>>>>>>
>>>>>>>>>>> Can someone help me what might be the problem?
>>>>>>>>>>>
>>>>>>>>>>> Best Regards,
>>>>>>>>>>> Ashish Kurian
>>>>>>>>>>> _______________________________________________
>>>>>>>>>>> users mailing list
>>>>>>>>>>> users at lists.rdoproject.org
>>>>>>>>>>> http://lists.rdoproject.org/mailman/listinfo/users
>>>>>>>>>>>
>>>>>>>>>>> To unsubscribe: users-unsubscribe at lists.rdoproject.org
>>>>>>>>>>>
>>>>>>>>>> _______________________________________________
>>> users mailing list
>>> users at lists.rdoproject.org
>>> http://lists.rdoproject.org/mailman/listinfo/users
>>>
>>> To unsubscribe: users-unsubscribe at lists.rdoproject.org
>>>
>>
>>
>> --
>> Yatin Karel
>>
> _______________________________________________
> users mailing list
> users at lists.rdoproject.org
> http://lists.rdoproject.org/mailman/listinfo/users
>
> To unsubscribe: users-unsubscribe at lists.rdoproject.org
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.rdoproject.org/pipermail/users/attachments/20200512/0bc30b9c/attachment-0001.html>


More information about the users mailing list