[rdo-list] RDO TripleO Mitaka Overcloud Failing

Marius Cornea marius at remote-lab.net
Thu Aug 4 08:25:51 UTC 2016


OK, I don't actually see an error in the logs; the last thing that shows up is:

on controller-0:
[DEBUG] Running /var/lib/heat-config/hooks/puppet <
/var/lib/heat-config/deployed/c989f58d-cd38-4813-a174-7e42c82bcb6f.json

on compute-0:
[DEBUG] Running /var/lib/heat-config/hooks/puppet <
/var/lib/heat-config/deployed/c5265c58-96ae-49d5-9c1e-a38041e2b130.json
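
As a side note, here is a minimal sketch for pulling the last hook invocation out of the journal text. It assumes each invocation is logged on a single line in the form shown above; the function name last_hook_invocation is made up for illustration, and the first sample line fed to it is a hypothetical stand-in for real journal output:

```shell
#!/bin/sh
# Hypothetical helper: read os-collect-config journal text on stdin and print
# the last "<hook> < <deployed json>" invocation found in it.
# Assumption: each invocation is logged on one line that ends in ".json".
last_hook_invocation() {
    sed -n 's|^.*\[DEBUG] Running \(/[^ ]*/hooks/[^ ]* < [^ ]*\.json\)$|\1|p' | tail -n 1
}

# Demo with two sample lines; the second uses the controller-0 ID shown above.
printf '%s\n' \
    'os-collect-config: [DEBUG] Running /var/lib/heat-config/hooks/script < /var/lib/heat-config/deployed/example.json' \
    'os-collect-config: [DEBUG] Running /var/lib/heat-config/hooks/puppet < /var/lib/heat-config/deployed/c989f58d-cd38-4813-a174-7e42c82bcb6f.json' \
    | last_hook_invocation
```

On a node, the input would come from the journal instead of printf, for example by piping journalctl -l -u os-collect-config into the function.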

I suspect these steps are timing out, so let's try running them
manually to figure out what's going on.

Running the hook manually prints the puppet apply command it executes.
Here is the output from my environment as an example:

# /var/lib/heat-config/hooks/puppet < \
    /var/lib/heat-config/deployed/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff.json

[2016-08-04 08:12:21,609] (heat-config) [DEBUG] Running
FACTER_heat_outputs_path="/var/run/heat-config/heat-config-puppet/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff"
 FACTER_fqdn="overcloud-controller-0.localdomain"
FACTER_deploy_config_name="ControllerOvercloudServicesDeployment_Step4"
 puppet apply --detailed-exitcodes
/var/lib/heat-config/heat-config-puppet/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff.pp

The next step is to stop it (Ctrl+C), copy the puppet apply command,
append --debug, and run it:

# FACTER_heat_outputs_path="/var/run/heat-config/heat-config-puppet/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff" \
    FACTER_fqdn="overcloud-controller-0.localdomain" \
    FACTER_deploy_config_name="ControllerOvercloudServicesDeployment_Step4" \
    puppet apply --detailed-exitcodes \
    /var/lib/heat-config/heat-config-puppet/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff.pp \
    --debug

This should output puppet debug info that might lead us to where it
gets stuck.  Please paste the output so we can investigate further.
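
If copying the command by hand is error-prone, the step above can be sketched as a small filter. This is only an illustration: the function name puppet_debug_cmd is made up, and it assumes the hook logs the puppet apply command on a single line after "[DEBUG] Running", as in the example output. The sample line is a shortened version of that output:

```shell
#!/bin/sh
# Hypothetical helper: read the hook's output on stdin, find the first line
# that logs a puppet apply command, and print that command with --debug
# appended. It only prints the command; it does not run it.
puppet_debug_cmd() {
    sed -n 's|^.*\[DEBUG] Running \(.*puppet apply .*\)$|\1 --debug|p' | head -n 1
}

# Demo with a shortened version of the example log line.
printf '%s\n' \
    '[2016-08-04 08:12:21,609] (heat-config) [DEBUG] Running FACTER_fqdn="overcloud-controller-0.localdomain" puppet apply --detailed-exitcodes /var/lib/heat-config/heat-config-puppet/d41cefd9-b70e-4e22-9e86-9a5cf5de5bff.pp' \
    | puppet_debug_cmd
```

The printed line can then be reviewed and pasted into a root shell on the node.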

Thanks

On Thu, Aug 4, 2016 at 3:22 AM, Gunjan, Milind [CTO]
<Milind.Gunjan at sprint.com> wrote:
> Thanks a lot Christopher for the suggestions.
>
> Marius: Thanks a lot for helping me out.  I am attaching the requested logs.
>
> I tried to redeploy the overcloud with 3 controllers, but the issue remains the same. The overcloud stack deployment is failing at the post-deployment configuration steps as before. When I went through /var/log/messages for the different services, it looked like there is an issue with the haproxy service. The Neutron service is failing too, and the service endpoints configured through Puppet are not reachable for any of the failed services. I have attached the os-collect-config journals from all four nodes.
>
>
> Please let me know if there are any other logs I can provide or any other troubleshooting steps I can try.
>
> Best Regards,
> Milind
>
> -----Original Message-----
> From: Marius Cornea [mailto:marius at remote-lab.net]
> Sent: Wednesday, August 03, 2016 4:00 PM
> To: Gunjan, Milind [CTO] <Milind.Gunjan at sprint.com>
> Cc: rdo-list at redhat.com
> Subject: Re: [rdo-list] RDO TripleO Mitaka Non-HA Overcloud Failing
>
> Hi,
>
> Could you please ssh to the nodes, gather the os-collect-config journals (journalctl -l -u os-collect-config) and attach them here?
>
> Thank you,
> Marius
>
> On Wed, Aug 3, 2016 at 8:40 PM, Gunjan, Milind [CTO] <Milind.Gunjan at sprint.com> wrote:
>> Hi All,
>>
>>
>>
>> I am currently working on a TripleO Mitaka OpenStack deployment on
>> baremetal servers:
>>
>> Undercloud – 1 baremetal server with 2 NICs (1 for provisioning and the
>> 2nd for external network connectivity)
>>
>> Controller – 1 baremetal server (6 NICs, with each OpenStack VLAN on a
>> separate NIC)
>>
>> Compute – 1 baremetal server
>>
>>
>>
>> I followed Graeme's instructions here:
>> https://www.redhat.com/archives/rdo-list/2016-June/msg00049.html to
>> set up the undercloud. The undercloud deployment was successful, and all
>> the images required for the overcloud deployment were built properly as
>> per the instructions. I would like to mention that I used libvirt tools
>> to modify the root password on overcloud-full.qcow2, and we also
>> modified the grub file to include “net.ifnames=0 biosdevname=0” to
>> restore the old interface naming.
>>
>>
>>
>> I was able to successfully introspect 2 servers to be used as the
>> controller and compute nodes. We also added the serial number of the
>> device discovered during introspection as the root device:
>>
>> ironic node-update 604f7dfc-38af-4fe0-8986-4c8ac5f956e2 add \
>>   properties/root_device='{"serial": "618e728372833010c79bead9066f0f9e"}'
>>
>> ironic node-update afcfbee3-3108-48da-a6da-aba8f422642c add \
>>   properties/root_device='{"serial": "618e7283728347101f2107b511603adc"}'
>>
>>
>>
>> Next, we tagged the introspected nodes with the control and compute
>> profiles respectively, with the local boot option:
>>
>>
>>
>> ironic node-update 604f7dfc-38af-4fe0-8986-4c8ac5f956e2 add \
>>   properties/capabilities='profile:control,boot_option:local'
>>
>> ironic node-update afcfbee3-3108-48da-a6da-aba8f422642c add \
>>   properties/capabilities='profile:compute,boot_option:local'
>>
>>
>>
>> We used multiple-NIC templates for the control and compute nodes, which
>> are attached along with the network-environment.yaml file. The default
>> network isolation template file has been used.
>>
>>
>>
>>
>>
>> The deployment script looks like this:
>>
>> #!/bin/bash
>>
>> DIR="$( cd "$( dirname "${BASH_SOURCE[0]}" )" && pwd )"
>> template_base_dir="$DIR"
>> ntpserver=<sprint.ntp.server.ip> #Sprint LAB
>>
>> openstack overcloud deploy --templates \
>> -e /usr/share/openstack-tripleo-heat-templates/environments/network-isolation.yaml \
>> -e ${template_base_dir}/environments/network-environment.yaml \
>> --control-flavor control --compute-flavor compute \
>> --control-scale 1 --compute-scale 1 \
>> --ntp-server $ntpserver \
>> --neutron-network-type vxlan --neutron-tunnel-types vxlan --debug
>>
>>
>>
>> The Heat stack deployment runs for a really long time (more than 4
>> hours) and gets stuck at the post-deployment configuration. Please find
>> below the output captured during the install:
>>
>>
>>
>>
>>
>> Every 2.0s: ironic node-list && nova list && heat stack-list && heat resource-list -n5 overcloud | grep -vi complete    Wed Aug  3 17:33:37 2016
>>
>>
>>
>> +--------------------------------------+------+--------------------------------------+-------------+--------------------+-------------+
>> | UUID                                 | Name | Instance UUID                        | Power State | Provisioning State | Maintenance |
>> +--------------------------------------+------+--------------------------------------+-------------+--------------------+-------------+
>> | 604f7dfc-38af-4fe0-8986-4c8ac5f956e2 | None | 9e7aae15-cabc-4489-a1b2-778915a78df2 | power on    | active             | False       |
>> | afcfbee3-3108-48da-a6da-aba8f422642c | None | c1ab52a9-461a-4a11-a13e-e57ff0a3ae2a | power on    | active             | False       |
>> +--------------------------------------+------+--------------------------------------+-------------+--------------------+-------------+
>>
>> +--------------------------------------+-------------------------+--------+------------+-------------+------------------------+
>> | ID                                   | Name                    | Status | Task State | Power State | Networks               |
>> +--------------------------------------+-------------------------+--------+------------+-------------+------------------------+
>> | 9e7aae15-cabc-4489-a1b2-778915a78df2 | overcloud-controller-0  | ACTIVE | -          | Running     | ctlplane=192.168.149.9 |
>> | c1ab52a9-461a-4a11-a13e-e57ff0a3ae2a | overcloud-novacompute-0 | ACTIVE | -          | Running     | ctlplane=192.168.149.8 |
>> +--------------------------------------+-------------------------+--------+------------+-------------+------------------------+
>>
>> +--------------------------------------+------------+---------------+---------------------+--------------+
>> | id                                   | stack_name | stack_status  | creation_time       | updated_time |
>> +--------------------------------------+------------+---------------+---------------------+--------------+
>> | 26ee0150-4cfa-4268-9107-8bfbf6712913 | overcloud  | CREATE_FAILED | 2016-08-03T08:11:34 | None         |
>> +--------------------------------------+------------+---------------+---------------------+--------------+
>>
>> +---------------------------------------------+--------------------------------------+---------------------------------------+--------------------+---------------------+----------------------------------------------------------------------------------------+
>> | resource_name                               | physical_resource_id                 | resource_type                         | resource_status    | updated_time        | stack_name                                                                             |
>> +---------------------------------------------+--------------------------------------+---------------------------------------+--------------------+---------------------+----------------------------------------------------------------------------------------+
>> | ComputeNodesPostDeployment                  | 3797aec6-e543-4dda-9cd1-c7261e827a64 | OS::TripleO::ComputePostDeployment    | CREATE_FAILED      | 2016-08-03T08:11:35 | overcloud                                                                              |
>> | ControllerNodesPostDeployment               | 6ad9f88c-5c55-4125-97f1-eb0e33329d16 | OS::TripleO::ControllerPostDeployment | CREATE_FAILED      | 2016-08-03T08:11:35 | overcloud                                                                              |
>> | ComputePuppetDeployment                     | 8b199f85-e4f9-48ad-9aee-b1cdf4900b9f | OS::Heat::StructuredDeployments       | CREATE_FAILED      | 2016-08-03T08:29:19 | overcloud-ComputeNodesPostDeployment-6vxfu2g2qucy                                      |
>> | ControllerOvercloudServicesDeployment_Step4 | 15509f59-ff28-43af-95dd-6247a6a32c2d | OS::Heat::StructuredDeployments       | CREATE_FAILED      | 2016-08-03T08:29:20 | overcloud-ControllerNodesPostDeployment-35y7uafngfwj                                   |
>> | 0                                           | 7cd0aa3d-742f-4e78-99ca-b2a575913f8e | OS::Heat::StructuredDeployment        | CREATE_IN_PROGRESS | 2016-08-03T08:30:04 | overcloud-ComputeNodesPostDeployment-6vxfu2g2qucy-ComputePuppetDeployment-cpahcct3tfw3 |
>> | 0                                           | 5e9308f7-c3a9-4a94-a017-e1acb694c036 | OS::Heat::StructuredDeployment
>>
>>
>>
>>
>> [stack@mitaka-uc ~]$ openstack software deployment show 5e9308f7-c3a9-4a94-a017-e1acb694c036
>> +---------------+--------------------------------------+
>> | Field         | Value                                |
>> +---------------+--------------------------------------+
>> | id            | 5e9308f7-c3a9-4a94-a017-e1acb694c036 |
>> | server_id     | 9e7aae15-cabc-4489-a1b2-778915a78df2 |
>> | config_id     | 86d49e66-2f25-4cb1-b623-5ae87b01bb64 |
>> | creation_time | 2016-08-03T08:32:10                  |
>> | updated_time  |                                      |
>> | status        | IN_PROGRESS                          |
>> | status_reason | Deploy data available                |
>> | input_values  | {}                                   |
>> | action        | CREATE                               |
>> +---------------+--------------------------------------+
>>
>>
>>
>> [stack@mitaka-uc ~]$ openstack software deployment show --long 5e9308f7-c3a9-4a94-a017-e1acb694c036
>> +---------------+--------------------------------------+
>> | Field         | Value                                |
>> +---------------+--------------------------------------+
>> | id            | 5e9308f7-c3a9-4a94-a017-e1acb694c036 |
>> | server_id     | 9e7aae15-cabc-4489-a1b2-778915a78df2 |
>> | config_id     | 86d49e66-2f25-4cb1-b623-5ae87b01bb64 |
>> | creation_time | 2016-08-03T08:32:10                  |
>> | updated_time  |                                      |
>> | status        | IN_PROGRESS                          |
>> | status_reason | Deploy data available                |
>> | input_values  | {}                                   |
>> | action        | CREATE                               |
>> | output_values | None                                 |
>> +---------------+--------------------------------------+
>>
>>
>>
>> [stack@mitaka-uc ~]$ openstack stack resource list 3797aec6-e543-4dda-9cd1-c7261e827a64
>> +-------------------------+--------------------------------------+-------------------------------------------------+-----------------+---------------------+
>> | resource_name           | physical_resource_id                 | resource_type                                   | resource_status | updated_time        |
>> +-------------------------+--------------------------------------+-------------------------------------------------+-----------------+---------------------+
>> | ComputeArtifactsConfig  | a33cd04d-61ab-4429-8565-182409c2b97f | file:///usr/share/openstack-tripleo-heat-       | CREATE_COMPLETE | 2016-08-03T08:29:19 |
>> |                         |                                      | templates/puppet/deploy-artifacts.yaml          |                 |                     |
>> | ComputePuppetConfig     | 5bb712b0-5358-46c7-a444-f9adedfedd50 | OS::Heat::SoftwareConfig                        | CREATE_COMPLETE | 2016-08-03T08:29:19 |
>> | ComputePuppetDeployment | 8b199f85-e4f9-48ad-9aee-b1cdf4900b9f | OS::Heat::StructuredDeployments                 | CREATE_FAILED   | 2016-08-03T08:29:19 |
>> | ComputeArtifactsDeploy  | 1d13bf34-fc66-4bf1-a3b7-1dd815f58f5a | OS::Heat::StructuredDeployments                 | CREATE_COMPLETE | 2016-08-03T08:29:19 |
>> | ExtraConfig             |                                      | OS::TripleO::NodeExtraConfigPost                | INIT_COMPLETE   | 2016-08-03T08:29:19 |
>> +-------------------------+--------------------------------------+-------------------------------------------------+-----------------+---------------------+
>>
>>
>>
>> [stack@mitaka-uc ~]$ openstack stack resource list 8b199f85-e4f9-48ad-9aee-b1cdf4900b9f
>> +---------------+--------------------------------------+--------------------------------+--------------------+---------------------+
>> | resource_name | physical_resource_id                 | resource_type                  | resource_status    | updated_time        |
>> +---------------+--------------------------------------+--------------------------------+--------------------+---------------------+
>> | 0             | 7cd0aa3d-742f-4e78-99ca-b2a575913f8e | OS::Heat::StructuredDeployment | CREATE_IN_PROGRESS | 2016-08-03T08:30:04 |
>> +---------------+--------------------------------------+--------------------------------+--------------------+---------------------+
>>
>> [stack@mitaka-uc ~]$ openstack software deployment show 7cd0aa3d-742f-4e78-99ca-b2a575913f8e
>> +---------------+--------------------------------------+
>> | Field         | Value                                |
>> +---------------+--------------------------------------+
>> | id            | 7cd0aa3d-742f-4e78-99ca-b2a575913f8e |
>> | server_id     | c1ab52a9-461a-4a11-a13e-e57ff0a3ae2a |
>> | config_id     | 24e5c0db-f84f-4a94-8f8e-8e38e73ccc86 |
>> | creation_time | 2016-08-03T08:30:05                  |
>> | updated_time  |                                      |
>> | status        | IN_PROGRESS                          |
>> | status_reason | Deploy data available                |
>> | input_values  | {}                                   |
>> | action        | CREATE                               |
>> +---------------+--------------------------------------+
>>
>>
>>
>> The keystonerc file was not generated. Please find below the
>> openstack-status output from the controller and compute nodes.
>>
>>
>>
>> [heat-admin@overcloud-controller-0 ~]$ openstack-status
>>
>> == Nova services ==
>>
>> openstack-nova-api:                     active
>>
>> openstack-nova-compute:                 inactive  (disabled on boot)
>>
>> openstack-nova-network:                 inactive  (disabled on boot)
>>
>> openstack-nova-scheduler:               activating(disabled on boot)
>>
>> openstack-nova-cert:                    active
>>
>> openstack-nova-conductor:               active
>>
>> openstack-nova-console:                 inactive  (disabled on boot)
>>
>> openstack-nova-consoleauth:             active
>>
>> openstack-nova-xvpvncproxy:             inactive  (disabled on boot)
>>
>> == Glance services ==
>>
>> openstack-glance-api:                   active
>>
>> openstack-glance-registry:              active
>>
>> == Keystone service ==
>>
>> openstack-keystone:                     inactive  (disabled on boot)
>>
>> == Horizon service ==
>>
>> openstack-dashboard:                    uncontactable
>>
>> == neutron services ==
>>
>> neutron-server:                         failed    (disabled on boot)
>>
>> neutron-dhcp-agent:                     inactive  (disabled on boot)
>>
>> neutron-l3-agent:                       inactive  (disabled on boot)
>>
>> neutron-metadata-agent:                 inactive  (disabled on boot)
>>
>> neutron-lbaas-agent:                    inactive  (disabled on boot)
>>
>> neutron-openvswitch-agent:              inactive  (disabled on boot)
>>
>> neutron-metering-agent:                 inactive  (disabled on boot)
>>
>> == Swift services ==
>>
>> openstack-swift-proxy:                  active
>>
>> openstack-swift-account:                active
>>
>> openstack-swift-container:              active
>>
>> openstack-swift-object:                 active
>>
>> == Cinder services ==
>>
>> openstack-cinder-api:                   active
>>
>> openstack-cinder-scheduler:             active
>>
>> openstack-cinder-volume:                active
>>
>> openstack-cinder-backup:                inactive  (disabled on boot)
>>
>> == Ceilometer services ==
>>
>> openstack-ceilometer-api:               active
>>
>> openstack-ceilometer-central:           active
>>
>> openstack-ceilometer-compute:           inactive  (disabled on boot)
>>
>> openstack-ceilometer-collector:         active
>>
>> openstack-ceilometer-notification:      active
>>
>> == Heat services ==
>>
>> openstack-heat-api:                     inactive  (disabled on boot)
>>
>> openstack-heat-api-cfn:                 active
>>
>> openstack-heat-api-cloudwatch:          inactive  (disabled on boot)
>>
>> openstack-heat-engine:                  inactive  (disabled on boot)
>>
>> == Sahara services ==
>>
>> openstack-sahara-api:                   active
>>
>> openstack-sahara-engine:                active
>>
>> == Support services ==
>>
>> libvirtd:                               active
>>
>> openvswitch:                            active
>>
>> dbus:                                   active
>>
>> target:                                 active
>>
>> rabbitmq-server:                        active
>>
>> memcached:                              active
>>
>>
>>
>>
>>
>> [heat-admin@overcloud-novacompute-0 ~]$ openstack-status
>>
>> == Nova services ==
>>
>> openstack-nova-api:                     inactive  (disabled on boot)
>>
>> openstack-nova-compute:                 activating(disabled on boot)
>>
>> openstack-nova-network:                 inactive  (disabled on boot)
>>
>> openstack-nova-scheduler:               inactive  (disabled on boot)
>>
>> openstack-nova-cert:                    inactive  (disabled on boot)
>>
>> openstack-nova-conductor:               inactive  (disabled on boot)
>>
>> openstack-nova-console:                 inactive  (disabled on boot)
>>
>> openstack-nova-consoleauth:             inactive  (disabled on boot)
>>
>> openstack-nova-xvpvncproxy:             inactive  (disabled on boot)
>>
>> == Glance services ==
>>
>> openstack-glance-api:                   inactive  (disabled on boot)
>>
>> openstack-glance-registry:              inactive  (disabled on boot)
>>
>> == Keystone service ==
>>
>> openstack-keystone:                     inactive  (disabled on boot)
>>
>> == Horizon service ==
>>
>> openstack-dashboard:                    uncontactable
>>
>> == neutron services ==
>>
>> neutron-server:                         inactive  (disabled on boot)
>>
>> neutron-dhcp-agent:                     inactive  (disabled on boot)
>>
>> neutron-l3-agent:                       inactive  (disabled on boot)
>>
>> neutron-metadata-agent:                 inactive  (disabled on boot)
>>
>> neutron-lbaas-agent:                    inactive  (disabled on boot)
>>
>> neutron-openvswitch-agent:              active
>>
>> neutron-metering-agent:                 inactive  (disabled on boot)
>>
>> == Swift services ==
>>
>> openstack-swift-proxy:                  inactive  (disabled on boot)
>>
>> openstack-swift-account:                inactive  (disabled on boot)
>>
>> openstack-swift-container:              inactive  (disabled on boot)
>>
>> openstack-swift-object:                 inactive  (disabled on boot)
>>
>> == Cinder services ==
>>
>> openstack-cinder-api:                   inactive  (disabled on boot)
>>
>> openstack-cinder-scheduler:             inactive  (disabled on boot)
>>
>> openstack-cinder-volume:                inactive  (disabled on boot)
>>
>> openstack-cinder-backup:                inactive  (disabled on boot)
>>
>> == Ceilometer services ==
>>
>> openstack-ceilometer-api:               inactive  (disabled on boot)
>>
>> openstack-ceilometer-central:           inactive  (disabled on boot)
>>
>> openstack-ceilometer-compute:           inactive  (disabled on boot)
>>
>> openstack-ceilometer-collector:         inactive  (disabled on boot)
>>
>> openstack-ceilometer-notification:      inactive  (disabled on boot)
>>
>> == Heat services ==
>>
>> openstack-heat-api:                     inactive  (disabled on boot)
>>
>> openstack-heat-api-cfn:                 inactive  (disabled on boot)
>>
>> openstack-heat-api-cloudwatch:          inactive  (disabled on boot)
>>
>> openstack-heat-engine:                  inactive  (disabled on boot)
>>
>> == Sahara services ==
>>
>> openstack-sahara-all:                   inactive  (disabled on boot)
>>
>> == Support services ==
>>
>> libvirtd:                               active
>>
>> openvswitch:                            active
>>
>> dbus:                                   active
>>
>> rabbitmq-server:                        inactive  (disabled on boot)
>>
>> memcached:                              inactive  (disabled on boot)
>>
>>
>>
>>
>>
>>
>>
>> Please let me know if there are any other logs I can provide that
>> would help in troubleshooting.
>>
>>
>>
>>
>>
>> Thanks a lot in advance for your help and support.
>>
>>
>>
>> Best Regards,
>>
>> Milind Gunjan
>>
>>
>>
>>
>> ________________________________
>>
>> This e-mail may contain Sprint proprietary information intended for
>> the sole use of the recipient(s). Any use by others is prohibited. If
>> you are not the intended recipient, please contact the sender and
>> delete all copies of the message.
>>
>> _______________________________________________
>> rdo-list mailing list
>> rdo-list at redhat.com
>> https://www.redhat.com/mailman/listinfo/rdo-list
>>
>> To unsubscribe: rdo-list-unsubscribe at redhat.com



