[Rdo-list] intermittent dhcp problem
Zhao, Xin
xzhao at bnl.gov
Tue Oct 14 20:48:17 UTC 2014
Hello,
I have a multi-node icehouse deployment (one controller, one network and
one compute node), using neutron ML2 and openvswitch/VLAN networking,
based on RDO on RHEL6. Everything seems to work fine.
But once in a while, new instances can't get IP anymore, the syslog
shows errors like "DHCPDISCOVER .... no address available", neutron DB
shows the IP is allocated, but the DHCP conf file (eg.
/var/lib/neutron/dhcp/XXXXXX/host) doesn't have it. No particular error
messages from the dhcp agent log file. Restarting dhcp agent doesn't fix
it. I have to restart all neutron services, including openvswitch
daemon, to get the system back.
Another issue I notice is, on the network node, dmesg shows the
following errors repeatedly, even when the networking services appear to
be working fine :
"......
tap84e6f9d1-c2: hw csum failure.
Pid: 0, comm: swapper Not tainted 2.6.32-431.5.1.el6.x86_64 #1
Call Trace:
<IRQ> [<ffffffff8145bbf2>] ? netdev_rx_csum_fault+0x42/0x50
[<ffffffff81454200>] ? __skb_checksum_complete_head+0x60/0x70
......
"
Below after my signature are some system information of the network node.
Any ideas on what's going on and how to debug it?
Thanks a lot,
Xin
# uname -a
Linux cldnet02.cloud.local 2.6.32-431.5.1.el6.x86_64 #1 SMP Tue Feb 11
13:30:01 CST 2014 x86_64 x86_64 x86_64 GNU/Linux
# rpm -qa | grep neutron
python-neutronclient-2.3.4-4.el6.noarch
openstack-neutron-2014.1.2-1.el6.noarch
openstack-neutron-ml2-2014.1.2-1.el6.noarch
python-neutron-2014.1.2-1.el6.noarch
openstack-neutron-openvswitch-2014.1.2-1.el6.noarch
# rpm -qa | grep openvswitch
openvswitch-1.11.0_8ce28d-1.el6ost.x86_64
openstack-neutron-openvswitch-2014.1.2-1.el6.noarch
the NIC for the VM network:
# ethtool -i p1p2
driver: myri10ge
version: 1.5.1-1.451
firmware-version: 1.4.52 -- 2010/10/28 21:27:06 m
bus-info: 0000:06:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no
the NIC for the management network:
# ethtool -i em1
driver: bnx2
version: 2.2.3
firmware-version: 4.6.8 bc 4.6.4 NCSI 1.0.6
bus-info: 0000:01:00.0
supports-statistics: yes
supports-test: yes
supports-eeprom-access: yes
supports-register-dump: yes
supports-priv-flags: no
the NIC for the external network:
# ethtool -i p1p1
driver: myri10ge
version: 1.5.1-1.451
firmware-version: 1.4.52 -- 2010/10/28 21:27:06 m
bus-info: 0000:05:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no
More information about the dev
mailing list