On 06/23/2015 01:26 PM, Dmitry Tantsur wrote:
Hi all,
So, Ironic task manager is attacking us again:
https://bugzilla.redhat.com/show_bug.cgi?id=1233452
Previously we already had
https://bugzilla.redhat.com/show_bug.cgi?id=1212134
I've implemented retries upstream in ironicclient and backported to our
packages. Later I had to bump the retry timeout in instack-undercloud to
1 minute:
https://github.com/rdo-management/instack-undercloud/blob/master/scripts/...
Now we have the same problem in another place and I wonder how to fix
it. I have 2 obvious idea:
1. Patch ironicclient to have longer default timeout (2 mins?)
2. Update stackrc to carry longer IRONIC_MAX_RETRIES and
IRONIC_RETRY_INTERVAL
I'd prefer the latter, as it does not touch ironicclient package, only
undercloud installation tool. It can also be changed more easily in run
time. WDYT?
I wonder what the root cause is as well. I suspect some very slow BMC's
take too much time to do power actions or power syncs. It's possible
we'll make a wrong guess, and the problem will persist despite becoming
rare.
Update: posted
https://review.openstack.org/#/c/194619/ to help
debugging, otherwise it's hard to tell who and why reserved a node
Thanks,
Dmitry
_______________________________________________
Rdo-list mailing list
Rdo-list(a)redhat.com
https://www.redhat.com/mailman/listinfo/rdo-list
To unsubscribe: rdo-list-unsubscribe(a)redhat.com