Le 20/03/2015 14:40, Dan Kenigsberg a écrit :
On Fri, Mar 20, 2015 at 10:14:54AM +0100, Nicolas Ecarnot wrote:
> Hello,
>
> There are a number of bugs [1] reported these days about the issue aourd
> network config of the hosts, when dealing with interfaces manually
> configured, with bonding and VLANs.
> These /etc/sysconfig/network-scripts/ifcfg.* files are wiped by vdsm after
> rebooting.
>
> I see that there are people at Redhat working on these, and some cases were
> reproduced in lab conditions - and some were not.
>
> I upgraded 3 DC from 3.4.? to 3.5.1, and faced this issue (lost of every
> network files) in an non-consistent manner.
>
> I finally thought I coped with this problem by adding
> net_persistence = ifcfg
> to /etc/vdsm/vdsm.conf
> and indeed, when restarting vdsmd and the network, files were conserved.
>
> It was before I observed that some action [2] lead to /etc/vdsm/vdsm.conf
> being renamed into /etc/vdsm/vdsm.conf.some_timestamp and the original one
> replaced by a very short file with no netcfg persistence at all.
>
> I didn't identified [2]. That could be :
> - some actions made by me through the Web UI ?
> - service vdsmd restart ?
> - reboots ?
>
> I'm sure that some Redhat people know what could be responsible for renaming
> /etc/vdsm/vdsm.conf into /etc/vdsm/vdsm.conf.some_timestamp, and I wish they
> are working closely with Dan Kenigsberg and Michael Burman who helped a lot
> on these issues (or maybe, THEY are the coders responsible for this ?)
>
> [2] :
> -
https://bugzilla.redhat.com/show_bug.cgi?id=1154399
> -
https://bugzilla.redhat.com/show_bug.cgi?id=1188251
> - and more or less related :
>
https://bugzilla.redhat.com/show_bug.cgi?id=1134346
Thanks for reporting this issue. We are well aware of it, and working
hard to fix it. Unfortunately, there were several bugs on the process of
upgrading ifcfg-based network configuration to vdsm's own "unified
persistence" that sits under /var/lib/vdsm/persistence/netconf.
Would you share which platform are you using? el6? el7? ovirt-node, or
plain install?
We are using centos 6.6 on all our hosts, minimal install.
Idem on the manager, bare metal stand alone, not hosted.
There is a recent report that ovirt-node may be restarting
networking
while vdsm starts up, which may well explain the problem and its
inpredictability. Is this the case with you?
We are not using ovirt-nodes since 3 years, for some reasons.
Regarding /etc/vdsm/vdsm.conf: vdsm never rename it. Could it be
rpm's
new behavior (replacing vdsm.conf.rpmsave) ? Or could it be the node,
Fabian?
Let us stay prudent : I indeed did some yum upgrade, BUT :
- I made every step in a very modular way : first upgrade the manager
- then put one host in maintenance
- add the 3.5.1 repo on the host
- then web-gui-reinstall it (upgrading the useful packages)
- then put it up, migrate some VM on it, well, test it.
- then put it back into maintenance
- then yum upgrade it
- then reboot it
- then blah blah blah
well you see, I won't explain every step, but I did that in a very
cautious way, taking time for each of them, and repeating this whole
process more than 20 times.
I don't get why it is working like a charm on most of them, and facing
the issues mentioned above on a portion of them.
To answer to the renaming comment : yes Dan, some package upgrade
renamed vdsm.conf into rpmsave, BUT I was explicitly talking about an
additional renaming into vdsm.conf.201503191220 something, and I never
saw a package upgrade do that.
Just a final word : though I sound grumpy and find this issue a real
pain, I am actually absolutely amazed by all the work done by all the
oVirt community and the Redhat people :)
--
Nicolas Ecarnot