
Hey Nicolas, Am Donnerstag, den 10.10.2013, 11:50 +0200 schrieb Nicolas Ecarnot:
After a successful 3.1 setup, I'm starting a completely new 3.3 environment.
I installed a CentOS 6.4 for the manager with a 3.3 oVirt engine. For the nodes, on Dell M620 blades, I installed ovirt-node-iso-3.0.1-1.0.2.vdsm.el6.iso.
I setup a blade like this : - first two interfaces bonded + bridge dedicated to ovirt managment network - 4 next interfaces bonded + bridge dedicated to the iSCSI pure copper lan
(I've setup this exaclty same env. on my previous oVirt 3.1 setup and it is working fine)
Once I've configured that correctly on 3 nodes, I reboot (or stonith) the first node, to prove this is stable. I get various effects : - sometimes the bonding dedicated to the iscsi lan is lost (no more /etc/sysconfig/network-scripts/ifcfg-bond1 neither ifcfg-ovirtiscsi files) - sometimes the bonding dedicated to the ovirtmgmt part is lost or changed : the bonding mode I have set to 1 (active-backup) is replaced by mode 0 (balance-rr) (which leads to other local network issues)
could you provide the log files given in http://www.ovirt.org/Node_Troubleshooting#Log_Files so we get more information about what's happening in the background
- during reboot, I see "bnx2fs" issues : before them, pings to this node are OK (mode 1), after them, ping to this node get DUPs (mode 0)
This is very tiresome to see that such a simple thing as bonding and bridging are things I keep losing time on. Using command line on the node, I'm able to correct everything, but nothing is reboot-proof though I don't know what is causing those changes.
I'm sure I tried nothing exotic when first installing the node. I just used the TUI and made a very simple bonding with no VLAN. Even when trying to concentrate only on the ovirtmgmt bond oVirt can't seem to be stable.
Alon, can you say how vdsm/engine is handling pre-existing bonds and bridges (aka bonds and bridges created by nodes TUI before the registration) after the registration? Greetings fabian