
On Wed, Jul 31, 2013 at 04:45:03AM -0400, Sven Knohsalla wrote:
Hi,
I just set up a vanilla oVirt 3.2.1 environment, containing
oVirt engine 3.2.1 F18 based oVirt Engine Version: 3.2.1-1.fc18
F18 minimal node vdsm-4.10.3-10.fc18 libvirt-0.10.2.6-1.fc18 1.2.2-13.fc18 Kernel 3.9.11-200.fc18.x86_64
gluster-storage, replicated, EL6 glusterfs 3.4.0
The F18 minimal node was installed via oVirt engine, which worked fine, except that vdsmd wasn't started (node non-responsive for engine).
Manually starting vdsm works (HV up in engine), but autostart of vdsm (during boot) fails:
Jul 31 10:04:47 deovn-a04 systemd-vdsmd[711]: Note: Forwarding request to 'systemctl disable libvirt-guests.service'.
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: vdsm: libvirt already configured for vdsm [ OK ]
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting ntpd...
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start ntpd.service
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting wdmd...
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start wdmd.service
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting sanlock...
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start sanlock.service
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting iscsid:
Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start libvirtd.service
Jul 31 10:06:17 deovn-a04 systemd[1]: vdsmd.service operation timed out. Terminating.
Jul 31 10:06:17 deovn-a04 systemd[1]: Unit vdsmd.service entered failed state.
I did find similar issues:
https://bugzilla.redhat.com/show_bug.cgi?id=832199
https://bugzilla.redhat.com/show_bug.cgi?id=720359
http://lists.ovirt.org/pipermail/users/2012-September/009624.html
Unfortunately, there's no hint in them. libvirtd starts without any delay when triggered manually.
The strange thing here: the F18 node deploy worked with engine 3.2.0 3-4 weeks ago. I have updated engine / F18 since then.
Can anybody give me some advice here?
I only have a debugging idea. Could you extend TimeoutStartSec=10min in /usr/lib/systemd/system/vdsmd.service to give yourself more time for debugging? Then you can find the pid of the vdsmd.service script and run pstree -p <pid> to see what it is running.
You can also edit the first line of /lib/systemd/systemd-vdsmd to #!/bin/sh -xv so that it vomits more information into systemd's log.
Please let us know what you find out.
Dan.
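For reference, the debugging steps suggested above could be scripted roughly as follows. This is only a sketch: the function names set_start_timeout and enable_shell_trace are made up for illustration, it assumes GNU sed (as shipped with Fedora), and it assumes the first line of the vdsmd start script is a shebang. The file paths in the usage comments are the ones named in this thread.

```shell
# Hypothetical helpers, not part of vdsm or systemd.

# Set TimeoutStartSec in a unit file: replace an existing setting,
# or add one directly below the [Service] header if none exists.
set_start_timeout() {
    unit_file=$1
    timeout=$2
    if grep -q '^TimeoutStartSec=' "$unit_file"; then
        sed -i "s|^TimeoutStartSec=.*|TimeoutStartSec=$timeout|" "$unit_file"
    else
        sed -i "/^\[Service\]\$/a TimeoutStartSec=$timeout" "$unit_file"
    fi
}

# Switch a script's shebang to "#!/bin/sh -xv" so every command is echoed.
# Leaves the file untouched if the first line is not a shebang (assumption).
enable_shell_trace() {
    script=$1
    if head -n 1 "$script" | grep -q '^#!'; then
        sed -i '1s|.*|#!/bin/sh -xv|' "$script"
    fi
}

# Possible usage, as root on the node (back up both files first):
#   set_start_timeout /usr/lib/systemd/system/vdsmd.service 10min
#   systemctl daemon-reload
#   enable_shell_trace /lib/systemd/systemd-vdsmd
#   systemctl start vdsmd.service &
#   pstree -p "$(pgrep -o -f systemd-vdsmd)"
```

systemctl daemon-reload is needed so that systemd picks up the edited unit file. While the start job hangs, pstree -p shows which child command the script is waiting on, and the -xv trace should appear in the same log as the excerpt quoted above.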