[Users] oVirt 3.2.1 /F18 node vdsm issue
Dan Kenigsberg
danken at redhat.com
Wed Jul 31 11:34:50 UTC 2013
On Wed, Jul 31, 2013 at 04:45:03AM -0400, Sven Knohsalla wrote:
> Hi,
>
> I just set up a vanilla oVirt 3.2.1 environment, containing
>
> oVirt engine
> 3.2.1 F18 based oVirt Engine Version: 3.2.1-1.fc18
>
> F18 minimal node
> vdsm-4.10.3-10.fc18 libvirt-0.10.2.6-1.fc18 1.2.2-13.fc18 Kernel 3.9.11-200.fc18.x86_64
>
> gluster-storage, replicated, EL6
> glusterfs 3.4.0
>
> F18 minimal node was installed via oVirt engine, worked fine, except that vdsmd wasn't started
> (node non-responsive for engine)
>
> Manually starting vdsm works (HV up in engine), autostart of vdsm (during boot) fails:
>
> Jul 31 10:04:47 deovn-a04 systemd-vdsmd[711]: Note: Forwarding request to 'systemctl disable libvirt-guests.service'.
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: vdsm: libvirt already configured for vdsm [ OK ]
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting ntpd...
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start ntpd.service
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting wdmd...
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start wdmd.service
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting sanlock...
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start sanlock.service
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Starting iscsid:
> Jul 31 10:04:48 deovn-a04 systemd-vdsmd[711]: Redirecting to /bin/systemctl start libvirtd.service
> Jul 31 10:06:17 deovn-a04 systemd[1]: vdsmd.service operation timed out. Terminating.
> Jul 31 10:06:17 deovn-a04 systemd[1]: Unit vdsmd.service entered failed state.
>
> Did find similar issues:
> https://bugzilla.redhat.com/show_bug.cgi?id=832199
> https://bugzilla.redhat.com/show_bug.cgi?id=720359
> http://lists.ovirt.org/pipermail/users/2012-September/009624.html
>
> Unfortunately, there's no hint.. libvirtd starts without any latency when manually triggered.
>
> The strange thing here: F18 node deploy worked with engine 3.2.0 3-4 weeks ago.
> Did update engine / F18.
>
> Can anybody give me some advice here?
I have only a debugging idea. Could you extend the start timeout by setting
TimeoutStartSec=10min
in /usr/lib/systemd/system/vdsmd.service, to give yourself more time for
debugging? Then you can find the pid of the vdsmd.service script and run
pstree -p <pid> to see what it is running.
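If editing the unit file by hand is inconvenient, something like the sketch below could set the timeout. The helper name set_start_timeout is made up for illustration (it is not part of vdsm or systemd), and it assumes the unit file has a [Service] section.

```shell
#!/bin/sh
# Hypothetical helper: set TimeoutStartSec in a systemd unit file.
# Any existing TimeoutStartSec= line is dropped and a new one is written
# right after the [Service] header. If the file has no [Service] section,
# no new line is added.
set_start_timeout() {
    unit_file=$1
    timeout=$2
    tmp=$(mktemp) || return 1
    awk -v t="$timeout" '
        /^TimeoutStartSec=/ { next }                      # drop the old setting
        { print }
        /^\[Service\]/ { print "TimeoutStartSec=" t }     # add the new one
    ' "$unit_file" > "$tmp" && cat "$tmp" > "$unit_file"  # keep the file mode
    rm -f "$tmp"
}
```

Usage, as root (systemd only re-reads a changed unit file after a daemon-reload):

    set_start_timeout /usr/lib/systemd/system/vdsmd.service 10min
    systemctl daemon-reload

While the start is hanging during boot, pgrep -f systemd-vdsmd should list candidate pids for pstree -p, assuming the start script's command line contains that name.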
You can edit the first line of /lib/systemd/systemd-vdsmd to
#!/bin/sh -xv
to vomit more information to systemd's log.
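A small sketch of that shebang change, keeping a backup so it is easy to revert. The helper name enable_shell_trace is hypothetical, and it assumes the first line of the script is a "#!" line.

```shell
#!/bin/sh
# Hypothetical helper: turn on shell tracing for a script by rewriting its
# first line to "#!/bin/sh -xv". The original is saved as <script>.orig.
# If the first line is not a "#!" line, the script is left unchanged.
enable_shell_trace() {
    script=$1
    cp -p "$script" "$script.orig" || return 1
    # Write back through redirection so the script keeps its mode bits.
    sed '1s|^#!.*$|#!/bin/sh -xv|' "$script.orig" > "$script"
}
```

Usage, as root:

    enable_shell_trace /lib/systemd/systemd-vdsmd

After the next boot the trace should show up next to the systemd-vdsmd[...] lines like the ones quoted above. To revert, copy the .orig file back over the script.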
Please let us know what you find.
Dan.