<div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Aug 4, 2016 at 6:10 PM, Nicolás <span dir="ltr"><<a href="mailto:nicolas@devels.es" target="_blank">nicolas@devels.es</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><br>
<br>
El 04/08/16 a las 15:25, Arik Hadas escribió:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div class="h5">
<br>
----- Original Message -----<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
El 2016-08-04 08:24, Arik Hadas escribió:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
----- Original Message -----<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
El 04/08/16 a las 07:18, Arik Hadas escribió:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
----- Original Message -----<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Hi,<br>
<br>
We're running oVirt 4.0.1 and today I found out that one of our hosts<br>
has all its VMs in an unknown state. I actually don't know how (and<br>
when) did this happen, but I'd like to restore service possibly without<br>
turning off these machines. The host is up, the VMs are up, 'qemu'<br>
process exists, no errors, it's just the VMs running on it that have a<br>
'?' where status is defined.<br>
<br>
Is it safe in this case to simply modify database and set those VM's<br>
status to 'up'? I remember having to do this a time ago when we faced<br>
storage issues, it didn't break anything back then. If not, is there a<br>
"safe" way to migrate those VMs to a different host and restart the<br>
host<br>
that marked them as unknown?<br>
</blockquote>
Hi Nicolás,<br>
<br>
I assume that the host these VMs are running on is empty in the<br>
webadmin,<br>
right? if that is the case then you've probably hit [1]. Changing their<br>
status to up is not the way to go since these VMs will not be monitored.<br>
</blockquote>
Hi Arik,<br>
<br>
By "empty" you mean the webadmin reports the host being running 0 VMs?<br>
If so, that's not the case, actually the VM count seems to be correct<br>
in<br>
relation to "qemu-*" processes (about 32 VMs), I can even see the<br>
machines in the "Virtual machines" tab of the host, it's just they are<br>
all marked with the '?' mark.<br>
</blockquote>
No, I meant the 'Host' column in the Virtual Machines tab but if you<br>
see<br>
the VMs in the "Virtual machines" sub-tab of the host then run_on_vds<br>
points to the right host..<br>
<br>
The host is up in the webadmin as well?<br>
Can you share the engine log?<br>
<br>
</blockquote>
Yes, the host is up in the webadmin, there are no issues with it, just<br>
the VMs running on it have the '?' mark. I've made 3 tests:<br>
<br>
1) Restart engine: did not help<br>
2) Check firewall, seems to be ok.<br>
2) PostgreSQL: UPDATE vm_dynamic SET status = 1 WHERE status = 8; :<br>
After a while, I see lots of entries like this:<br>
<br>
2016-08-04 09:23:10,910 WARN<br>
[org.ovirt.engine.core.dal.dbb<wbr>roker.auditloghandling.AuditLo<wbr>gDirector]<br>
(DefaultQuartzScheduler4) [6ad135b8] Correlation ID: null, Call Stack:<br>
null, Custom Event ID: -1, Message: VM xxx is not responding.<br>
<br>
I'm attaching the engine log, but I don't know when did this happen for<br>
the first time, though. If there's a manual way/command to migrate VMs<br>
to a different host I'd appreciate a hint about it.<br>
<br>
Is it safe to restart vdsmd on this host?<br>
</blockquote></div></div>
The engine log looks fine - the VMs are reported as not-responding for<br>
some reason. I would restart libvirtd and vdsmd then<br>
</blockquote>
<br>
Is restarting those two daemons safe? I mean, will that stop all qemu-* processes, so the VMs marked as unknown will stop?</blockquote><div><br></div><div>Neither should touch the qemu process, but re-connect to it as they restart. </div><div>Y.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="HOEnZb"><div class="h5"><br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Thanks.<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Thanks.<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Yes, there is no other way to resolve it other than changing the DB but<br>
the change should be to update run_on_vds field of these VMs to the host<br>
you know they are running on. Their status will then be updates in 15<br>
sec.<br>
<br>
[1] <a href="https://bugzilla.redhat.com/show_bug.cgi?id=1354494" rel="noreferrer" target="_blank">https://bugzilla.redhat.com/sh<wbr>ow_bug.cgi?id=1354494</a><br>
<br>
Arik.<br>
<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Thanks.<br>
<br>
Nicolás<br>
<br>
______________________________<wbr>_________________<br>
Users mailing list<br>
<a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
<br>
</blockquote></blockquote>
<br>
</blockquote></blockquote></blockquote></blockquote>
<br>
______________________________<wbr>_________________<br>
Users mailing list<br>
<a href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/users</a><br>
</div></div></blockquote></div><br></div></div>