<div dir="ltr"><div class="gmail_extra"><div class="gmail_quote">On Tue, Jan 9, 2018 at 12:04 PM, Peter Hudec <span dir="ltr">&lt;<a href="mailto:phudec@cnc.sk" target="_blank">phudec@cnc.sk</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">quick fix is follow the<br>
<a href="https://gerrit.ovirt.org/#/c/84802/2/backend/manager/modules/utils/src/main/java/org/ovirt/engine/core/utils/ovf/IOvfBuilder.java" rel="noreferrer" target="_blank">https://gerrit.ovirt.org/#/c/<wbr>84802/2/backend/manager/<wbr>modules/utils/src/main/java/<wbr>org/ovirt/engine/core/utils/<wbr>ovf/IOvfBuilder.java</a><br>
<br>
and remove trailing &#39;/&#39; in<br>
/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/lib/ovf/ovf2VmParams.py<br></blockquote><div><br></div><div>Adding Denis. Thanks for the logs!<br></div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div class="HOEnZb"><div class="h5"><br>
<br>
On 09/01/2018 10:45, Peter Hudec wrote:<br>
&gt; the old  hypervisoer /oVirt 4.1.8/ got probblem to release the HE due<br>
&gt; this exception.  The HE is on the NFS store.<br>
&gt;<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,497::upgrade::998::<wbr>ovirt_hosted_engine_ha.lib.<wbr>upgrade.StorageServer::(<wbr>upgrade_35_36)<br>
&gt; Host configuration is already up-to-date<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,498::config::493::<wbr>ovirt_hosted_engine_ha.agent.<wbr>hosted_engine.HostedEngine.<wbr>config::(refresh_vm_conf)<br>
&gt; Reloading vm.conf from the shared storage domain<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,498::config::416::<wbr>ovirt_hosted_engine_ha.agent.<wbr>hosted_engine.HostedEngine.<wbr>config::(_get_vm_conf_content_<wbr>from_ovf_store)<br>
&gt; Trying to get a fresher copy of vm configuration from the OVF_STORE<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,498::ovf_store::132::<wbr>ovirt_hosted_engine_ha.lib.<wbr>ovf.ovf_store.OVFStore::(<wbr>getEngineVMOVF)<br>
&gt; Extracting Engine VM OVF from the OVF_STORE<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,498::ovf_store::134::<wbr>ovirt_hosted_engine_ha.lib.<wbr>ovf.ovf_store.OVFStore::(<wbr>getEngineVMOVF)<br>
&gt; OVF_STORE volume path:<br>
&gt; /var/run/vdsm/storage/<wbr>3981424d-a55c-4f07-bff2-<wbr>aca316a95d1f/3513775f-d6b0-<wbr>4423-be19-bbeb79c72ad2/<wbr>7ee3f450-5976-48f8-b667-<wbr>27b48f6cf778<br>
&gt; MainThread::INFO::2018-01-09<br>
&gt; 10:40:28,517::config::435::<wbr>ovirt_hosted_engine_ha.agent.<wbr>hosted_engine.HostedEngine.<wbr>config::(_get_vm_conf_content_<wbr>from_ovf_store)<br>
&gt; Found an OVF for HE VM, trying to convert<br>
&gt; MainThread::ERROR::2018-01-09<br>
&gt; 10:40:28,523::agent::205::<wbr>ovirt_hosted_engine_ha.agent.<wbr>agent.Agent::(_run_agent)<br>
&gt; Traceback (most recent call last):<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/agent/agent.py&quot;,<br>
&gt; line 191, in _run_agent<br>
&gt;     return action(he)<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/agent/agent.py&quot;,<br>
&gt; line 64, in action_proper<br>
&gt;     return he.start_monitoring()<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/agent/hosted_engine.py&quot;,<br>
&gt; line 421, in start_monitoring<br>
&gt;     self._config.refresh_vm_conf()<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/env/config.py&quot;,<br>
&gt; line 496, in refresh_vm_conf<br>
&gt;     content_from_ovf = self._get_vm_conf_content_<wbr>from_ovf_store()<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/env/config.py&quot;,<br>
&gt; line 438, in _get_vm_conf_content_from_ovf_<wbr>store<br>
&gt;     conf = ovf2VmParams.confFromOvf(<wbr>heovf)<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/lib/ovf/ovf2VmParams.py&quot;,<br>
&gt; line 283, in confFromOvf<br>
&gt;     vmConf = toDict(ovf)<br>
&gt;   File<br>
&gt; &quot;/usr/lib/python2.7/site-<wbr>packages/ovirt_hosted_engine_<wbr>ha/lib/ovf/ovf2VmParams.py&quot;,<br>
&gt; line 210, in toDict<br>
&gt;     vmParams[&#39;vmId&#39;] = tree.find(&#39;Content/Section&#39;).<wbr>attrib[OVF_NS + &#39;id&#39;]<br>
&gt;   File &quot;lxml.etree.pyx&quot;, line 2272, in lxml.etree._Attrib.__getitem__<br>
&gt; (src/lxml/lxml.etree.c:55336)<br>
&gt; KeyError: &#39;{<a href="http://schemas.dmtf.org/ovf/envelope/1/}id" rel="noreferrer" target="_blank">http://schemas.dmtf.org/ovf/<wbr>envelope/1/}id</a>&#39;<br>
&gt;<br>
&gt;<br>
&gt; On 09/01/2018 10:18, Peter Hudec wrote:<br>
&gt;&gt; The HA is flapping between 3400 nad 0. ;(<br>
&gt;&gt; And I&#39;m not able to migrate also any other Vm to this host.<br>
&gt;&gt;<br>
&gt;&gt; Loggs fromthe /var/log/ovirt-hosted-engine-<wbr>ha/agent.log file<br>
&gt;&gt;<br>
&gt;&gt; MainThread::INFO::2018-01-08<br>
&gt;&gt; 21:44:45,805::state_machine::<wbr>174::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Host <a href="http://dipovirt03.cnc.sk" rel="noreferrer" target="_blank">dipovirt03.cnc.sk</a> (id 1): {&#39;conf_on_shared_storage&#39;: True, &#39;extra&#39;:<br>
&gt;&gt; &#39;metadata_parse_version=1\<wbr>nmetadata_feature_version=1\<wbr>ntimestamp=8232312<br>
&gt;&gt; (Mon Jan  8 21:44:29<br>
&gt;&gt; 2018)\nhost-id=1\nscore=3400\<wbr>nvm_conf_refresh_time=8232316 (Mon Jan  8<br>
&gt;&gt; 21:44:33<br>
&gt;&gt; 2018)\nconf_on_shared_storage=<wbr>True\nmaintenance=False\<wbr>nstate=EngineUp\nstopped=<wbr>False\n&#39;,<br>
&gt;&gt; &#39;hostname&#39;: &#39;<a href="http://dipovirt03.cnc.sk" rel="noreferrer" target="_blank">dipovirt03.cnc.sk</a>&#39;, &#39;host-id&#39;: 1, &#39;engine-status&#39;:<br>
&gt;&gt; {&#39;health&#39;: &#39;good&#39;, &#39;vm&#39;: &#39;up&#39;, &#39;detail&#39;: &#39;up&#39;}, &#39;score&#39;: 3400,<br>
&gt;&gt; &#39;stopped&#39;: False, &#39;maintenance&#39;: False, &#39;crc32&#39;: &#39;f28d4648&#39;,<br>
&gt;&gt; &#39;local_conf_timestamp&#39;: 8232316, &#39;host-ts&#39;: 8232312}<br>
&gt;&gt; MainThread::INFO::2018-01-08<br>
&gt;&gt; 21:44:45,805::state_machine::<wbr>174::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Host <a href="http://dipovirt02.cnc.sk" rel="noreferrer" target="_blank">dipovirt02.cnc.sk</a> (id 3): {&#39;conf_on_shared_storage&#39;: True, &#39;extra&#39;:<br>
&gt;&gt; &#39;metadata_parse_version=1\<wbr>nmetada...skipping...<br>
&gt;&gt; neVMOVF) OVF_STORE volume path:<br>
&gt;&gt; /var/run/vdsm/storage/<wbr>3981424d-a55c-4f07-bff2-<wbr>aca316a95d1f/3513775f-d6b0-<wbr>4423-be19-bbe<br>
&gt;&gt; b79c72ad2/7ee3f450-5976-48f8-<wbr>b667-27b48f6cf778<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,904::state_machine::<wbr>169::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Global metadata: {&#39;maintenance&#39;: False}<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,905::state_machine::<wbr>174::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Host <a href="http://dipovirt03.cnc.sk" rel="noreferrer" target="_blank">dipovirt03.cnc.sk</a> (id 1): {&#39;conf_on_shared_storage&#39;: True, &#39;extra&#39;:<br>
&gt;&gt; &#39;metadata_parse_version=1\<wbr>nmetadata_feature_version=1\<wbr>ntimestamp=38598<br>
&gt;&gt; (Tue Jan  9 09:23:33<br>
&gt;&gt; 2018)\nhost-id=1\nscore=3400\<wbr>nvm_conf_refresh_time=38598 (Tue Jan  9<br>
&gt;&gt; 09:23:34<br>
&gt;&gt; 2018)\nconf_on_shared_storage=<wbr>True\nmaintenance=False\<wbr>nstate=EngineUp\nstopped=<wbr>False\n&#39;,<br>
&gt;&gt; &#39;hostname&#39;: &#39;<a href="http://dipovirt03.cnc.sk" rel="noreferrer" target="_blank">dipovirt03.cnc.sk</a>&#39;, &#39;alive&#39;: False, &#39;host-id&#39;: 1,<br>
&gt;&gt; &#39;engine-status&#39;: {&#39;health&#39;: &#39;good&#39;, &#39;vm&#39;: &#39;up&#39;, &#39;detail&#39;: &#39;up&#39;},<br>
&gt;&gt; &#39;score&#39;: 3400, &#39;stopped&#39;: False, &#39;maintenance&#39;: False, &#39;crc32&#39;:<br>
&gt;&gt; &#39;4c1d1890&#39;, &#39;local_conf_timestamp&#39;: 38598, &#39;host-ts&#39;: 38598}<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,905::state_machine::<wbr>174::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Host <a href="http://dipovirt02.cnc.sk" rel="noreferrer" target="_blank">dipovirt02.cnc.sk</a> (id 3): {&#39;conf_on_shared_storage&#39;: True, &#39;extra&#39;:<br>
&gt;&gt; &#39;metadata_parse_version=1\<wbr>nmetadata_feature_version=1\<wbr>ntimestamp=40677<br>
&gt;&gt; (Tue Jan  9 09:24:11<br>
&gt;&gt; 2018)\nhost-id=3\nscore=3400\<wbr>nvm_conf_refresh_time=40677 (Tue Jan  9<br>
&gt;&gt; 09:24:11<br>
&gt;&gt; 2018)\nconf_on_shared_storage=<wbr>True\nmaintenance=False\<wbr>nstate=EngineDown\nstopped=<wbr>False\n&#39;,<br>
&gt;&gt; &#39;hostname&#39;: &#39;<a href="http://dipovirt02.cnc.sk" rel="noreferrer" target="_blank">dipovirt02.cnc.sk</a>&#39;, &#39;alive&#39;: False, &#39;host-id&#39;: 3,<br>
&gt;&gt; &#39;engine-status&#39;: {&#39;reason&#39;: &#39;vm not running on this host&#39;, &#39;health&#39;:<br>
&gt;&gt; &#39;bad&#39;, &#39;vm&#39;: &#39;down&#39;, &#39;detail&#39;: &#39;unknown&#39;}, &#39;score&#39;: 3400, &#39;stopped&#39;:<br>
&gt;&gt; False, &#39;maintenance&#39;: False, &#39;crc32&#39;: &#39;3bf104bc&#39;,<br>
&gt;&gt; &#39;local_conf_timestamp&#39;: 40677, &#39;host-ts&#39;: 40677}<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,905::state_machine::<wbr>177::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(refresh)<br>
&gt;&gt; Local (id 2): {&#39;engine-health&#39;: {&#39;reason&#39;: &#39;vm not running on this<br>
&gt;&gt; host&#39;, &#39;health&#39;: &#39;bad&#39;, &#39;vm&#39;: &#39;down&#39;, &#39;detail&#39;: &#39;unknown&#39;}, &#39;bridge&#39;:<br>
&gt;&gt; True, &#39;mem-free&#39;: 39540.0, &#39;maintenance&#39;: False, &#39;cpu-load&#39;: 0.0432,<br>
&gt;&gt; &#39;gateway&#39;: 1.0, &#39;storage-domain&#39;: True}<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,905::states::775::<wbr>ovirt_hosted_engine_ha.agent.<wbr>hosted_engine.HostedEngine::(<wbr>consume)<br>
&gt;&gt; Another host already took over..<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:13,928::state_<wbr>decorators::88::ovirt_hosted_<wbr>engine_ha.agent.hosted_engine.<wbr>HostedEngine::(check)<br>
&gt;&gt; Timeout cleared while transitioning &lt;class<br>
&gt;&gt; &#39;ovirt_hosted_engine_ha.agent.<wbr>states.EngineStarting&#39;&gt; -&gt; &lt;class<br>
&gt;&gt; &#39;ovirt_hosted_engine_ha.agent.<wbr>states.EngineForceStop&#39;&gt;<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:14,046::brokerlink::68::<wbr>ovirt_hosted_engine_ha.lib.<wbr>brokerlink.BrokerLink::(<wbr>notify)<br>
&gt;&gt; Success, was notification of state_transition<br>
&gt;&gt; (EngineStarting-<wbr>EngineForceStop) sent? sent<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:14,464::hosted_engine::<wbr>494::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_monitoring_<wbr>loop)<br>
&gt;&gt; Current state EngineForceStop (score: 3400)<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:14,467::hosted_engine::<wbr>1002::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_stop_engine_<wbr>vm)<br>
&gt;&gt; Shutting down vm using `/usr/sbin/hosted-engine --vm-poweroff`<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:15,198::hosted_engine::<wbr>1007::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_stop_engine_<wbr>vm)<br>
&gt;&gt; stdout:<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:15,198::hosted_engine::<wbr>1008::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_stop_engine_<wbr>vm)<br>
&gt;&gt; stderr: Command VM.destroy with args {&#39;vmID&#39;:<br>
&gt;&gt; &#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;} failed:<br>
&gt;&gt; (code=1, message=Virtual machine does not exist: {&#39;vmId&#39;:<br>
&gt;&gt; u&#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;})<br>
&gt;&gt;<br>
&gt;&gt; MainThread::ERROR::2018-01-09<br>
&gt;&gt; 10:15:15,199::hosted_engine::<wbr>1013::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_stop_engine_<wbr>vm)<br>
&gt;&gt; Failed to stop engine vm with /usr/sbin/hosted-engine --vm-poweroff:<br>
&gt;&gt; Command VM.destroy with args {&#39;vmID&#39;:<br>
&gt;&gt; &#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;} failed:<br>
&gt;&gt; (code=1, message=Virtual machine does not exist: {&#39;vmId&#39;:<br>
&gt;&gt; u&#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;})<br>
&gt;&gt;<br>
&gt;&gt; MainThread::ERROR::2018-01-09<br>
&gt;&gt; 10:15:15,199::hosted_engine::<wbr>1019::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_stop_engine_<wbr>vm)<br>
&gt;&gt; Failed to stop engine VM: Command VM.destroy with args {&#39;vmID&#39;:<br>
&gt;&gt; &#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;} failed:<br>
&gt;&gt; (code=1, message=Virtual machine does not exist: {&#39;vmId&#39;:<br>
&gt;&gt; u&#39;9a8ea503-f598-433e-9751-<wbr>93aee3e7b347&#39;})<br>
&gt;&gt;<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:15,317::brokerlink::68::<wbr>ovirt_hosted_engine_ha.lib.<wbr>brokerlink.BrokerLink::(<wbr>notify)<br>
&gt;&gt; Success, was notification of state_transition<br>
&gt;&gt; (EngineForceStop-<wbr>ReinitializeFSM) sent? sent<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:15,356::hosted_engine::<wbr>494::ovirt_hosted_engine_ha.<wbr>agent.hosted_engine.<wbr>HostedEngine::(_monitoring_<wbr>loop)<br>
&gt;&gt; Current state ReinitializeFSM (score: 0)<br>
&gt;&gt; MainThread::INFO::2018-01-09<br>
&gt;&gt; 10:15:25,560::brokerlink::68::<wbr>ovirt_hosted_engine_ha.lib.<wbr>brokerlink.BrokerLink::(<wbr>notify)<br>
&gt;&gt; Success, was notification of state_transition<br>
&gt;&gt; (ReinitializeFSM-EngineDown) sent? sent<br>
&gt;&gt;<br>
&gt;&gt;      Peter<br>
&gt;&gt;<br>
&gt;&gt; On 09/01/2018 09:35, Yedidyah Bar David wrote:<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;     3) Hosted Engine HA:<br>
&gt;&gt;&gt;     Hosted Engine HA on upgraded hosts is 3400, the same as on the 4.1<br>
&gt;&gt;&gt;     hosts. Is this good or bad?<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt;<br>
&gt;&gt;&gt; It&#39;s good.<br>
&gt;&gt;<br>
&gt;&gt;<br>
&gt;<br>
&gt;<br>
<br>
<br>
--<br>
*Peter Hudec*<br>
Infraštruktúrny architekt<br>
<a href="mailto:phudec@cnc.sk">phudec@cnc.sk</a> &lt;mailto:<a href="mailto:phudec@cnc.sk">phudec@cnc.sk</a>&gt;<br>
<br>
*CNC, a.s.*<br>
Borská 6, 841 04 Bratislava<br>
Recepcia: <a href="tel:%2B421%202%C2%A0%2035%20000%20100" value="+421235000100">+421 2  35 000 100</a><br>
<br>
Mobil:<a href="tel:%2B421%C2%A0905%20997%20203" value="+421905997203">+421 905 997 203</a><br>
*<a href="http://www.cnc.sk" rel="noreferrer" target="_blank">www.cnc.sk</a>* &lt;http:///<a href="http://www.cnc.sk" rel="noreferrer" target="_blank">www.cnc.sk</a>&gt;<br>
<br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br><div class="gmail_signature" data-smartmail="gmail_signature">Didi<br></div>
</div></div>