<div dir="ltr">Promised patch: <a href="https://gerrit.ovirt.org/#/c/82203/">https://gerrit.ovirt.org/#/c/82203/</a><br></div><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 26, 2017 at 11:44 AM, Miroslava Voglova <span dir="ltr">&lt;<a href="mailto:mvoglova@redhat.com" target="_blank">mvoglova@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr">So we found where is problem. There was patch before some time that was reverted and had two badly formatted jsons in value of vdc_option. If you have db from that time, the bug will appear, because the values are not updated (because they are already in db).<div><br><div>To fix this, its enough to change &#39;HotPlugMemorySupported&#39; to value &#39;{&quot;x86&quot;:&quot;true&quot;,&quot;ppc&quot;:&quot;true&quot;}&#39; for versions 4.0 - 4.2 and  &#39;HotUnplugMemorySupported&#39; to &#39;{&quot;x86&quot;:&quot;true&quot;,&quot;ppc&quot;:&quot;true&quot;}&#39; for 4.2.</div><div><br></div><div>Will do the patch that will include this update in 0000_config in case anyone else had the same problem.</div></div></div><div class="HOEnZb"><div class="h5"><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 26, 2017 at 10:12 AM, Fred Rolland <span dir="ltr">&lt;<a href="mailto:frolland@redhat.com" target="_blank">frolland@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><div>I have same issue with new VM :<br><br>2017-09-26 11:07:59,255+03 ERROR [org.ovirt.engine.core.bll.Get<wbr>ArchitectureCapabilitiesQuery] (default task-2) [c066bc1d-4048-4b5f-bdb6-74fd8<wbr>13aa82e] Query &#39;GetArchitectureCapabilitiesQu<wbr>ery&#39; failed: null<br>2017-09-26 11:07:59,255+03 ERROR [org.ovirt.engine.core.bll.Get<wbr>ArchitectureCapabilitiesQuery] (default task-2) [c066bc1d-4048-4b5f-bdb6-74fd8<wbr>13aa82e] Exception: java.lang.NullPointerException<br>    at org.ovirt.engine.core.common.F<wbr>eatureSupported.supportedInCon<wbr>fig(FeatureSupported.java:23) [common.jar:]<br>    at org.ovirt.engine.core.common.F<wbr>eatureSupported.hotUnplugMemor<wbr>y(FeatureSupported.java:43) [common.jar:]<br>    at org.ovirt.engine.core.bll.GetA<wbr>rchitectureCapabilitiesQuery.<wbr>isSupported(GetArchitectureCap<wbr>abilitiesQuery.java:66) [bll.jar:]<br>    at org.ovirt.engine.core.bll.GetA<wbr>rchitectureCapabilitiesQuery.<wbr>getMap(GetArchitectureCapabili<wbr>tiesQuery.java:36) [bll.jar:]<br>    at org.ovirt.engine.core.bll.GetA<wbr>rchitectureCapabilitiesQuery.<wbr>executeQueryCommand(GetArchite<wbr>ctureCapabilitiesQuery.java:<wbr>22) [bll.jar:]<br>    at org.ovirt.engine.core.bll.Quer<wbr>iesCommandBase.executeCommand(<wbr>QueriesCommandBase.java:106) [bll.jar:]<br>    at org.ovirt.engine.core.dal.VdcC<wbr>ommandBase.execute(VdcCommandB<wbr>ase.java:33) [dal.jar:]<br>    at org.ovirt.engine.core.bll.exec<wbr>utor.DefaultBackendQueryExecut<wbr>or.execute(DefaultBackendQuery<wbr>Executor.java:14) [bll.jar:]<br><br></div>Then:<br><br>2017-09-26 11:08:08,397+03 ERROR [org.ovirt.engine.core.vdsbrok<wbr>er.CreateVDSCommand] (org.ovirt.thread.EE-ManagedTh<wbr>readFactory-engine-Thread-11) [9eb76d15-2bd5-49a9-a7d8-c73cf<wbr>d55282c] Failed to create VM: null<br>2017-09-26 11:08:08,398+03 ERROR [org.ovirt.engine.core.vdsbrok<wbr>er.CreateVDSCommand] (org.ovirt.thread.EE-ManagedTh<wbr>readFactory-engine-Thread-11) [9eb76d15-2bd5-49a9-a7d8-c73cf<wbr>d55282c] Command &#39;CreateVDSCommand( CreateVDSCommandParameters:{ho<wbr>stId=&#39;b6b6a226-8d4f-4929-85d1-<wbr>b218eceee99e&#39;, vmId=&#39;f15ddd07-408b-4665-aede-<wbr>a9efc5716dc7&#39;, vm=&#39;VM [FEDORA_CINDER]&#39;})&#39; execution failed: java.lang.NullPointerException<br><br></div><div class="m_5853326501725569905HOEnZb"><div class="m_5853326501725569905h5"><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 26, 2017 at 10:26 AM, Tomas Jelinek <span dir="ltr">&lt;<a href="mailto:tjelinek@redhat.com" target="_blank">tjelinek@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote"><span>On Tue, Sep 26, 2017 at 9:17 AM, Miroslava Voglova <span dir="ltr">&lt;<a href="mailto:mvoglova@redhat.com" target="_blank">mvoglova@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr">From 4.0 architecture family was renamed in script 04_00_0080_rename_archi<wbr>tecture_family. So &#39;HotPlugCpuSupported&#39;, &#39;HotUnp<wbr>lugCpuSupported&#39;, &#39;HotPlugMemo<wbr>rySupported&#39;, &#39;HotUnplugMemory<wbr>Supported&#39;, &#39;IsMigrationSuppor<wbr>ted&#39;,  &#39;IsMemorySnapshotSupported&#39; and &#39;IsSuspendSupported&#39; are all in db with x86 not x86_64. In my point of view nothing wrong with that particular line in [1].<div><br><div>Could be that somewhere in code is not used architecture family, but host architecture, when asked for value of this ConfigValues. But that would throw exception even before my patch, because &#39;{&quot;x86:&quot;true&quot;,&quot;ppc&quot;:&quot;true&quot;}&#39; was default value for HotPlugMemorySupported.<br></div></div></div></blockquote><div><br></div></span><div>I see a code path where the cluster arch can be set to x86_64 - it is always executed for external VMs (imported from external provider or unmanaged). It does not happen all the time, it is only a fallback if the arch type is not known/reported etc.</div><div><br></div><div>@Alexander: by any chance, was this VM an unmanaged one? Or imported? In logs you should find something like:</div><div>&quot;Illegal architecture type: {}, replacing with x86_64&quot; or &quot;null architecture type, replacing with x86_64, {}&quot;.</div><div><br></div><div>Also, if you create a new VM, can you start it?<br></div><div><div class="m_5853326501725569905m_3128282558985985031h5"><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div><div><div><br></div><div><br></div><div><span style="font-size:12.8px">[1] </span><font color="#1155cc"><span style="font-size:12.8px"><u><a href="https://gerrit.ovirt.org/#/c/81464/7/packaging/dbscripts/upgrade/pre_upgrade/0000_config.sql" target="_blank">https://gerrit.ovirt.org/#<wbr>/c/81464/7/packaging/dbscripts<wbr>/upgrade/pre_upgrade/0000_conf<wbr>ig.sql</a></u></span></font><br></div></div></div></div><div class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-HOEnZb"><div class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-h5"><div class="gmail_extra"><br><div class="gmail_quote">On Tue, Sep 26, 2017 at 9:00 AM, Tomas Jelinek <span dir="ltr">&lt;<a href="mailto:tjelinek@redhat.com" target="_blank">tjelinek@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><br><div class="gmail_extra"><br><div class="gmail_quote"><span>On Mon, Sep 25, 2017 at 10:08 PM, Roy Golan <span dir="ltr">&lt;<a href="mailto:rgolan@redhat.com" target="_blank">rgolan@redhat.com</a>&gt;</span> wrote:<br><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><br><br><div class="gmail_quote"><span class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-m_-3832170205561208090m_-8422389147944334059gmail-"><div dir="ltr">On Mon, 25 Sep 2017 at 22:52 Alexander Wels &lt;<a href="mailto:awels@redhat.com" target="_blank">awels@redhat.com</a>&gt; wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">On Monday, September 25, 2017 3:50:56 PM EDT Roy Golan wrote:<br>
&gt; So somewhere in the code somebody used the Arch and not the family. See the<br>
&gt; enum getFamily() method<br>
&gt;<br>
<br>
Yep, in particular line 23 of FeatureSupported.java.<br>
<br></blockquote></span><div>I meant the caller of the method on this line. Do you have it in the trace so we can see who passed x86_64 as arch ?</div><div><div class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-m_-3832170205561208090m_-8422389147944334059gmail-h5"><div> <br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
&gt; On Mon, 25 Sep 2017 at 22:31 Alexander Wels &lt;<a href="mailto:awels@redhat.com" target="_blank">awels@redhat.com</a>&gt; wrote:<br>
&gt; &gt; On Monday, September 25, 2017 3:24:14 PM EDT Roy Golan wrote:<br>
&gt; &gt; &gt; what JRE are you using? any change with that?<br>
&gt; &gt;<br>
&gt; &gt; So I just figured out the problem, and its really strange. It has nothing<br>
&gt; &gt; to<br>
&gt; &gt; do with the SSL as the stack trace is mentioning. I manually stepped<br>
&gt; &gt; through<br>
&gt; &gt; the code to see what was going on and it turns out it is failing in<br>
&gt; &gt; FeatureSupported.java in supportedInConfig call from hotPlugMemory.<br>
&gt; &gt;<br>
&gt; &gt; The Config.&lt;Map&gt;getValue(feature, version.getValue()) (version is 4.2) is<br>
&gt; &gt; returning a map containing x86=true and ppc=true. But then it compares<br>
&gt; &gt; this to<br>
&gt; &gt; ArchitectureType.name() it returns null, because .name() return x86_64. No<br>
&gt; &gt; it<br>
&gt; &gt; appears that sometime during the last few months we dropped the _64 in the<br>
&gt; &gt; ArchitectureType, or at least in the database.<br></blockquote></div></div></div></div></blockquote><div><br></div></span><div>It looks a lot like introduced here: <a href="https://gerrit.ovirt.org/#/c/81464/" target="_blank">https://gerrit.ovirt.org/#/c/8<wbr>1464/</a></div><div><br></div><div>@Mirka: what you think?<br></div><div><div class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-m_-3832170205561208090h5"><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div dir="ltr"><div class="gmail_quote"><div><div class="m_5853326501725569905m_3128282558985985031m_9154079743965586961gmail-m_-3832170205561208090m_-8422389147944334059gmail-h5"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
&gt; &gt;<br>
&gt; &gt; As soon as I added a vdc_options tha contains x86_64 value for that key it<br>
&gt; &gt; started working. Now I have checked with Greg who has a fresh database<br>
&gt; &gt; that he<br>
&gt; &gt; can start VMs no problem, and his database contains x86 instead of x86_64.<br>
&gt; &gt;<br>
&gt; &gt; &gt; On Mon, 25 Sep 2017 at 21:12 Alexander Wels &lt;<a href="mailto:awels@redhat.com" target="_blank">awels@redhat.com</a>&gt; wrote:<br>
&gt; &gt; &gt; &gt; Hi guys,<br>
&gt; &gt; &gt; &gt;<br>
&gt; &gt; &gt; &gt; I see to be having an issue starting VMs with the latest master.<br>
&gt; &gt;<br>
&gt; &gt; Whenever<br>
&gt; &gt;<br>
&gt; &gt; &gt; &gt; I<br>
&gt; &gt; &gt; &gt; try to start a VM I get null pointer exception. And the VM doesn&#39;t<br>
&gt; &gt;<br>
&gt; &gt; start.<br>
&gt; &gt;<br>
&gt; &gt; &gt; &gt; I<br>
&gt; &gt; &gt; &gt; have debugged the engine, and it appears that the null pointer happens<br>
&gt; &gt; &gt; &gt; after<br>
&gt; &gt; &gt; &gt; the engine tries to connect to the host. In the stack trace I see<br>
&gt; &gt; &gt; &gt; SSLPeerUnverifiedException, so it appears something went wrong with a<br>
&gt; &gt; &gt; &gt; certificate somewhere.<br>
&gt; &gt; &gt; &gt;<br>
&gt; &gt; &gt; &gt; I have put my hosts in maintaince and re-enrolled the certificate, but<br>
&gt; &gt; &gt; &gt; that<br>
&gt; &gt; &gt; &gt; doesn&#39;t appear to be helping at all. Any other place I need to look at<br>
&gt; &gt;<br>
&gt; &gt; to<br>
&gt; &gt;<br>
&gt; &gt; &gt; &gt; make<br>
&gt; &gt; &gt; &gt; sure the engine can talk to the hosts? This appears to have started<br>
&gt; &gt;<br>
&gt; &gt; after<br>
&gt; &gt;<br>
&gt; &gt; &gt; &gt; I<br>
&gt; &gt; &gt; &gt; upgraded Wildfly to 11, so it is possible it has something to do with<br>
&gt; &gt;<br>
&gt; &gt; that<br>
&gt; &gt;<br>
&gt; &gt; &gt; &gt; as<br>
&gt; &gt; &gt; &gt; well.<br>
&gt; &gt; &gt; &gt;<br>
&gt; &gt; &gt; &gt; Any help figuring this out would be appreciated.<br>
&gt; &gt; &gt; &gt;<br>
&gt; &gt; &gt; &gt; Alexander<br>
&gt; &gt; &gt; &gt; ______________________________<wbr>_________________<br>
&gt; &gt; &gt; &gt; Devel mailing list<br>
&gt; &gt; &gt; &gt; <a href="mailto:Devel@ovirt.org" target="_blank">Devel@ovirt.org</a><br>
&gt; &gt; &gt; &gt; <a href="http://lists.ovirt.org/mailman/listinfo/devel" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/devel</a><br>
<br>
<br>
</blockquote></div></div></div></div>
<br>______________________________<wbr>_________________<br>
Devel mailing list<br>
<a href="mailto:Devel@ovirt.org" target="_blank">Devel@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/devel" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/devel</a><br></blockquote></div></div></div><br></div></div>
</blockquote></div><br></div>
</div></div></blockquote></div></div></div><br></div></div>
<br>______________________________<wbr>_________________<br>
Devel mailing list<br>
<a href="mailto:Devel@ovirt.org" target="_blank">Devel@ovirt.org</a><br>
<a href="http://lists.ovirt.org/mailman/listinfo/devel" rel="noreferrer" target="_blank">http://lists.ovirt.org/mailman<wbr>/listinfo/devel</a><br></blockquote></div><br></div>
</div></div></blockquote></div><br></div>
</div></div></blockquote></div><br></div>