<div dir="ltr">Nir,<div><br></div><div>Looks like it's crashing on the dmidecode system call.</div><div><br></div><div>I've attached the output from gdb as well as a dmidecode text dump, a dmidecode binary dump, and the output of each keyword run individually.</div><div><br></div><div>From the keywords it looks like my DMI info is corrupted. I have downloaded an AMI DMI editor, but it only allows access to a limited set of fields. Do you know of another tool to rewrite the DMI info?</div><div><br></div><div><br></div><div>Thanks so much for your help.</div><div><br></div><div>Cheers,</div><div><br></div></div><div class="gmail_extra"><br><div class="gmail_quote">On Thu, Oct 13, 2016 at 5:34 AM, Nir Soffer <span dir="ltr"><<a href="mailto:nsoffer@redhat.com" target="_blank">nsoffer@redhat.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">On Tue, Oct 11, 2016 at 11:59 PM, David Pinkerton <<a href="mailto:dpinkert@redhat.com">dpinkert@redhat.com</a>> wrote:<br>
> Logs attached<br>
<br>
According to vdsm.log and supervdsm.log, each time vdsm tries to call<br>
getHardwareInfo, supervdsm logs the start of the call, then logs nothing<br>
for 10 seconds, and then we see the startup message.<br>
<br>
So it seems that supervdsm crashes each time it tries to invoke the dmidecode<br>
code.<br>
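<br>
That would also explain the EOFError in the vdsm traceback quoted further down: when the process on<br>
the other end of a multiprocessing connection dies before replying, recv() raises EOFError.<br>
A minimal sketch of that behaviour, not vdsm's actual code (crashing_worker and call_worker are<br>
made-up names, and it assumes Python 3 on a POSIX system so the "fork" start method is available):<br>
<pre>
```python
import multiprocessing
import os


def crashing_worker(conn):
    # Read the request, then die without replying, like a server that crashes mid-call.
    conn.recv()
    os._exit(1)


def call_worker():
    """Send one request to a worker that crashes and report what the caller sees."""
    ctx = multiprocessing.get_context("fork")  # assumption: POSIX only
    parent_end, child_end = ctx.Pipe()
    proc = ctx.Process(target=crashing_worker, args=(child_end,))
    proc.start()
    # Close our copy of the worker's end so EOF is visible once the worker is gone.
    child_end.close()
    try:
        parent_end.send("getHardwareInfo")
        return parent_end.recv()
    except EOFError:
        # The peer closed the connection without sending a reply.
        return "EOFError"
    finally:
        proc.join()
```
</pre>
Calling call_worker() here returns the string "EOFError", which is the same symptom vdsm reports.<br>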
<br>
To dig deeper, I suggest you run the relevant code from the<br>
shell. If this<br>
code crashes, we will see the details in the shell, and we can also run the Python<br>
shell in gdb to debug it.<br>
<br>
Try this:<br>
<br>
1. Open a python shell as root<br>
<br>
$ sudo python<br>
<br>
2. In the shell, type this<br>
<br>
>>> from vdsm import dmidecodeUtil<br>
>>> dmidecodeUtil.<wbr>getHardwareInfoStructure()<br>
<br>
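If you prefer not to lose the shell when the call crashes, the same two lines can be run in a<br>
child interpreter. This is only a sketch: run_isolated is a hypothetical helper, not part of vdsm,<br>
and it assumes a POSIX host, where a negative return code means the child was killed by a signal.<br>
It is written to work on both Python 2.7 and Python 3:<br>
<pre>
```python
import signal
import subprocess
import sys

# Map signal numbers to names (works on Python 2.7 and 3).
SIGNAL_NAMES = dict(
    (getattr(signal, name), name)
    for name in dir(signal)
    if name.startswith("SIG") and not name.startswith("SIG_")
)


def run_isolated(snippet):
    """Run a Python snippet in a child interpreter and describe how it ended."""
    proc = subprocess.Popen(
        [sys.executable, "-c", snippet],
        stdout=subprocess.PIPE,
        stderr=subprocess.PIPE,
    )
    proc.communicate()  # wait for the child and drain its output
    rc = proc.returncode
    if rc < 0:
        # On POSIX a negative return code is the number of the fatal signal.
        return "killed by %s" % SIGNAL_NAMES.get(-rc, "signal %d" % -rc)
    if rc != 0:
        return "exited with status %d" % rc
    return "ok"
```
</pre>
For example, run_isolated("from vdsm import dmidecodeUtil\ndmidecodeUtil.getHardwareInfoStructure()")<br>
reports whether the child exited normally, failed with an exception, or was killed by a signal.<br>
<br>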
If the Python shell crashes at this point, please try:<br>
<br>
1. Install python debug-info packages:<br>
<br>
$ sudo debuginfo-install -y python<br>
<br>
2. Start python in gdb<br>
<br>
$ sudo gdb python<br>
<br>
3. In the gdb shell, run python<br>
<br>
(gdb) run<br>
<br>
The Python shell will start; type the code above again.<br>
<br>
If this crashes in gdb, please type this in the gdb shell:<br>
<br>
(gdb) thread apply all bt full<br>
<br>
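The gdb steps above can also be run non-interactively. A rough sketch, assuming gdb and the<br>
debug-info packages from step 1 are installed (build_gdb_command is a made-up helper name; the<br>
gdb options used are -batch, -ex and --args):<br>
<pre>
```python
def build_gdb_command(snippet, python="python"):
    """Return an argv list that runs a Python snippet under gdb in batch mode."""
    return [
        "gdb",
        "-batch",                            # exit once the commands below are done
        "-ex", "run",                        # start the program
        "-ex", "thread apply all bt full",   # after a crash, dump all backtraces
        "--args", python, "-c", snippet,     # the program and its arguments
    ]
```
</pre>
The resulting list can be passed to subprocess.call(), prefixed with "sudo" if needed, with the<br>
output redirected to a file to attach to the thread.<br>
<br>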
<br>
Nir<br>
<br>
><br>
> On Mon, Oct 10, 2016 at 4:59 PM, Nir Soffer <<a href="mailto:nsoffer@redhat.com">nsoffer@redhat.com</a>> wrote:<br>
>><br>
>> On Mon, Oct 10, 2016 at 5:05 AM, Charles Kozler <<a href="mailto:ckozleriii@gmail.com">ckozleriii@gmail.com</a>><br>
>> wrote:<br>
>>><br>
>>> Possibly stupid question but are you doing this on a base empty<br>
>>> centos/rhel 7?<br>
>>><br>
>>><br>
>>> On Oct 9, 2016 9:48 PM, "David Pinkerton" <<a href="mailto:dpinkert@redhat.com">dpinkert@redhat.com</a>> wrote:<br>
>>>><br>
>>>><br>
>>>> I've spent the weekend trying to get to the bottom of this issue.<br>
>>>><br>
>>>> Adding a Host fails:<br>
>>>><br>
>>>> From RHVM<br>
>>>><br>
>>>><br>
>>>> VDSM rhv1 command failed: Connection reset by peer<br>
>>>> Could not get hardware information for host rhv1<br>
>>>> VDSM rhv1 command failed: Failed to read hardware information<br>
>>>> Host rhv1 installed<br>
>>>> Network changes were saved on host rhv1<br>
>>>> Installing Host rhv1. Stage: Termination.<br>
>>>> Installing Host rhv1. Retrieving installation logs to:<br>
>>>> '/var/log/ovirt-engine/host-<wbr>deploy/ovirt-host-deploy-<wbr>20161010115606-192.168.21.71-<wbr>24d39274.log'.<br>
>>>> Installing Host rhv1. Stage: Pre-termination.<br>
>>>> Installing Host rhv1. Starting ovirt-vmconsole-host-sshd.<br>
>>>> Installing Host rhv1. Starting vdsm.<br>
>>>> Installing Host rhv1. Stopping libvirtd.<br>
>>>> Installing Host rhv1. Stage: Closing up.<br>
>>>> Installing Host rhv1. Setting kernel arguments.<br>
>>>> Installing Host rhv1. Stage: Transaction commit.<br>
>>>> Installing Host rhv1. Enrolling serial console certificate.<br>
>>>> Installing Host rhv1. Enrolling certificate.<br>
>>>> Installing Host rhv1. Stage: Misc configuration.<br>
>>>><br>
>>>><br>
>>>><br>
>>>> This was in the /var/log/vdsm/vdsm.log on the host trying to be added:<br>
>>>><br>
>>>> jsonrpc.Executor/2::ERROR::<wbr>2016-10-10<br>
>>>> 11:57:10,276::API::1340::vds::<wbr>(getHardwareInfo) failed to retrieve hardware<br>
>>>> info<br>
>>>> Traceback (most recent call last):<br>
>>>> File "/usr/share/vdsm/API.py", line 1337, in getHardwareInfo<br>
>>>> hw = supervdsm.getProxy().<wbr>getHardwareInfo()<br>
>>>> File "/usr/lib/python2.7/site-<wbr>packages/vdsm/supervdsm.py", line 53, in<br>
>>>> __call__<br>
>>>> return callMethod()<br>
>>>> File "/usr/lib/python2.7/site-<wbr>packages/vdsm/supervdsm.py", line 51, in<br>
>>>> <lambda><br>
>>>> **kwargs)<br>
>>>> File "<string>", line 2, in getHardwareInfo<br>
>>>> File "/usr/lib64/python2.7/<wbr>multiprocessing/managers.py", line 759, in<br>
>>>> _callmethod<br>
>>>> kind, result = conn.recv()<br>
>>>> EOFError<br>
>><br>
>><br>
>> If a request to supervdsm fails with EOFError, something bad happened in<br>
>> supervdsm and we would see the exception in the supervdsm log.<br>
>><br>
>> Can you share supervdsm.log?<br>
>><br>
>> Nir<br>
><br>
><br>
><br>
><br>
> --<br>
><br>
> David Pinkerton<br>
> Consultant<br>
> Red Hat Asia Pacific Pty. Ltd.<br>
> Level 11, Canberra House<br>
> 40 Marcus Clarke Street<br>
> Canberra 2600 ACT<br>
><br>
> Mobile: +61-488-904-232<br>
> Email: <a href="mailto:david.pinkerton@redhat.com">david.pinkerton@redhat.com</a><br>
> Web: <a href="http://apac.redhat.com/" rel="noreferrer" target="_blank">http://apac.redhat.com/</a><br>
><br>
><br>
> ______________________________<wbr>_________________<br>
> Users mailing list<br>
> <a href="mailto:Users@ovirt.org">Users@ovirt.org</a><br>
> <a href="http://lists.ovirt.org/mailman/listinfo/users" rel="noreferrer" target="_blank">http://lists.ovirt.org/<wbr>mailman/listinfo/users</a><br>
><br>
</blockquote></div><br><br clear="all"><div><br></div>-- <br><div class="gmail_signature" data-smartmail="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><pre cols="72">David Pinkerton
Consultant
Red Hat Asia Pacific Pty. Ltd.
Level 11, Canberra House
40 Marcus Clarke Street
Canberra 2600 ACT
Mobile: +61-488-904-232
Email: <a href="mailto:david.pinkerton@redhat.com" target="_blank">david.pinkerton@redhat.com</a>
Web: <a href="http://apac.redhat.com/" target="_blank">http://apac.redhat.com/</a>
</pre>
</div></div></div></div></div></div></div></div></div></div>
</div>