[Users] BUG: soft lockup

------=_Part_1216_16598823.1363371726540 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now. Any idea? ------=_Part_1216_16598823.1363371726540 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 7bit <html><head><style type='text/css'>p { margin: 0; }</style></head><body><div style='font-family: verdana,helvetica,sans-serif; font-size: 10pt; color: #330066'>I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died<br><br>this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.<br><br>Any idea?<br></div></body></html> ------=_Part_1216_16598823.1363371726540--

On Fri, Mar 15, 2013 at 7:21 PM, wrote:
I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died
this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.
Any idea?
I can only say that I hadd the same problem some days ago when I tried to add a new node to a pre-existing oVirt 3.2 install. I erroneously supposed it was a problem with my hw but when I tried to add another same hw node it got the same error. It was your same kernel giving problems. I solved the problem installing the same kernel as the first node, that was the latest 3.7.x in F18 release. Possibly 3.8.2 having problems with some cpus. My nodes were HP blades BL685c G1 with AMD Opteron G2 processors. Not had time to open bug or query on F18 user list yet... Yours? I also have a running oVirt 3.2 all-in-one setup with Fedora 18 and kernel 3.8.1-201.fc18.x86_64. In this case the cpu is (from cpuinfo) AMD Athlon(tm) II X4 630 Processor and configured in oVirt as "AMD Opteron G3" I'm going to upgrade this install with these steps: 1) only kernel and see what happens 2) upgrade to 3.2.1 Gianluca

Well, this happens a second time. I did an upgrade to the system, engine and fedora18 hosts, oVirt Engine Version: 3.2.1-1.fc18. Now I have a 2 Fujitsu servers. Both with kernel 3.8.2 - 206.fc18.x86_64, CPU one host : Intel(R) Xeon(R) CPU X3430 @ 2.40GHz (SMT Disabled), CPU other host : Intel(R) Xeon(R) CPU X3450 (SMT Enabled), installed packages on both hosts: vdsm-python-4.10.3-9.fc18.x86_64 vdsm-gluster-4.10.3-9.fc18.noarch vdsm-cli-4.10.3-9.fc18.noarch vdsm-xmlrpc-4.10.3-9.fc18.noarch vdsm-4.10.3-9.fc18.x86_64 When I add the second host start having this errors, BUG: soft lockup, and the host died. Should a kernel downgrade solve the problem? ----- Mensagem original ----- De: "Gianluca Cecchi" <gianluca.cecchi@gmail.com> Para: suporte@logicworks.pt Cc: "users" <Users@ovirt.org> Enviadas: Sexta-feira, 15 Março, 2013 18:42:12 Assunto: Re: [Users] BUG: soft lockup On Fri, Mar 15, 2013 at 7:21 PM, wrote:
I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died
this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.
Any idea?
I can only say that I hadd the same problem some days ago when I tried to add a new node to a pre-existing oVirt 3.2 install. I erroneously supposed it was a problem with my hw but when I tried to add another same hw node it got the same error. It was your same kernel giving problems. I solved the problem installing the same kernel as the first node, that was the latest 3.7.x in F18 release. Possibly 3.8.2 having problems with some cpus. My nodes were HP blades BL685c G1 with AMD Opteron G2 processors. Not had time to open bug or query on F18 user list yet... Yours? I also have a running oVirt 3.2 all-in-one setup with Fedora 18 and kernel 3.8.1-201.fc18.x86_64. In this case the cpu is (from cpuinfo) AMD Athlon(tm) II X4 630 Processor and configured in oVirt as "AMD Opteron G3" I'm going to upgrade this install with these steps: 1) only kernel and see what happens 2) upgrade to 3.2.1 Gianluca

------P7OM65WCOP5ZC4Q6CY0TNH5FTHRYTW Content-Type: text/plain; charset=UTF-8 Content-Transfer-Encoding: 8bit suporte@logicworks.pt wrote:
I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died
this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.
Any idea?
------------------------------------------------------------------------
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Just to let you know you're not alone. I have a few hosts which have the same problem. F18 minimal install, started at 3.6.x kernel. Every upgrade since has had this. Some reboots at higher versions work, mostly not. Disabling multipath might help but then vdsm doesn't work. Hosts are HP ML110G5, HP DL360 G6, from memory Joop -- Sent from my Android phone with K-9 Mail. Please excuse my brevity. ------P7OM65WCOP5ZC4Q6CY0TNH5FTHRYTW Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html><head/><body><html><head><style type="text/css">p { margin: 0; }</style></head><body><div class="gmail_quote">suporte@logicworks.pt wrote:<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;"> <div style="font-family: verdana,helvetica,sans-serif; font-size: 10pt; color: #330066">I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died<br /><br />this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.<br /><br />Any idea?<br /></div><p style="margin-top: 2.5em; margin-bottom: 1em; border-bottom: 1px solid #000"></p><pre style="white-space: pre-wrap; word-wrap:break-word; font-family: sans-serif; margin-top: 0px"><hr /><br />Users mailing list<br />Users@ovirt.org<br /><a href="http://lists.ovirt.org/mailman/listinfo/users">http://lists.ovirt.org/mailman/listinfo/users</a><br /></pre></blockquote></div><br clear="all">Just to let you know you're not alone. I have a few hosts which have the same problem. F18 minimal install, started at 3.6.x kernel. Every upgrade since has had this. Some reboots! at higher versions work, mostly not. Disabling multipath might help but then vdsm doesn't work.<br> Hosts are HP ML110G5, HP DL360 G6, from memory<br> <br> Joop<br> -- <br> Sent from my Android phone with K-9 Mail. Please excuse my brevity.</body></html></body></html> ------P7OM65WCOP5ZC4Q6CY0TNH5FTHRYTW--

So, there is not a solid solution for this? ----- Mensagem original ----- De: "Joop" <jvdwege@xs4all.nl> Para: suporte@logicworks.pt, Users@ovirt.org Enviadas: Sexta-feira, 15 Março, 2013 21:57:35 Assunto: Re: [Users] BUG: soft lockup suporte@logicworks.pt wrote: I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now. Any idea? Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users Just to let you know you're not alone. I have a few hosts which have the same problem. F18 minimal install, started at 3.6.x kernel. Every upgrade since has had this. Some reboots! at higher versions work, mostly not. Disabling multipath might help but then vdsm doesn't work. Hosts are HP ML110G5, HP DL360 G6, from memory Joop -- Sent from my Android phone with K-9 Mail. Please excuse my brevity. suporte@logicworks.pt wrote:
I have a host that have this error BUG: soft lockup - CPU#4 stuck for 22s! [sh: 1534], and the host died
this host was installed from fedora18 minimum installation, kernel version 3.8.2-206.fc18.x86_64. I have another host with the same configuration, and have no errors, until now.
Any idea?
------------------------------------------------------------------------
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Just to let you know you're not alone. I have a few hosts which have the same problem. F18 minimal install, started at 3.6.x kernel. Every upgrade since has had this. Some reboots at higher versions work, mostly not. Disabling multipath might help but then vdsm doesn't work. Hosts are HP ML110G5, HP DL360 G6, from memory Joop -- Sent from my Android phone with K-9 Mail. Please excuse my brevity.

On Fri, Mar 15, 2013 at 11:47 PM, wrote:
So, there is not a solid solution for this?
You can access this page: http://koji.fedoraproject.org/koji/packageinfo?packageID=8 and download the latest 3.7 kernel for f18 I'm using kernel-3.7.9-201.fc18 and run # yum localinstall kernel-3.7.9-201.fc18.x86_64.rpm or download the kernels ahead the problematic one that I presume are for testing and possibly addressing our problems if other people posted problems too.. kernel-3.8.2-209.fc18 kernel-3.8.3-201.fc18 HIH, Gianluca

Ok, thanks, I will try it and let you know. ----- Mensagem original ----- De: "Gianluca Cecchi" <gianluca.cecchi@gmail.com> Para: suporte@logicworks.pt Cc: "users" <Users@ovirt.org> Enviadas: Sexta-feira, 15 Março, 2013 23:12:10 Assunto: Re: [Users] BUG: soft lockup On Fri, Mar 15, 2013 at 11:47 PM, wrote:
So, there is not a solid solution for this?
You can access this page: http://koji.fedoraproject.org/koji/packageinfo?packageID=8 and download the latest 3.7 kernel for f18 I'm using kernel-3.7.9-201.fc18 and run # yum localinstall kernel-3.7.9-201.fc18.x86_64.rpm or download the kernels ahead the problematic one that I presume are for testing and possibly addressing our problems if other people posted problems too.. kernel-3.8.2-209.fc18 kernel-3.8.3-201.fc18 HIH, Gianluca

I have installed the kernel version 3.7.9-201.fc18.x86_64, but the problem still persist: kernel:[ 634.738490] BUG: soft lockup - CPU#5 stuck for 23s! [sh:1557] Now I'm lost ----- Mensagem original ----- De: suporte@logicworks.pt Para: "Gianluca Cecchi" <gianluca.cecchi@gmail.com> Cc: "users" <Users@ovirt.org> Enviadas: Sexta-feira, 15 Março, 2013 23:20:47 Assunto: Re: [Users] BUG: soft lockup Ok, thanks, I will try it and let you know. ----- Mensagem original ----- De: "Gianluca Cecchi" <gianluca.cecchi@gmail.com> Para: suporte@logicworks.pt Cc: "users" <Users@ovirt.org> Enviadas: Sexta-feira, 15 Março, 2013 23:12:10 Assunto: Re: [Users] BUG: soft lockup On Fri, Mar 15, 2013 at 11:47 PM, wrote:
So, there is not a solid solution for this?
You can access this page: http://koji.fedoraproject.org/koji/packageinfo?packageID=8 and download the latest 3.7 kernel for f18 I'm using kernel-3.7.9-201.fc18 and run # yum localinstall kernel-3.7.9-201.fc18.x86_64.rpm or download the kernels ahead the problematic one that I presume are for testing and possibly addressing our problems if other people posted problems too.. kernel-3.8.2-209.fc18 kernel-3.8.3-201.fc18 HIH, Gianluca _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
participants (3)
-
Gianluca Cecchi
-
Joop
-
suporte@logicworks.pt