[Users] Failed to start JBoss AS

Hi,
I had a working engine, using 3.3.0 on Fedora 19. I rebooted my engine, and JBoss AS won't come back up. I get the attached errors from server.log.
Has anyone seen this before? Any suggestions on how to recover?
How I got here:
I had a working Engine (mach1 F19) and Host (mach2 F19), but needed to play some musical chairs with my hardware to utilize a 3rd machine (mach3 RH6). My goal is to have mach2 become Engine, and mach3 Host.
I did the following:
1. Created an NFS share on mach3, and used it as NFS Export Storage from mach2.
2. Exported my VMs and templates to the Export storage.
3. Removed mach2 from my Datacenter
4. Shut down mach1
5. Installed fresh F19 on mach2 (my old host), and installed Engine 3.3.0 on it
6. Added mach3 as Host
7. Imported my VMs and Templates from the Export Storage
At this point all looked well. I rebooted mach2, but it won't come back up.
I'd appreciate any insight.
Thanks, Bob

Well, this looks to me like an instance of https://bugzilla.redhat.com/show_bug.cgi?format=multiple&id=996005
That bug has not been updated in almost 3 months, however. The last note says "Working on CI". Does that mean Code Integration? It looks like maybe it was fixed in version "is10", whatever that is, and is currently Verified by QA. Is that right?
Unfortunately that bug report has no description whatsoever of what the underlying problem was. I would like to know:
1. Is there any workaround?
2. Can I start again by re-installing Engine and importing, or is the issue with the exported VMs and/or template so I'd just hit the problem again?
It sort of looks like the problem was with a template (org.ovirt.engine.core.common.action.ImportVmTemplateParameters). If so, can I import the VMs and avoid the Templates? Or am I perhaps misinterpreting that message?
Shouldn't bug reports always contain some sort of diagnostic information about the underlying cause and/or workarounds?
Thanks, Bob
On 11/12/2013 03:08 PM, Bob Doolittle wrote:
Hi,
I had a working engine, using 3.3.0 on Fedora 19. I rebooted my engine, and JBoss AS won't come back up. I get the attached errors from server.log.
Has anyone seen this before? Any suggestions on how to recover?
How I got here:
I had a working Engine (mach1 F19) and Host (mach2 F19), but needed to play some musical chairs with my hardware to utilize a 3rd machine (mach3 RH6). My goal is to have mach2 become Engine, and mach3 Host.
I did the following:
1. Created an NFS share on mach3, and used it as NFS Export Storage from mach2.
2. Exported my VMs and templates to the Export storage.
3. Removed mach2 from my Datacenter
4. Shut down mach1
5. Installed fresh F19 on mach2 (my old host), and installed Engine 3.3.0 on it
6. Added mach3 as Host
7. Imported my VMs and Templates from the Export Storage
At this point all looked well. I rebooted mach2, but it won't come back up.
I'd appreciate any insight.
Thanks, Bob

On 11/13/2013 02:22 AM, Bob Doolittle wrote:
Well, this looks to me like an instance of https://bugzilla.redhat.com/show_bug.cgi?format=multiple&id=996005
That bug has not been updated in almost 3 months, however. The last note says "Working on CI". Does that mean Code Integration? It looks like maybe it was fixed in version "is10", whatever that is, and is currently Verified by QA. Is that right?
Unfortunately that bug report has no description whatsoever of what the underlying problem was. I would like to know:
1. Is there any workaround?
2. Can I start again by re-installing Engine and importing, or is the issue with the exported VMs and/or template so I'd just hit the problem again?
The patch that fixes the issue is http://gerrit.ovirt.org/#/c/18001/ and it is available in oVirt 3.3.1.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

Hi Bob, and thanks for the feedback.
Indeed, there is no diagnosis in the bug, and that will be fixed.
Now let's see if we can get this working. In the async_tasks table there is a JSON structure with an attribute describing disks, diskMap. The problem is that it is missing a concrete Java type, java.util.HashMap. We can try to modify it.
Please paste the output of:
psql ovirt postgres -x -c 'select task_parameters from async_tasks;' | grep -i diskmap
Roy
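A minimal sketch of how the check Roy describes could be narrowed down, offered only as an illustration. It assumes that a correctly typed map carries the class name java.util.HashMap right after the diskMap key and that each record arrives on a single line; the real JSON layout is not shown in this thread, so both points are assumptions. The function name untyped_diskmap is hypothetical and not part of oVirt.

```shell
#!/bin/sh
# Hypothetical helper (not part of oVirt): reads serialized task_parameters
# on stdin, one record per line, and prints each diskMap fragment that does
# not mention java.util.HashMap before its first closing brace.
# ASSUMPTION: a correctly typed map names its class right after the
# "diskMap" key; the actual JSON layout is not shown in this thread.
untyped_diskmap() {
    grep -io '"diskmap"[^}]*' | grep -v 'java\.util\.HashMap'
}

# Usage against a live engine database (same query as in the message above,
# with -A -t so psql prints bare rows instead of the expanded layout):
#   psql ovirt postgres -At -c 'select task_parameters from async_tasks;' | untyped_diskmap
```

Any fragment it prints would be a candidate for the manual fix mentioned above. An empty result only means the assumed pattern was not found, not that the table is healthy.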

Hi,
Thanks, but I couldn't wait and didn't anticipate that there would be anything I could do to repair the database manually. I have nuked my old engine configuration and created a new one.
It would be great if we could update the bug report with an actual diagnosis and description of what operations can lead to corruption of the engine database. I mistakenly assumed (from the synopsis "ImportVmTemplateParameters fails") that it only stemmed from Template imports, but it seems to affect VM imports as well (I discovered after corrupting two subsequent engine databases).
Is there going to be a backport of this bug to ovirt-stable, or will we need to wait for 3.1.1? What is the timeframe for 3.1.1 final release?
Thanks, Bob
On 11/14/2013 04:31 AM, Roy Golan wrote:
Hi Bob and thanks for the feedback
indeed there is no diagnosis in the bug and that will be fixed.
Now let's see if we can get this working - in the async_tasks table there is a JSON structure with an attribute describing disks, diskMap. The problem is that it's missing a concrete Java type, java.util.HashMap. We can try to modify it.
please paste the output of: psql ovirt postgres -x -c 'select task_parameters from async_tasks;' | grep -i diskmap
Roy

On 11/14/2013 10:30 AM, Bob Doolittle wrote:
Hi,
Thanks, but I couldn't wait and didn't anticipate that there would be anything I could do to repair the database manually. I have nuked my old engine configuration and created a new one.
It would be great if we could update the bug report with an actual diagnosis and description of what operations can lead to corruption of the engine database. I mistakenly assumed (from the synopsis "ImportVmTemplateParameters fails") that it only stemmed from Template imports, but it seems to affect VM imports as well (I discovered after corrupting two subsequent engine databases).
Is there going to be a backport of this bug to ovirt-stable, or will we need to wait for 3.1.1? What is the timeframe for 3.1.1 final release?
s/3.1.1/3.3.1/
It should be in update-testing for final testing. Sandro, are you publishing it? It stays there for a few days; then, if there are no blockers, it gets released next week (finally).
participants (4)
- Bob Doolittle
- Itamar Heim
- Roy Golan
- Sahina Bose