[Users] Issues starting hosted engine VM

Hi, With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper. I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine). The issue starts here that the host finds itself not able to start the VM up again. VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/ It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}" I'm not sure where else to look. Suggestions? Cheers, Andrew

The interesting thing - trying it with the paused option vdsm seems to create the VM hosted-engine --vm-start-paused vdsm.log http://www.fpaste.org/69604/13900482/ But I'm not sure how to then proceed to "resume" it. On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew

I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059 On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com>wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew

------=_Part_4226523_1838033400.1390119688633 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: 7bit Thanks a lot for your efforts and the report! -- Didi ----- Original Message -----
From: "Andrew Lau" <andrew@andrewklau.com> To: "users" <users@ovirt.org> Sent: Saturday, January 18, 2014 3:20:22 PM Subject: Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau < andrew@andrewklau.com > wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau < andrew@andrewklau.com > wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/
ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs " 'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <span dir=3D"ltr"><<a href= =3D"mailto:andrew@andrewklau.com" target=3D"_blank" data-mce-href=3D"mailto= :andrew@andrewklau.com">andrew@andrewklau.com</a>></span> wrote:<br><blo= ckquote class=3D"gmail_quote" style=3D"margin:0px 0px 0px 0.8ex;border-left= -width:1px;border-left-color:rgb(204,204,204);border-left-style:solid;paddi= ng-left:1ex" data-mce-style=3D"margin: 0px 0px 0px 0.8ex; border-left-width= : 1px; border-left-color: #cccccc; border-left-style: solid; padding-left: = 1ex;"><div dir=3D"ltr"><div style=3D"font-family:tahoma,sans-serif" data-mc= e-style=3D"font-family: tahoma,sans-serif;">Hi,</div><div style=3D"font-fam= ily:tahoma,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;"><= br></div><div style=3D"font-family:tahoma,sans-serif" data-mce-style=3D"fon= t-family: tahoma,sans-serif;">With the great help from sbonazzo, I managed = to step past the initial bug with the hosted-engine-setup but appear to hav= e run into another show stopper.</div><div style=3D"font-family:tahoma,sans= -serif" data-mce-style=3D"font-family: tahoma,sans-serif;"><br></div><div s= tyle=3D"font-family:tahoma,sans-serif" data-mce-style=3D"font-family: tahom= a,sans-serif;">I ran through the install process successfully up to the sta= ge where it completed and the engine VM was to be shutdown. (The engine has= already been installed on the VM and the host has been connected to the en= gine). </div><div style=3D"font-family:tahoma,sans-serif" data-mce-sty= le=3D"font-family: tahoma,sans-serif;"><br></div><div style=3D"font-family:= tahoma,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">The i= ssue starts here that the host finds itself not able to start the VM up aga= in.</div><div style=3D"font-family:tahoma,sans-serif" data-mce-style=3D"fon= t-family: tahoma,sans-serif;"><br></div><div style=3D"font-family:tahoma,sa= ns-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">VDSM Logs:&nbs=
-- Didi ------=_Part_4226523_1838033400.1390119688633 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <html><body><div style=3D"font-family: times new roman, new york, times, se= rif; font-size: 12pt; color: #000000"><div>Thanks a lot for your efforts an= d the report!</div><div>-- </div><div>Didi</div><div><br></div><hr id= =3D"zwchr"><blockquote style=3D"border-left:2px solid #1010FF;margin-left:5= px;padding-left:5px;color:#000;font-weight:normal;font-style:normal;text-de= coration:none;font-family:Helvetica,Arial,sans-serif;font-size:12pt;" data-= mce-style=3D"border-left: 2px solid #1010FF; margin-left: 5px; padding-left= : 5px; color: #000; font-weight: normal; font-style: normal; text-decoratio= n: none; font-family: Helvetica,Arial,sans-serif; font-size: 12pt;"><b>From= : </b>"Andrew Lau" <andrew@andrewklau.com><br><b>To: </b>"users" <= users@ovirt.org><br><b>Sent: </b>Saturday, January 18, 2014 3:20:22 PM<b= r><b>Subject: </b>Re: [Users] Issues starting hosted engine VM<br><div><br>= </div><div dir=3D"ltr"><div class=3D"gmail_default" style=3D"font-family:ta= homa,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">I belie= ve I found the issue and have reported it here <a href=3D"https://bugz= illa.redhat.com/show_bug.cgi?id=3D1055059" style=3D"font-family:arial" targ= et=3D"_blank" data-mce-href=3D"https://bugzilla.redhat.com/show_bug.cgi?id= =3D1055059" data-mce-style=3D"font-family: arial;">https://bugzilla.redhat.= com/show_bug.cgi?id=3D1055059</a></div><div class=3D"gmail_extra"><br><div = class=3D"gmail_quote">On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <span di= r=3D"ltr"><<a href=3D"mailto:andrew@andrewklau.com" target=3D"_blank" da= ta-mce-href=3D"mailto:andrew@andrewklau.com">andrew@andrewklau.com</a>><= /span> wrote:<br><blockquote class=3D"gmail_quote" style=3D"margin:0px 0px = 0px 0.8ex;border-left-width:1px;border-left-color:rgb(204,204,204);border-l= eft-style:solid;padding-left:1ex" data-mce-style=3D"margin: 0px 0px 0px 0.8= ex; border-left-width: 1px; border-left-color: #cccccc; border-left-style: = solid; padding-left: 1ex;"><div dir=3D"ltr"><div style=3D"font-family:tahom= a,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">The intere= sting thing - trying it with the paused option vdsm seems to create the VM<= /div><div style=3D"font-family:tahoma,sans-serif" data-mce-style=3D"font-fa= mily: tahoma,sans-serif;"><br></div><div style=3D"font-family:tahoma,sans-s= erif" data-mce-style=3D"font-family: tahoma,sans-serif;">hosted-engine --vm= -start-paused</div><div style=3D"font-family:tahoma,sans-serif" data-mce-st= yle=3D"font-family: tahoma,sans-serif;"><br></div><div style=3D"font-family= :tahoma,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">vdsm= .log <a href=3D"http://www.fpaste.org/69604/13900482/" target=3D"_blan= k" data-mce-href=3D"http://www.fpaste.org/69604/13900482/">http://www.fpast= e.org/69604/13900482/</a><br></div><div style=3D"font-family:tahoma,sans-se= rif" data-mce-style=3D"font-family: tahoma,sans-serif;"><br></div><div styl= e=3D"font-family:tahoma,sans-serif" data-mce-style=3D"font-family: tahoma,s= ans-serif;">But I'm not sure how to then proceed to "resume" it.</div><div>= <div class=3D"h5"><div class=3D"gmail_extra"><br><div class=3D"gmail_quote"= p;<a href=3D"http://www.fpaste.org/69592/00427141/" style=3D"font-family:ar= ial" target=3D"_blank" data-mce-href=3D"http://www.fpaste.org/69592/0042714= 1/" data-mce-style=3D"font-family: arial;">http://www.fpaste.org/69592/0042= 7141/</a></div><div style=3D"font-family:tahoma,sans-serif" data-mce-style= =3D"font-family: tahoma,sans-serif;">ovirt-hosted-engine-ha agent.log = <a href=3D"http://www.fpaste.org/69595/43609139/" style=3D"font-family:aria= l" target=3D"_blank" data-mce-href=3D"http://www.fpaste.org/69595/43609139/= " data-mce-style=3D"font-family: arial;">http://www.fpaste.org/69595/436091= 39/</a></div><div style=3D"font-family:tahoma,sans-serif" data-mce-style=3D= "font-family: tahoma,sans-serif;"><br></div><div style=3D"font-family:tahom= a,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">It seems t= o keep failing to start the VM.. when I restart the agent I can see the sco= re drop to 0 after 3 boot attempts. The interesting thing seems to be= in the VDSM Logs "<span style=3D"line-height:14.390625px;font-size:12px;fo= nt-family:monospace" data-mce-style=3D"line-height: 14.390625px; font-size:= 12px; font-family: monospace;">'Virtual machine does not exist', 'code': 1= }}"</span></div><div style=3D"font-family:tahoma,sans-serif" data-mce-style= =3D"font-family: tahoma,sans-serif;"><br></div><div style=3D"font-family:ta= homa,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;">I'm not= sure where else to look. Suggestions?</div><div style=3D"font-family:tahom= a,sans-serif" data-mce-style=3D"font-family: tahoma,sans-serif;"><br></div>= <div><div style=3D"font-family:tahoma,sans-serif;display:inline" data-mce-s= tyle=3D"font-family: tahoma,sans-serif; display: inline;">Cheers,</div><spa= n face=3D"tahoma, sans-serif" data-mce-style=3D"font-family: tahoma, sans-s= erif;" style=3D"font-family: tahoma, sans-serif;"><br>Andrew</span><br></di= v></div></blockquote></div><br></div></div></div></div></blockquote></div><= br></div></div><br>_______________________________________________<br>Users= mailing list<br>Users@ovirt.org<br>http://lists.ovirt.org/mailman/listinfo= /users<br></blockquote><div><br><br></div><div><br></div><div>-- <br></div>= <div><span name=3D"x"></span>Didi<span name=3D"x"></span><br></div></div></= body></html> ------=_Part_4226523_1838033400.1390119688633--

Hi, Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ? When this happened, I manually forced it to resume using virsh On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com> *To: *"users" <users@ovirt.org> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com>wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com>wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Didi

I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption. On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron

I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state. I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz? Cheers, Andrew. On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com<mailto: didi@redhat.com>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------------------------------------ ------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron

I have opened this BZ 1055461 anyway just in case On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew. On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com<mailto: didi@redhat.com>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------------------------------------ ------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron

the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there? On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org <mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron

It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least. On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron

1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume the engine vm manually. Thanks, Leonid. ----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least. On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron

On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055 ----- Original Message ----- From: "Dafna Ron" <dron@redhat.com> To: "Leonid Natapov" <lnatapov@redhat.com> Cc: "Andrew Lau" <andrew@andrewklau.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required? On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com> To: "Leonid Natapov" <lnatapov@redhat.com> Cc: "Andrew Lau" <andrew@andrewklau.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well? the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

Hi, That bug seems to be private :( I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on. On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/ show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com> To: "Leonid Natapov" <lnatapov@redhat.com> Cc: "Andrew Lau" <andrew@andrewklau.com>, "Yedidyah Bar David" < didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" < didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the
qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/ 13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
--
Dafna Ron
-- Dafna Ron

Itamar, was this only applied for hosted engine or was this added or planed to be added to all engine setups? On 01/20/2014 11:19 AM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> To: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>> Cc: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> To: dron@redhat.com <mailto:dron@redhat.com> Cc: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com><mailto: andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>> <mailto:didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org <mailto:users@ovirt.org>> <mailto:users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> <mailto:Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

On 01/20/2014 01:22 PM, Dafna Ron wrote:
Itamar, was this only applied for hosted engine or was this added or planed to be added to all engine setups?
resume paused VMs is not related to hosted engine
On 01/20/2014 11:19 AM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> To: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>> Cc: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> To: dron@redhat.com <mailto:dron@redhat.com> Cc: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com><mailto: andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>> <mailto:didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org <mailto:users@ovirt.org>> <mailto:users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> <mailto:Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

so, is in 3.3, should th vm's be auto resumed after a failure? can this be configured somehow? On 01/20/2014 11:23 AM, Itamar Heim wrote:
On 01/20/2014 01:22 PM, Dafna Ron wrote:
Itamar, was this only applied for hosted engine or was this added or planed to be added to all engine setups?
resume paused VMs is not related to hosted engine
On 01/20/2014 11:19 AM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> To: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>> Cc: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> To: dron@redhat.com <mailto:dron@redhat.com> Cc: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com><mailto: andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>> <mailto:didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org <mailto:users@ovirt.org>> <mailto:users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> <mailto:Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/__show_bug.cgi?id=723055 <https://bugzilla.redhat.com/show_bug.cgi?id=723055>
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> To: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>> Cc: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume
Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well?
the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> To: dron@redhat.com <mailto:dron@redhat.com> Cc: "Leonid Natapov" <lnatapov@redhat.com <mailto:lnatapov@redhat.com>>, "Yedidyah Bar David" <didi@redhat.com <mailto:didi@redhat.com>>, "users" <users@ovirt.org <mailto:users@ovirt.org>> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com><mailto: andrew@andrewklau.com <mailto:andrew@andrewklau.com>>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>> <mailto:didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------__------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>__>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org <mailto:users@ovirt.org>> <mailto:users@ovirt.org <mailto:users@ovirt.org>
<mailto:users@ovirt.org <mailto:users@ovirt.org>>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/__show_bug.cgi?id=1055059 <https://bugzilla.redhat.com/show_bug.cgi?id=1055059>
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>__>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/__13900482/ <http://www.fpaste.org/69604/13900482/>
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>> <mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>
<mailto:andrew@andrewklau.com <mailto:andrew@andrewklau.com>>__>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/__00427141/ <http://www.fpaste.org/69592/00427141/>
ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/__43609139/ <http://www.fpaste.org/69595/43609139/>
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_________________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> <mailto:Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>>
http://lists.ovirt.org/__mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
-- Didi
_________________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>> http://lists.ovirt.org/__mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

On Mon, Jan 20, 2014 at 11:19 PM, Itamar Heim <iheim@redhat.com> wrote:
On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
Yup, the storage domain went down and when it came back up the VMs remained paused.
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?

On 01/20/2014 02:27 PM, Andrew Lau wrote:
On Mon, Jan 20, 2014 at 11:19 PM, Itamar Heim <iheim@redhat.com <mailto:iheim@redhat.com>>wrote:
On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
Yup, the storage domain went down and when it came back up the VMs remained paused.
please open a bug with repro steps in that case and attach logs. thanks
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?

----- Original Message -----
From: "Itamar Heim" <iheim@redhat.com> To: "Andrew Lau" <andrew@andrewklau.com> Cc: "users" <users@ovirt.org> Sent: Monday, January 20, 2014 2:33:08 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 02:27 PM, Andrew Lau wrote:
On Mon, Jan 20, 2014 at 11:19 PM, Itamar Heim <iheim@redhat.com <mailto:iheim@redhat.com>>wrote:
On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
Yup, the storage domain went down and when it came back up the VMs remained paused.
please open a bug with repro steps in that case and attach logs. thanks
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required?
Andrew, did you manage to open a bug for the resume issue?

Sorry, I must've overlooked this email - I'll try reproduce and open a new bz. On Fri, Jan 24, 2014 at 4:13 AM, Doron Fediuck <dfediuck@redhat.com> wrote:
----- Original Message -----
From: "Itamar Heim" <iheim@redhat.com> To: "Andrew Lau" <andrew@andrewklau.com> Cc: "users" <users@ovirt.org> Sent: Monday, January 20, 2014 2:33:08 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 02:27 PM, Andrew Lau wrote:
On Mon, Jan 20, 2014 at 11:19 PM, Itamar Heim <iheim@redhat.com <mailto:iheim@redhat.com>>wrote:
On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
Yup, the storage domain went down and when it came back up the VMs remained paused.
please open a bug with repro steps in that case and attach logs. thanks
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and
manual
intervention is required?
Andrew, did you manage to open a bug for the resume issue?

I've opened a BZ https://bugzilla.redhat.com/show_bug.cgi?id=1058300 Itamar, in BZ 1055461 is that last comment directed to me? I don't seem to have the option anywhere to set Target Release Andrew. On Fri, Jan 24, 2014 at 8:06 AM, Andrew Lau <andrew@andrewklau.com> wrote:
Sorry, I must've overlooked this email - I'll try reproduce and open a new bz.
On Fri, Jan 24, 2014 at 4:13 AM, Doron Fediuck <dfediuck@redhat.com>wrote:
----- Original Message -----
From: "Itamar Heim" <iheim@redhat.com> To: "Andrew Lau" <andrew@andrewklau.com> Cc: "users" <users@ovirt.org> Sent: Monday, January 20, 2014 2:33:08 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 02:27 PM, Andrew Lau wrote:
On Mon, Jan 20, 2014 at 11:19 PM, Itamar Heim <iheim@redhat.com <mailto:iheim@redhat.com>>wrote:
On 01/20/2014 01:19 PM, Andrew Lau wrote:
Hi,
That bug seems to be private :(
I'm interested also to hear about this feature, as with 3.3.2 I had my gluster vms go into paused state quite a few times and they actually couldn't be resumed at all, they needed to be forced off and back on.
did the storage domain go back to up and they remained down?
Yup, the storage domain went down and when it came back up the VMs remained paused.
please open a bug with repro steps in that case and attach logs. thanks
On Mon, Jan 20, 2014 at 10:13 PM, Dafna Ron <dron@redhat.com <mailto:dron@redhat.com> <mailto:dron@redhat.com <mailto:dron@redhat.com>>> wrote:
interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and
manual
intervention is required?
Andrew, did you manage to open a bug for the resume issue?

Auto resume depends on domain monitoring (failed domain coming back up causes VMs to be unpaused). VM wouldn't be resumed if the domain monitoring for this domain stopped for some reason. I don't think we have some kind of error or event saying to user something like "vm has failed to resume automatically,please resume it manually". ----- Original Message ----- From: "Dafna Ron" <dron@redhat.com> To: "Leonid Natapov" <lnatapov@redhat.com> Cc: "Andrew Lau" <andrew@andrewklau.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 1:13:46 PM Subject: Re: [Users] Issues starting hosted engine VM interesting... :) so this is now configurable... what happens if qemu fails to start the vm (this happens sometimes - mostly on file type storage). do we have a re-try or a specific error telling the use that the activation failed and manual intervention is required? On 01/20/2014 11:02 AM, Leonid Natapov wrote:
All vms. Check this PRD: https://bugzilla.redhat.com/show_bug.cgi?id=723055
----- Original Message ----- From: "Dafna Ron" <dron@redhat.com> To: "Leonid Natapov" <lnatapov@redhat.com> Cc: "Andrew Lau" <andrew@andrewklau.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:44:46 PM Subject: Re: [Users] Issues starting hosted engine VM
On 01/20/2014 10:38 AM, Leonid Natapov wrote:
1.hosted-engine --vm-start should start engine vm. There was no problem with it when I tested it. 2.hosted-engine --vm-start-paused was added for the case when something is wrong with engine vm and it can't start and requires user intervention. For example in case of kernel panic. User can start engine vm in paused mode ,connect to it and try to fix the problem by booting in single user mode ,etc. 3.When the connectivity to shared storage is lost engine vm becomes paused. VM should be automatically unpaused after connectivity resumes (we introduced this feature in 3.3) but in case of NFS it could take quite time.so may be we should add something like --vm-resume in order to resume Are we talking only on the hosted engine vm or all other vm's? if I have other vm's they will also stop, will they be auto started as well? the engine vm manually.
Thanks, Leonid.
----- Original Message ----- From: "Andrew Lau" <andrew@andrewklau.com> To: dron@redhat.com Cc: "Leonid Natapov" <lnatapov@redhat.com>, "Yedidyah Bar David" <didi@redhat.com>, "users" <users@ovirt.org> Sent: Monday, January 20, 2014 12:28:15 PM Subject: Re: [Users] Issues starting hosted engine VM
It was paused due to the connection loss to the NFS server, I would assume once the connection is restored it could attempt to restore it? But I can try dig up the vdsm logs if you want, they would only be a few hours old
I think having an option like --vm-resume would at least hide the reason of having to dig into virsh and messing with authentication at the very least.
On Mon, Jan 20, 2014 at 9:23 PM, Dafna Ron <dron@redhat.com> wrote:
the question is what was the vm paused on... this can be found in the qemu vm log. if the vm is paused it will not be auto started - so I am not sure what you expect to change? virsh requires authentication regardless to hosted engine :) Leonid, did you do any testing there?
On 01/20/2014 10:13 AM, Andrew Lau wrote:
I have opened this BZ 1055461 anyway just in case
On Mon, Jan 20, 2014 at 8:33 PM, Andrew Lau <andrew@andrewklau.com<mailto: andrew@andrewklau.com>> wrote:
I was more interested in how the score process would be calculated, the vm-status option considered the VM in a bad state.
I left it for a few minutes and nothing seemed to have changed, I think it relates to hosted engine as virsh requires authentication. Should I still open a bz?
Cheers, Andrew.
On Jan 20, 2014 7:48 PM, "Dafna Ron" <dron@redhat.com <mailto:dron@redhat.com>> wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com> <mailto:didi@redhat.com <mailto:didi@redhat.com>>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------ ------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org> <mailto:users@ovirt.org
<mailto:users@ovirt.org>>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com> <mailto:andrew@andrewklau.com
<mailto:andrew@andrewklau.com>>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> <mailto:Users@ovirt.org <mailto:Users@ovirt.org>>
http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Dafna Ron
-- Dafna Ron
-- Dafna Ron

On 01/20/2014 10:48 AM, Dafna Ron wrote:
I am not sure this is a hosted engine question as much as a qemu question. qemu-kvm will not support auto start of vm's after EIO because of remote possibility of corruption.
only if live migration is involved
On 01/20/2014 05:46 AM, Andrew Lau wrote:
Hi,
Quick question, in the scenario eg. the NFS server becomes unreachable and the hosted-engine goes into a paused state. Will other hosts attempt to bring it back up? Should there be a command eg. hosted-engine --vm-resume ?
When this happened, I manually forced it to resume using virsh
On Sun, Jan 19, 2014 at 7:21 PM, Yedidyah Bar David <didi@redhat.com <mailto:didi@redhat.com>> wrote:
Thanks a lot for your efforts and the report! -- Didi
------------------------------------------------------------------------
*From: *"Andrew Lau" <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> *To: *"users" <users@ovirt.org <mailto:users@ovirt.org>> *Sent: *Saturday, January 18, 2014 3:20:22 PM *Subject: *Re: [Users] Issues starting hosted engine VM
I believe I found the issue and have reported it here https://bugzilla.redhat.com/show_bug.cgi?id=1055059
On Sat, Jan 18, 2014 at 11:33 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
The interesting thing - trying it with the paused option vdsm seems to create the VM
hosted-engine --vm-start-paused
vdsm.log http://www.fpaste.org/69604/13900482/
But I'm not sure how to then proceed to "resume" it.
On Sat, Jan 18, 2014 at 10:23 PM, Andrew Lau <andrew@andrewklau.com <mailto:andrew@andrewklau.com>> wrote:
Hi,
With the great help from sbonazzo, I managed to step past the initial bug with the hosted-engine-setup but appear to have run into another show stopper.
I ran through the install process successfully up to the stage where it completed and the engine VM was to be shutdown. (The engine has already been installed on the VM and the host has been connected to the engine).
The issue starts here that the host finds itself not able to start the VM up again.
VDSM Logs: http://www.fpaste.org/69592/00427141/ ovirt-hosted-engine-ha agent.log http://www.fpaste.org/69595/43609139/
It seems to keep failing to start the VM.. when I restart the agent I can see the score drop to 0 after 3 boot attempts. The interesting thing seems to be in the VDSM Logs "'Virtual machine does not exist', 'code': 1}}"
I'm not sure where else to look. Suggestions?
Cheers,
Andrew
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
-- Didi
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
participants (6)
-
Andrew Lau
-
Dafna Ron
-
Doron Fediuck
-
Itamar Heim
-
Leonid Natapov
-
Yedidyah Bar David