Re: [Users] single VM disappeared

Hello,=0A= =0A= continuing with my 3.3 error series. I just lost one of my VMs after=0A= restart of ovirt-engine. The things I remember are:=0A= =0A= - The machine was up and running.=0A= - I changed the cluster cpu from penryn to nehalem=0A= - Machine was stopped=0A= - Restart failed with error in ovirt-engine.log=0A= =0A= Failed creating vm win2k3std_de in vds =3D=0A= 30b9bd7e-ea05-4910-85a7-f014ba8a8bab :=0A= colovn1 error =3D=0A= org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException:=0A= org.apache.xmlrpc.common.XmlRpcExtensionException:=0A= Null values aren't supported, if isEnabledForExtensions() =3D=3D false= =0A= =0A= - Without any idea how to solve the error I restarted ovirt-engine=0A= - Now Machine is missing=0A= - Error log reads=0A= =0A= Correlation ID: 2c29cebd, Call Stack: null, Custom=0A= Event ID: -1, Message: Failed to import Vm win2k3std_de to=0A= Data Center Collogia, Cluster Produktion=0A= =0A= I uploaded the engine.log to http://pastebin.com/sHspGTgy=0A= =0A= Markus=0A= =0A= =0A= _______________________________________________=0A= Users mailing list=0A= Users@ovirt.org=0A= http://lists.ovirt.org/mailman/listinfo/users=0A= =0A= I think thats an emulated machine problem which I'm just takin care of.= =0A= =0A= to check:=0A= =0A= go to cluster, select your cluster, got to the genral subtab and check=0A= what is "emulated machine" value=0A= I guess its null. to make it work,=0A= 1.take one of your host which belongs to that cluster to maintenance=0A= 2.activate it back.=0A=
Von: Roy Golan [rgolan@redhat.com]=0A= Gesendet: Donnerstag, 12. September 2013 15:58=0A= An: Markus Stockhausen=0A= Betreff: Re: [Users] single VM disappeared=0A= =0A= On Thu 12 Sep 2013 02:57:13 PM IDT, Markus Stockhausen wrote:=0A= 3. check the emulated machine value is set now=0A= =0A= cause:=0A= Bug https://bugzilla.redhat.com/1004695=0A= Editing the cluster nullify the emulated machine value=0A= =0A= Roy=0A= =0A= Hello,=0A= =0A=
This is a multi-part message in MIME format. ------=_NextPartTM-000-9fe6758f-1f20-44e4-a5a7-2c9b16fc854b Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable thanks for helping me through that minefield. Ok here are the results =0A= when following your guide.=0A= =0A= - Emulated machine: pc-1.0=0A= - maintenance/activate one node of the cluster:=0A= - Emulated machine is still: pc-1.0=0A= =0A= I guess that is nothing you wanted to see. So I changed the CPU=0A= type of the cluster and was able to understand what you think of. =0A= The emulated machine will go blank. After reactivating a node=0A= the value goes back to pc-1.0=0A= =0A= I played with the cluster settings in between and cannot say if the=0A= value was empty when the error occured. Just in case the emulated =0A= machine value was empty during restart of ovirt-engine. Did I loose =0A= the machine because of that? As I'm testing that is no problem but =0A= I want to make sure if I can recover from that situation in any way.=0A= =0A= Best regards.=0A= =0A= Markus=0A= =0A= =0A= ------=_NextPartTM-000-9fe6758f-1f20-44e4-a5a7-2c9b16fc854b Content-Type: text/plain; name="InterScan_Disclaimer.txt" Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename="InterScan_Disclaimer.txt" **************************************************************************** Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail ist nicht gestattet. Über das Internet versandte E-Mails können unter fremden Namen erstellt oder manipuliert werden. Deshalb ist diese als E-Mail verschickte Nachricht keine rechtsverbindliche Willenserklärung. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln Vorstand: Kadir Akin Dr. Michael Höhnerbach Vorsitzender des Aufsichtsrates: Hans Kristian Langva Registergericht: Amtsgericht Köln Registernummer: HRB 52 497 This e-mail may contain confidential and/or privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and destroy this e-mail. Any unauthorized copying, disclosure or distribution of the material in this e-mail is strictly forbidden. e-mails sent over the internet may have been written under a wrong name or been manipulated. That is why this message sent as an e-mail is not a legally binding declaration of intention. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln executive board: Kadir Akin Dr. Michael Höhnerbach President of the supervisory board: Hans Kristian Langva Registry office: district court Cologne Register number: HRB 52 497 **************************************************************************** ------=_NextPartTM-000-9fe6758f-1f20-44e4-a5a7-2c9b16fc854b--

Von: Roy Golan [rgolan@redhat.com] Gesendet: Donnerstag, 12. September 2013 15:58 An: Markus Stockhausen Betreff: Re: [Users] single VM disappeared
On Thu 12 Sep 2013 02:57:13 PM IDT, Markus Stockhausen wrote:
Hello,
continuing with my 3.3 error series. I just lost one of my VMs after restart of ovirt-engine. The things I remember are:
- The machine was up and running. - I changed the cluster cpu from penryn to nehalem here the emulated machine nullified. - Machine was stopped - Restart failed with error in ovirt-engine.log
Failed creating vm win2k3std_de in vds = 30b9bd7e-ea05-4910-85a7-f014ba8a8bab : colovn1 error = org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: org.apache.xmlrpc.common.XmlRpcExtensionException: Null values aren't supported, if isEnabledForExtensions() == false
- Without any idea how to solve the error I restarted ovirt-engine - Now Machine is missing - Error log reads
Correlation ID: 2c29cebd, Call Stack: null, Custom Event ID: -1, Message: Failed to import Vm win2k3std_de to Data Center Collogia, Cluster Produktion
I uploaded the engine.log tohttp://pastebin.com/sHspGTgy
Markus
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users I think thats an emulated machine problem which I'm just takin care of.
to check:
go to cluster, select your cluster, got to the genral subtab and check what is "emulated machine" value I guess its null. to make it work, 1.take one of your host which belongs to that cluster to maintenance 2.activate it back. 3. check the emulated machine value is set now
cause: Bughttps://bugzilla.redhat.com/1004695 Editing the cluster nullify the emulated machine value
Roy Hello,
thanks for helping me through that minefield. Ok here are the results when following your guide.
On 09/12/2013 05:20 PM, Markus Stockhausen wrote: please provide the output of: psql engine postgres -c "select * from async_tasks;"
- Emulated machine: pc-1.0
right, you restart the engine so the host already was reactivated so this is actually expected.
- maintenance/activate one node of the cluster: - Emulated machine is still: pc-1.0
I guess that is nothing you wanted to see. So I changed the CPU type of the cluster and was able to understand what you think of. The emulated machine will go blank. After reactivating a node the value goes back to pc-1.0
I played with the cluster settings in between and cannot say if the value was empty when the error occured. Just in case the emulated machine value was empty during restart of ovirt-engine. Did I loose the machine because of that? no this can't lead to that but the restart of ovirt can.
Yair, seems like importVM task is still in the DB after an engine restart during run VM fail.
As I'm testing that is no problem but I want to make sure if I can recover from that situation in any way.
Best regards.
Markus

please provide the output of:=0A= =0A= psql engine postgres -c "select * from async_tasks;"=0A= =0A= Ok, this is going really crazy now. We are talking about two errors =0A=
This is a multi-part message in MIME format. ------=_NextPartTM-000-0c3b3637-12e0-4981-9cfa-170d77dbdd4d Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable that occured at nearly the same time. The first one is what you=0A= described. The bug on the CPU pane prohibits me from starting the=0A= VM because "Emulated machine" is empty. No problem with that=0A= I have a workaround.=0A= =0A= But now to the other bug. Took me quite some time to reproduce=0A= it. But the setup is very simple. =0A= =0A= 1) Import a VM from an export domain into the cluster. =0A= 2) Wait for the successful finished message=0A= 3) See the machine in the VM overview of the cluster=0A= Everything looks fine (vm, disks, ...)=0A= 4) stop and start cluster=0A= 5) machine is missing=0A= 6) cluster spits failure messages=0A= =0A= Have a look:=0A= =0A= 2013-Sep-12, 22:07 Failed to import Vm win2k3std_de to Data Center Collogia= , Cluster Produktion=0A= 2013-Sep-12, 22:07 Failed to import Vm win2k3std_de to Data Center Collogia= , Cluster Produktion=0A= 2013-Sep-12, 22:07 User admin@internal logged out.=0A= 2013-Sep-12, 22:07 Storage Pool Manager runs on Host colovn1 (Address: 192.= 168.10.51).=0A= 2013-Sep-12, 22:07 Data Center is being initialized, please wait for initia= lization to complete.=0A= 2013-Sep-12, 22:07 State was set to Up for host colovn1.=0A= 2013-Sep-12, 22:07 State was set to Up for host colovn3.=0A= 2013-Sep-12, 22:03 Vm win2k3std_de was imported successfully to Data Center= Collogia, Cluster Produktion=0A= 2013-Sep-12, 22:01 Used Network resources of host colovn1 [100%] exceeded d= efined threshold [95%].=0A= 2013-Sep-12, 22:01 Starting to import Vm win2k3std_de to Data Center Collog= ia, Cluster Produktion=0A= 2013-Sep-12, 22:01 VM win2k3std_de was successfully removed.=0A= =0A= Reproducing the issue two times was no problem. The=0A= content of async_tasks during downtime of the engine=0A= is 600k. If you need this log because you cannot reproduce =0A= the bug yourself please do not hestitate to contact me.=0A= =0A= Markus=0A= ------=_NextPartTM-000-0c3b3637-12e0-4981-9cfa-170d77dbdd4d Content-Type: text/plain; name="InterScan_Disclaimer.txt" Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename="InterScan_Disclaimer.txt" **************************************************************************** Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail ist nicht gestattet. Über das Internet versandte E-Mails können unter fremden Namen erstellt oder manipuliert werden. Deshalb ist diese als E-Mail verschickte Nachricht keine rechtsverbindliche Willenserklärung. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln Vorstand: Kadir Akin Dr. Michael Höhnerbach Vorsitzender des Aufsichtsrates: Hans Kristian Langva Registergericht: Amtsgericht Köln Registernummer: HRB 52 497 This e-mail may contain confidential and/or privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and destroy this e-mail. Any unauthorized copying, disclosure or distribution of the material in this e-mail is strictly forbidden. e-mails sent over the internet may have been written under a wrong name or been manipulated. That is why this message sent as an e-mail is not a legally binding declaration of intention. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln executive board: Kadir Akin Dr. Michael Höhnerbach President of the supervisory board: Hans Kristian Langva Registry office: district court Cologne Register number: HRB 52 497 **************************************************************************** ------=_NextPartTM-000-0c3b3637-12e0-4981-9cfa-170d77dbdd4d--

On 12 Sep 2013, at 21:27, Markus Stockhausen <stockhausen@collogia.de> wrote:
please provide the output of:
psql engine postgres -c "select * from async_tasks;"
Ok, this is going really crazy now. We are talking about two errors that occured at nearly the same time. The first one is what you described. The bug on the CPU pane prohibits me from starting the VM because "Emulated machine" is empty. No problem with that I have a workaround.
But now to the other bug. Took me quite some time to reproduce it. But the setup is very simple.
1) Import a VM from an export domain into the cluster. 2) Wait for the successful finished message 3) See the machine in the VM overview of the cluster Everything looks fine (vm, disks, ...) 4) stop and start cluster 5) machine is missing 6) cluster spits failure messages
This exact thing is happening to me. I'm trying to import oVirt 3.2 VM's from an export domain. The import is successful, to the point I can fire up the VM's. Restart ovirt-engine and the VM is gone. Prior to the latest ovirt-engine update, only the VM container was removed and the disk remained. Since updating to ovirt-engine-3.3.0-3.el6.noarch a few moments ago, the container still gets removed, but now the disk is gone too! CentOS 6.4 x86_64 / oVirt 3.3

This exact thing is happening to me. I'm trying to import oVirt 3.2 VM's= from an export domain. The import is successful, to the point I can fire = up the VM's.=0A= =0A= Restart ovirt-engine and the VM is gone. Prior to the latest ovirt-engin= e update, only the VM container was removed and the disk remained. Since u=
This is a multi-part message in MIME format. ------=_NextPartTM-000-2b3a5f67-7363-4253-a0b9-0965121deba6 Content-Type: text/plain; charset="us-ascii" Content-Transfer-Encoding: quoted-printable pdating > to ovirt-engine-3.3.0-3.el6.noarch a few moments ago, the contain= er still gets removed, but now the disk is gone too!=0A=
=0A= CentOS 6.4 x86_64 / oVirt 3.3=0A= _______________________________________________=0A= Users mailing list=0A= Users@ovirt.org=0A= http://lists.ovirt.org/mailman/listinfo/users=0A= =0A= Two people facing the same issue so I opened a ticket. I opted to set the s= everity to high.=0A= =0A= Markus=0A= ------=_NextPartTM-000-2b3a5f67-7363-4253-a0b9-0965121deba6 Content-Type: text/plain; name="InterScan_Disclaimer.txt" Content-Transfer-Encoding: 7bit Content-Disposition: attachment; filename="InterScan_Disclaimer.txt"
**************************************************************************** Diese E-Mail enthält vertrauliche und/oder rechtlich geschützte Informationen. Wenn Sie nicht der richtige Adressat sind oder diese E-Mail irrtümlich erhalten haben, informieren Sie bitte sofort den Absender und vernichten Sie diese Mail. Das unerlaubte Kopieren sowie die unbefugte Weitergabe dieser Mail ist nicht gestattet. Über das Internet versandte E-Mails können unter fremden Namen erstellt oder manipuliert werden. Deshalb ist diese als E-Mail verschickte Nachricht keine rechtsverbindliche Willenserklärung. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln Vorstand: Kadir Akin Dr. Michael Höhnerbach Vorsitzender des Aufsichtsrates: Hans Kristian Langva Registergericht: Amtsgericht Köln Registernummer: HRB 52 497 This e-mail may contain confidential and/or privileged information. If you are not the intended recipient (or have received this e-mail in error) please notify the sender immediately and destroy this e-mail. Any unauthorized copying, disclosure or distribution of the material in this e-mail is strictly forbidden. e-mails sent over the internet may have been written under a wrong name or been manipulated. That is why this message sent as an e-mail is not a legally binding declaration of intention. Collogia Unternehmensberatung AG Ubierring 11 D-50678 Köln executive board: Kadir Akin Dr. Michael Höhnerbach President of the supervisory board: Hans Kristian Langva Registry office: district court Cologne Register number: HRB 52 497 **************************************************************************** ------=_NextPartTM-000-2b3a5f67-7363-4253-a0b9-0965121deba6--
participants (3)
-
James Wilson
-
Markus Stockhausen
-
Roy Golan