[Users] Host discovery failing due to host network being lost

Hi ovirt users, I was trying to check on IRC the below, did not get any replies, so sending this mail. Can someone help me understand what could be the issue that causing my managed host to lose network settings, when i try to discover it from ovirt-engine ? I have to then manually (using a remote console) re-setup the network.. it even overrites my ifcfg-eth0 file I see a new bridge getting added called ovirtmgmt and route entry corr. to that.. the ifcfg-eth0 also has that entry I feel that ovirt scripts running on the host should atleast take a backup of the ifcfg* files they modify so that in case of issues like these, user can login via remote console and restore the original ifcfg file. This is what i asked on IRC... responses appreciated... deepakcs> Hello, i just configured ovirt-engine and discovering my first managed host.. got some Qs <deepakcs> what diff does it make when i select "override ip tables" check box during new Host workflow ? * djasa has quit (Ping timeout: 480 seconds) <deepakcs> this is the second time i am discovering this host and everytime the host discovery hangs during "Installing" and if i check the host, it goes out of network.... has anybody faced this problem before ? <deepakcs> While discovering the host, the host network ipv4 address is gone ! and thus the host goes out of network Some more updates.. 1) I manually re-setup the host network, made it pingable, esp from the ovirt-engine node, and clicked on Re-install on the web gui, this time with the 'override ip tables' check enabled 2) this time it successfully completed the node bootstrap steps (as seen from the Events window) and status was Reboot 3) But after reboot the same thing happened.. the host lost ipv4 addr, route entries are completely gone and onthe web gui the host is seen as non-responsive, which is expected bcos the host is not on the network anymore... what is the reason the above is happening ?

Hi ovirt users, =20 I was trying to check on IRC the below, did not get any replies, so sendin= g this mail. Can someone help me understand what could be the issue that causing my man= aged host to lose network settings, when i try to discover it from ovirt-eng= ine ? I have to then manually (using a remote console) re-setup the network.. it= even overrites my ifcfg-eth0 file =20 I see a new bridge getting added called ovirtmgmt and route entry corr. to=
I feel that ovirt scripts running on the host should atleast take a backup= of the ifcfg* files they modify so that in case of issues like these, user c= an login via remote console and restore the original ifcfg file. =20 This is what i asked on IRC... responses appreciated... =20 deepakcs> Hello, i just configured ovirt-engine and discovering my first m= anaged host.. got some Qs <deepakcs> what diff does it make when i select "override ip tables" check= box during new Host workflow ? * djasa has quit (Ping timeout: 480 seconds) <deepakcs> this is the second time i am discovering this host and everytim= e the host discovery hangs during "Installing" and if i check the host, it g= oes out of network.... has anybody faced this problem before ? <deepakcs> While discovering the host, the host network ipv4 address is go= ne ! and thus the host goes out of network =20 Some more updates.. 1) I manually re-setup the host network, made it pingable, esp from the ov= irt-engine node, and clicked on Re-install on the web gui, this time with th= e 'override ip tables' check enabled 2) this time it successfully completed the node bootstrap steps (as seen f= rom the Events window) and status was Reboot 3) But after reboot the same thing happened.. the host lost ipv4 addr, rou= te entries are completely gone and onthe web gui the host is seen as non-res=
--Apple-Mail-AC421548-58A6-42D0-9436-F4095405E006 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii On Jan 28, 2012, at 10:27, Deepak C Shetty <deepakcs@linux.vnet.ibm.com> wro= te: that.. the ifcfg-eth0 also has that entry ponsive, which is expected bcos the host is not on the network anymore...
=20 what is the reason the above is happening ?
Hi,=20 First, in order to understand the reason for bootstrap failures, please atta= ch engine.log (ovirt-engine) and node logs (under /tmp/vds_bootstrap.log, vd= s_installer.log). Second, what you describe regarding the loss of network could derive from th= e following reasons: - host is configured with bonding - but bonding is not configured correctly.= - NetworkManager interfere with networking config. - bridge (ovirtmgmt) is not set to with any BOOTPROTO or ONBOOT. Please attach network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/mes= sages . Haim
=20 _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
--Apple-Mail-AC421548-58A6-42D0-9436-F4095405E006 Content-Transfer-Encoding: base64 Content-Type: text/html; charset=utf-8 PGh0bWw+PGhlYWQ+PC9oZWFkPjxib2R5IGJnY29sb3I9IiNGRkZGRkYiPjxkaXY+PGRpdiBzdHls ZT0idGV4dC1hbGlnbjogcmlnaHQ7ZGlyZWN0aW9uOiBydGw7ICI+PGJyPjwvZGl2PjxkaXYgc3R5 bGU9InRleHQtYWxpZ246IC13ZWJraXQtYXV0bztkaXJlY3Rpb246IHJ0bDsgIj48YnI+PC9kaXY+ PC9kaXY+PGRpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFsaWduOiByaWdodDtkaXJlY3Rpb246IHJ0bDsg Ij48YnI+PC9kaXY+T24gSmFuIDI4LCAyMDEyLCBhdCAxMDoyNywgRGVlcGFrIEMgU2hldHR5ICZs dDs8YSBocmVmPSJtYWlsdG86ZGVlcGFrY3NAbGludXgudm5ldC5pYm0uY29tIj5kZWVwYWtjc0Bs aW51eC52bmV0LmlibS5jb208L2E+Jmd0OyB3cm90ZTo8YnI+PGJyPjwvZGl2PjxkaXY+PC9kaXY+ PGJsb2NrcXVvdGUgdHlwZT0iY2l0ZSI+PGRpdj48c3Bhbj5IaSBvdmlydCB1c2Vycyw8L3NwYW4+ PGJyPjxzcGFuPjwvc3Bhbj48YnI+PHNwYW4+SSB3YXMgdHJ5aW5nIHRvIGNoZWNrIG9uIElSQyB0 aGUgYmVsb3csIGRpZCBub3QgZ2V0IGFueSByZXBsaWVzLCBzbyBzZW5kaW5nIHRoaXMgbWFpbC48 L3NwYW4+PGJyPjxzcGFuPkNhbiBzb21lb25lIGhlbHAgbWUgdW5kZXJzdGFuZCB3aGF0IGNvdWxk IGJlIHRoZSBpc3N1ZSB0aGF0IGNhdXNpbmcgbXkgbWFuYWdlZCBob3N0IHRvIGxvc2UgbmV0d29y ayBzZXR0aW5ncywgd2hlbiBpIHRyeSB0byBkaXNjb3ZlciBpdCBmcm9tIG92aXJ0LWVuZ2luZSA/ PC9zcGFuPjxicj48c3Bhbj5JIGhhdmUgdG8gdGhlbiBtYW51YWxseSAodXNpbmcgYSByZW1vdGUg Y29uc29sZSkgcmUtc2V0dXAgdGhlIG5ldHdvcmsuLiBpdCBldmVuIG92ZXJyaXRlcyBteSBpZmNm Zy1ldGgwIGZpbGU8L3NwYW4+PGJyPjxzcGFuPjwvc3Bhbj48YnI+PHNwYW4+SSBzZWUgYSBuZXcg YnJpZGdlIGdldHRpbmcgYWRkZWQgY2FsbGVkIG92aXJ0bWdtdCBhbmQgcm91dGUgZW50cnkgY29y ci4gdG8gdGhhdC4uIHRoZSBpZmNmZy1ldGgwIGFsc28gaGFzIHRoYXQgZW50cnk8L3NwYW4+PGJy PjxzcGFuPkkgZmVlbCB0aGF0IG92aXJ0IHNjcmlwdHMgcnVubmluZyBvbiB0aGUgaG9zdCBzaG91 bGQgYXRsZWFzdCB0YWtlIGEgYmFja3VwIG9mIHRoZSBpZmNmZyogZmlsZXMgdGhleSBtb2RpZnkg c28gdGhhdCBpbiBjYXNlIG9mIGlzc3VlcyBsaWtlIHRoZXNlLCB1c2VyIGNhbiBsb2dpbiB2aWEg cmVtb3RlIGNvbnNvbGUgYW5kIHJlc3RvcmUgdGhlIG9yaWdpbmFsIGlmY2ZnIGZpbGUuPC9zcGFu Pjxicj48c3Bhbj48L3NwYW4+PGJyPjxzcGFuPlRoaXMgaXMgd2hhdCBpIGFza2VkIG9uIElSQy4u LiByZXNwb25zZXMgYXBwcmVjaWF0ZWQuLi48L3NwYW4+PGJyPjxzcGFuPjwvc3Bhbj48YnI+PHNw YW4+ZGVlcGFrY3MmZ3Q7IEhlbGxvLCBpIGp1c3QgY29uZmlndXJlZCBvdmlydC1lbmdpbmUgYW5k IGRpc2NvdmVyaW5nIG15IGZpcnN0IG1hbmFnZWQgaG9zdC4uIGdvdCBzb21lIFFzPC9zcGFuPjxi cj48c3Bhbj4mbHQ7ZGVlcGFrY3MmZ3Q7IHdoYXQgZGlmZiBkb2VzIGl0IG1ha2Ugd2hlbiBpIHNl bGVjdCAib3ZlcnJpZGUgaXAgdGFibGVzIiBjaGVjayBib3ggZHVyaW5nIG5ldyBIb3N0IHdvcmtm bG93ID88L3NwYW4+PGJyPjxzcGFuPiogZGphc2EgaGFzIHF1aXQgKFBpbmcgdGltZW91dDogNDgw IHNlY29uZHMpPC9zcGFuPjxicj48c3Bhbj4mbHQ7ZGVlcGFrY3MmZ3Q7IHRoaXMgaXMgdGhlIHNl Y29uZCB0aW1lIGkgYW0gZGlzY292ZXJpbmcgdGhpcyBob3N0IGFuZCBldmVyeXRpbWUgdGhlIGhv c3QgZGlzY292ZXJ5IGhhbmdzIGR1cmluZyAiSW5zdGFsbGluZyIgYW5kIGlmIGkgY2hlY2sgdGhl IGhvc3QsIGl0IGdvZXMgb3V0IG9mIG5ldHdvcmsuLi4uIGhhcyBhbnlib2R5IGZhY2VkIHRoaXMg cHJvYmxlbSBiZWZvcmUgPzwvc3Bhbj48YnI+PHNwYW4+Jmx0O2RlZXBha2NzJmd0OyBXaGlsZSBk aXNjb3ZlcmluZyB0aGUgaG9zdCwgdGhlIGhvc3QgbmV0d29yayBpcHY0IGFkZHJlc3MgaXMgZ29u ZSAhIGFuZCB0aHVzIHRoZSBob3N0IGdvZXMgb3V0IG9mIG5ldHdvcms8L3NwYW4+PGJyPjxzcGFu Pjwvc3Bhbj48YnI+PHNwYW4+U29tZSBtb3JlIHVwZGF0ZXMuLjwvc3Bhbj48YnI+PHNwYW4+MSkg SSBtYW51YWxseSByZS1zZXR1cCB0aGUgaG9zdCBuZXR3b3JrLCBtYWRlIGl0IHBpbmdhYmxlLCBl c3AgZnJvbSB0aGUgb3ZpcnQtZW5naW5lIG5vZGUsIGFuZCBjbGlja2VkIG9uIFJlLWluc3RhbGwg b24gdGhlIHdlYiBndWksIHRoaXMgdGltZSB3aXRoIHRoZSAnb3ZlcnJpZGUgaXAgdGFibGVzJyBj aGVjayBlbmFibGVkPC9zcGFuPjxicj48c3Bhbj4yKSB0aGlzIHRpbWUgaXQgc3VjY2Vzc2Z1bGx5 IGNvbXBsZXRlZCB0aGUgbm9kZSBib290c3RyYXAgc3RlcHMgKGFzIHNlZW4gZnJvbSB0aGUgRXZl bnRzIHdpbmRvdykgYW5kIHN0YXR1cyB3YXMgUmVib290PC9zcGFuPjxicj48c3Bhbj4zKSBCdXQg YWZ0ZXIgcmVib290IHRoZSBzYW1lIHRoaW5nIGhhcHBlbmVkLi4gdGhlIGhvc3QgbG9zdCBpcHY0 IGFkZHIsIHJvdXRlIGVudHJpZXMgYXJlIGNvbXBsZXRlbHkgZ29uZSBhbmQgb250aGUgd2ViIGd1 aSB0aGUgaG9zdCBpcyBzZWVuIGFzIG5vbi1yZXNwb25zaXZlLCB3aGljaCBpcyBleHBlY3RlZCBi Y29zIHRoZSBob3N0IGlzIG5vdCBvbiB0aGUgbmV0d29yayBhbnltb3JlLi4uPC9zcGFuPjxicj48 c3Bhbj48L3NwYW4+PGJyPjxzcGFuPndoYXQgaXMgdGhlIHJlYXNvbiB0aGUgYWJvdmUgaXMgaGFw cGVuaW5nID88L3NwYW4+PGJyPjwvZGl2PjwvYmxvY2txdW90ZT48ZGl2IHN0eWxlPSJ0ZXh0LWFs aWduOiByaWdodDtkaXJlY3Rpb246IHJ0bDsgIj48YnI+PC9kaXY+PGRpdiBzdHlsZT0idGV4dC1h bGlnbjogbGVmdDtkaXJlY3Rpb246IGx0cjsgIj5IaSwmbmJzcDs8L2Rpdj48ZGl2IHN0eWxlPSJ0 ZXh0LWFsaWduOiBsZWZ0O2RpcmVjdGlvbjogbHRyOyAiPjxicj48L2Rpdj48ZGl2IHN0eWxlPSJ0 ZXh0LWFsaWduOiBsZWZ0O2RpcmVjdGlvbjogbHRyOyAiPkZpcnN0LCBpbiBvcmRlciB0byB1bmRl cnN0YW5kIHRoZSByZWFzb24gZm9yIGJvb3RzdHJhcCBmYWlsdXJlcywgcGxlYXNlIGF0dGFjaCBl bmdpbmUubG9nIChvdmlydC1lbmdpbmUpIGFuZCBub2RlIGxvZ3MgKHVuZGVyIC90bXAvdmRzX2Jv b3RzdHJhcC5sb2csIHZkc19pbnN0YWxsZXIubG9nKS48L2Rpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFs aWduOiBsZWZ0O2RpcmVjdGlvbjogbHRyOyAiPjxicj48L2Rpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFs aWduOiBsZWZ0O2RpcmVjdGlvbjogbHRyOyAiPlNlY29uZCwgd2hhdCB5b3UgZGVzY3JpYmUgcmVn YXJkaW5nIHRoZSBsb3NzIG9mIG5ldHdvcmsgY291bGQgZGVyaXZlIGZyb20gdGhlIGZvbGxvd2lu ZyByZWFzb25zOjwvZGl2PjxkaXYgc3R5bGU9InRleHQtYWxpZ246IGxlZnQ7ZGlyZWN0aW9uOiBs dHI7ICI+LSBob3N0IGlzIGNvbmZpZ3VyZWQgd2l0aCBib25kaW5nIC0gYnV0IGJvbmRpbmcgaXMg bm90IGNvbmZpZ3VyZWQgY29ycmVjdGx5LjwvZGl2PjxkaXYgc3R5bGU9InRleHQtYWxpZ246IGxl ZnQ7ZGlyZWN0aW9uOiBsdHI7ICI+LSBOZXR3b3JrTWFuYWdlciBpbnRlcmZlcmUgJm5ic3A7d2l0 aCBuZXR3b3JraW5nIGNvbmZpZy48L2Rpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFsaWduOiBsZWZ0O2Rp cmVjdGlvbjogbHRyOyAiPi0gYnJpZGdlIChvdmlydG1nbXQpIGlzIG5vdCBzZXQgdG8gd2l0aCBh bnkgQk9PVFBST1RPIG9yIE9OQk9PVC48L2Rpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFsaWduOiBsZWZ0 O2RpcmVjdGlvbjogbHRyOyAiPjxicj48L2Rpdj48ZGl2IHN0eWxlPSJ0ZXh0LWFsaWduOiBsZWZ0 O2RpcmVjdGlvbjogbHRyOyAiPlBsZWFzZSBhdHRhY2ggbmV0d29yayBzY3JpcHRzIChpZmNmZy1l dGgwLCBpZmNmZy1vdmlydG1nbXQpIGFuZCAvdmFyL2xvZy9tZXNzYWdlcyAuPC9kaXY+PGRpdiBz dHlsZT0idGV4dC1hbGlnbjogbGVmdDtkaXJlY3Rpb246IGx0cjsgIj48YnI+PC9kaXY+PGRpdiBz dHlsZT0idGV4dC1hbGlnbjogbGVmdDtkaXJlY3Rpb246IGx0cjsgIj5IYWltPC9kaXY+PGJsb2Nr cXVvdGUgdHlwZT0iY2l0ZSI+PGRpdj48c3Bhbj48L3NwYW4+PGJyPjxzcGFuPl9fX19fX19fX19f X19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fX19fPC9zcGFuPjxicj48c3Bhbj5Vc2Vy cyBtYWlsaW5nIGxpc3Q8L3NwYW4+PGJyPjxzcGFuPjxhIGhyZWY9Im1haWx0bzpVc2Vyc0Bvdmly dC5vcmciPlVzZXJzQG92aXJ0Lm9yZzwvYT48L3NwYW4+PGJyPjxzcGFuPjxhIGhyZWY9Imh0dHA6 Ly9saXN0cy5vdmlydC5vcmcvbWFpbG1hbi9saXN0aW5mby91c2VycyI+aHR0cDovL2xpc3RzLm92 aXJ0Lm9yZy9tYWlsbWFuL2xpc3RpbmZvL3VzZXJzPC9hPjwvc3Bhbj48YnI+PC9kaXY+PC9ibG9j a3F1b3RlPjwvYm9keT48L2h0bWw+ --Apple-Mail-AC421548-58A6-42D0-9436-F4095405E006--

This is a multi-part message in MIME format. --------------020008080101020100040909 Content-Type: text/plain; charset=UTF-8; format=flowed Content-Transfer-Encoding: 7bit
Hi,
First, in order to understand the reason for bootstrap failures, please attach engine.log (ovirt-engine) and node logs (under /tmp/vds_bootstrap.log, vds_installer.log).
Second, what you describe regarding the loss of network could derive from the following reasons: - host is configured with bonding - but bonding is not configured correctly. - NetworkManager interfere with networking config. - bridge (ovirtmgmt) is not set to with any BOOTPROTO or ONBOOT.
Please attach network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/messages .
Hi, I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there. BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state. How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host. --------------020008080101020100040909 Content-Type: text/html; charset=UTF-8 Content-Transfer-Encoding: 8bit <!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN"> <html> <head> <meta content="text/html; charset=UTF-8" http-equiv="Content-Type"> </head> <body text="#000000" bgcolor="#ffffff"> <br> <blockquote cite="mid:E548E5C8-2D77-4E58-950D-26DE58B1F8F2@redhat.com" type="cite"> <div style="text-align: left; direction: ltr;">Hi, </div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">First, in order to understand the reason for bootstrap failures, please attach engine.log (ovirt-engine) and node logs (under /tmp/vds_bootstrap.log, vds_installer.log).</div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">Second, what you describe regarding the loss of network could derive from the following reasons:</div> <div style="text-align: left; direction: ltr;">- host is configured with bonding - but bonding is not configured correctly.</div> <div style="text-align: left; direction: ltr;">- NetworkManager interfere with networking config.</div> <div style="text-align: left; direction: ltr;">- bridge (ovirtmgmt) is not set to with any BOOTPROTO or ONBOOT.</div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">Please attach network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/messages .</div> <br> </blockquote> <br> <tt>Hi,<br> I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there.<br> BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? <br> ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state.<br> How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.<br> </tt> </body> </html> --------------020008080101020100040909--

=20
Hi,=20 =20 First, in order to understand the reason for bootstrap failures, please a= ttach engine.log (ovirt-engine) and node logs (under /tmp/vds_bootstrap.log,= vds_installer.log). =20 Second, what you describe regarding the loss of network could derive from=
- host is configured with bonding - but bonding is not configured correct= ly. - NetworkManager interfere with networking config. - bridge (ovirtmgmt) is not set to with any BOOTPROTO or ONBOOT. =20 Please attach network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/= messages . =20 =20 Hi, I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was=
--Apple-Mail-6FEA0081-096F-43B7-982A-2A0C8455E896 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii On Jan 28, 2012, at 11:45, Deepak C Shetty <deepakcs@linux.vnet.ibm.com> wro= te: the following reasons: probably not there.
BTW, before i attach any files here, is there a way to remove the host and= re-discover it afresh ?=20 ovirt web gui does not give me any option to remove.. the Remove is diable= d and host is in Non Responsive state. How do i remove and start from scratch and then i can try to see if i can f= ix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.
Please move host to maintenance, under 'general' tab, you will find re-insta= ll option, or slightly remove the host and add it again. Please make sure to delete ovirtmgmt bridge before the re-install phase. --Apple-Mail-6FEA0081-096F-43B7-982A-2A0C8455E896 Content-Transfer-Encoding: 7bit Content-Type: text/html; charset=utf-8 <html><head></head><body bgcolor="#FFFFFF"><div><br><br></div><div><br>On Jan 28, 2012, at 11:45, Deepak C Shetty <<a href="mailto:deepakcs@linux.vnet.ibm.com">deepakcs@linux.vnet.ibm.com</a>> wrote:<br><br></div><div></div><blockquote type="cite"><div> <meta content="text/html; charset=UTF-8" http-equiv="Content-Type"> <br> <blockquote cite="mid:E548E5C8-2D77-4E58-950D-26DE58B1F8F2@redhat.com" type="cite"> <div style="text-align: left; direction: ltr;">Hi, </div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">First, in order to understand the reason for bootstrap failures, please attach engine.log (ovirt-engine) and node logs (under /tmp/vds_bootstrap.log, vds_installer.log).</div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">Second, what you describe regarding the loss of network could derive from the following reasons:</div> <div style="text-align: left; direction: ltr;">- host is configured with bonding - but bonding is not configured correctly.</div> <div style="text-align: left; direction: ltr;">- NetworkManager interfere with networking config.</div> <div style="text-align: left; direction: ltr;">- bridge (ovirtmgmt) is not set to with any BOOTPROTO or ONBOOT.</div> <div style="text-align: left; direction: ltr;"><br> </div> <div style="text-align: left; direction: ltr;">Please attach network scripts (ifcfg-eth0, ifcfg-ovirtmgmt) and /var/log/messages .</div> <br> </blockquote> <br> <tt>Hi,<br> I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there.<br> BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? <br> ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state.<br> How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.<br> </tt> </div></blockquote><br><div>Please move host to maintenance, under 'general' tab, you will find re-install option, or slightly remove the host and add it again.</div><div>Please make sure to delete ovirtmgmt bridge before the re-install phase.</div><div><br></div></body></html> --Apple-Mail-6FEA0081-096F-43B7-982A-2A0C8455E896--

On 01/28/2012 03:24 PM, Haim Ateya wrote:
Hi, I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there. BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state. How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.
Please move host to maintenance, under 'general' tab, you will find re-install option, or slightly remove the host and add it again. Please make sure to delete ovirtmgmt bridge before the re-install phase.
Host cannot be moved to Maintenance mode, since there is no network connectivity between ovirt and host (due to the above issues).. ovirt tries to put the host into maint mode, but does not and reverts the status back to non responsive, because of this I am never ever able to delete the host.. i think this is a bug because such issues will be common in user environments & there should be a way for the user to delete the host and start afresh irrespective of the host status.. any reason why Remove is disabled when host is in non responsive state ?

On 01/31/2012 07:23 AM, Deepak C Shetty wrote:
On 01/28/2012 03:24 PM, Haim Ateya wrote:
Hi, I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there. BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state. How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.
Please move host to maintenance, under 'general' tab, you will find re-install option, or slightly remove the host and add it again. Please make sure to delete ovirtmgmt bridge before the re-install phase.
Host cannot be moved to Maintenance mode, since there is no network connectivity between ovirt and host (due to the above issues).. ovirt tries to put the host into maint mode, but does not and reverts the status back to non responsive, because of this I am never ever able to delete the host.. i think this is a bug because such issues will be common in user environments & there should be a way for the user to delete the host and start afresh irrespective of the host status.. any reason why Remove is disabled when host is in non responsive state ?
not being able to move the host to maintenance sounds like a bug indeed, can you please attach engine logs to the bug. for the network issue- i would have looked at https://bugzilla.redhat.com/show_bug.cgi?id=785557 as well, to see this isn't your problem. Moran.
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

On 01/31/2012 12:29 PM, Moran Goldboim wrote:
On 01/31/2012 07:23 AM, Deepak C Shetty wrote:
Host cannot be moved to Maintenance mode, since there is no network connectivity between ovirt and host (due to the above issues).. ovirt tries to put the host into maint mode, but does not and reverts the status back to non responsive, because of this I am never ever able to delete the host.. i think this is a bug because such issues will be common in user environments & there should be a way for the user to delete the host and start afresh irrespective of the host status.. any reason why Remove is disabled when host is in non responsive state ?
not being able to move the host to maintenance sounds like a bug indeed, can you please attach engine logs to the bug. for the network issue- i would have looked at https://bugzilla.redhat.com/show_bug.cgi?id=785557 as well, to see this isn't your problem. Moran.
Where are the engine logs, in /tmp, i tried to look but could not find.

On 01/31/2012 01:07 PM, Deepak C Shetty wrote:
On 01/31/2012 12:29 PM, Moran Goldboim wrote:
On 01/31/2012 07:23 AM, Deepak C Shetty wrote:
Host cannot be moved to Maintenance mode, since there is no network connectivity between ovirt and host (due to the above issues).. ovirt tries to put the host into maint mode, but does not and reverts the status back to non responsive, because of this I am never ever able to delete the host.. i think this is a bug because such issues will be common in user environments & there should be a way for the user to delete the host and start afresh irrespective of the host status.. any reason why Remove is disabled when host is in non responsive state ?
not being able to move the host to maintenance sounds like a bug indeed, can you please attach engine logs to the bug. for the network issue- i would have looked at https://bugzilla.redhat.com/show_bug.cgi?id=785557 as well, to see this isn't your problem. Moran.
Where are the engine logs, in /tmp, i tried to look but could not find.
you can find the engine logs at : /var/log/ovirt-engine/ the log needed for that matter is engine.log (latest), you can also use the engine-log-collector tool installed. Moran.

On 01/31/2012 07:23 AM, Deepak C Shetty wrote:
On 01/28/2012 03:24 PM, Haim Ateya wrote:
Hi, I forgot to backup the ifcfg-eth0 for ovirtmgmt but i think ONBOOT was probably not there. BTW, before i attach any files here, is there a way to remove the host and re-discover it afresh ? ovirt web gui does not give me any option to remove.. the Remove is diabled and host is in Non Responsive state. How do i remove and start from scratch and then i can try to see if i can fix ifcfg-eth0 for ovirtmgmt ? Currently i am unable to remove the host.
Please move host to maintenance, under 'general' tab, you will find re-install option, or slightly remove the host and add it again. Please make sure to delete ovirtmgmt bridge before the re-install phase.
Host cannot be moved to Maintenance mode, since there is no network connectivity between ovirt and host (due to the above issues).. ovirt tries to put the host into maint mode, but does not and reverts the status back to non responsive, because of this I am never ever able to delete the host.. i think this is a bug because such issues will be common in user environments & there should be a way for the user to delete the host and start afresh irrespective of the host status.. any reason why Remove is disabled when host is in non responsive state ?
is the host the SPM or running any VMs? you should be able to fence it via power management, or just right click and "confirm manual shutdown" so engine will know it is not consuming these resources any more.

is the host the SPM or running any VMs? you should be able to fence it via power management, or just right click and "confirm manual shutdown" so engine will know it is not consuming these resources any more.
No, the host is not running in VMs. I am not sure how to look for spm in the host. In fact right now even vdsm i dont see it under ps output. I did not configure power mgmt, since for the same host i had tried configuring pwrmgmt and ovirt gave some error, so when i tried afresh, i did not check pwrmgmt. I don't see confirm manual shutdown option, when i right click the host. I see Confirm reboot.... i say yes and ok, but it still goes back to non responsive state. Do i need to open a BZ for reporting this issue, or I can start by attaching logs here in the mail ?

On 01/31/2012 01:11 PM, Deepak C Shetty wrote:
is the host the SPM or running any VMs? you should be able to fence it via power management, or just right click and "confirm manual shutdown" so engine will know it is not consuming these resources any more.
No, the host is not running in VMs. I am not sure how to look for spm in the host. In fact right now even vdsm i dont see it under ps output. I did not configure power mgmt, since for the same host i had tried configuring pwrmgmt and ovirt gave some error, so when i tried afresh, i did not check pwrmgmt.
SPM is the last column in the hosts main tab.
I don't see confirm manual shutdown option, when i right click the host. I see Confirm reboot.... i say yes and ok, but it still goes back to non responsive state.
Do i need to open a BZ for reporting this issue, or I can start by attaching logs here in the mail ?
i suspect a bug will be more productive, but doing a few rounds with log excerpts is valid as well.

Hi Itamar, My reply with the attachments is on hold since the size is around 114KB I tried with and without .tar, the same happens, so not sure how to send the attachments otherwise to this email thread.

On 01/31/2012 04:55 PM, Deepak C Shetty wrote:
Hi Itamar, My reply with the attachments is on hold since the size is around 114KB I tried with and without .tar, the same happens, so not sure how to send the attachments otherwise to this email thread.
1. i got it, since you cc'd me directly. 2. karsten - can we change the limit on users@ovirt.org to something in MBs? thanks

On 01/31/2012 05:09 PM, Itamar Heim wrote:
On 01/31/2012 04:55 PM, Deepak C Shetty wrote:
Hi Itamar, My reply with the attachments is on hold since the size is around 114KB I tried with and without .tar, the same happens, so not sure how to send the attachments otherwise to this email thread.
1. i got it, since you cc'd me directly. 2. karsten - can we change the limit on users@ovirt.org to something in MBs?
thanks _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
Deepak, can you please look for SetVdsStatusVDSCommand following a MaintananceVdsCommand in /var/log/ovirt-engine/engine.log if you were having a problem with this command it is most likely for an ERROR to appear around there. Moran.

On 01/31/2012 09:55 AM, Deepak C Shetty wrote:
Hi Itamar, My reply with the attachments is on hold since the size is around 114KB I tried with and without .tar, the same happens, so not sure how to send the attachments otherwise to this email thread.
___________
Either attach them to the wiki, and then link the page. just specify the id you want and someone can create you an account. Or the other option is to attach them to a BZ. The reason we moderate attachments to the list is to not spam all users with attachments. :-) Carl.

On 02/01/2012 12:02 AM, Carl Trieloff wrote:
On 01/31/2012 09:55 AM, Deepak C Shetty wrote:
Hi Itamar, My reply with the attachments is on hold since the size is around 114KB I tried with and without .tar, the same happens, so not sure how to send the attachments otherwise to this email thread.
___________
Either attach them to the wiki, and then link the page. just specify the id you want and someone can create you an account. Or the other option is to attach them to a BZ.
The reason we moderate attachments to the list is to not spam all users with attachments. :-)
what is the current limit? 114KB means even a screen shot won't pass through.

Last time I tried to send an attachment I found the limit to be 40KB. - Chris -----Original Message----- From: users-bounces@ovirt.org [mailto:users-bounces@ovirt.org] On Behalf Of Carl Trieloff Sent: Tuesday, January 31, 2012 4:14 PM To: Itamar Heim Cc: users@ovirt.org Subject: Re: [Users] Host discovery failing due to host network being lost On 01/31/2012 05:14 PM, Itamar Heim wrote:
what is the current limit? 114KB means even a screen shot won't pass through.
Don't know who all is admin on user list to look -- Karsten? Carl. _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

-----BEGIN PGP SIGNED MESSAGE----- Hash: SHA1 On 01/31/2012 02:20 PM, Brown, Chris (GE Healthcare) wrote:
Last time I tried to send an attachment I found the limit to be 40KB.
Yes, that is the default. I just upped it to 5000KB. Hopefully that won't be a problem for anyone, and we should consider alternate ways to share files than this list if we find we need to regularly. (I think 100+KB log files are fine for the list, fwiw.) BTW, if anyone is interested in helping administrate this list, I'm always looking to spread the responsibility. If you haven't admin'd a Mailman list before, this is a good opportunity to learn. - - Karsten
-----Original Message----- From: users-bounces@ovirt.org [mailto:users-bounces@ovirt.org] On Behalf Of Carl Trieloff Sent: Tuesday, January 31, 2012 4:14 PM To: Itamar Heim Cc: users@ovirt.org Subject: Re: [Users] Host discovery failing due to host network being lost
On 01/31/2012 05:14 PM, Itamar Heim wrote:
what is the current limit? 114KB means even a screen shot won't pass through.
Don't know who all is admin on user list to look -- Karsten?
Carl. _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
- -- name: Karsten 'quaid' Wade, Sr. Community Architect team: Red Hat Community Architecture & Leadership uri: http://communityleadershipteam.org http://TheOpenSourceWay.org gpg: AD0E0C41 -----BEGIN PGP SIGNATURE----- Version: GnuPG v1.4.11 (GNU/Linux) Comment: Using GnuPG with Mozilla - http://enigmail.mozdev.org/ iD8DBQFPKHKU2ZIOBq0ODEERAo+RAJ0TyklGtReMXQDHpHfYpx8eSeYkMgCgwCyT 104W6iWjhZKiVW3u7WpHiJM= =7bVI -----END PGP SIGNATURE-----
participants (7)
-
Brown, Chris (GE Healthcare)
-
Carl Trieloff
-
Deepak C Shetty
-
Haim Ateya
-
Itamar Heim
-
Karsten 'quaid' Wade
-
Moran Goldboim