From danken at redhat.com Thu May 2 06:53:46 2013
From: danken at redhat.com (Dan Kenigsberg)
Date: Thu, 2 May 2013 09:53:46 +0300
Subject: Direct Host Address
In-Reply-To: <1437433971.1365545.1361099555481.JavaMail.root@redhat.com>
References: <20130217104402.GG23843@redhat.com>
	<1437433971.1365545.1361099555481.JavaMail.root@redhat.com>
Message-ID: <20130502065346.GA20465@redhat.com>

On Sun, Feb 17, 2013 at 06:12:35AM -0500, Alon Bar-Lev wrote:
> 
> 
> ----- Original Message -----
> > From: "Dan Kenigsberg"
> > To: "Alon Bar-Lev"
> > Cc: "Muli Salem" , arch at ovirt.org
> > Sent: Sunday, February 17, 2013 12:44:02 PM
> > Subject: Re: Direct Host Address
> > 
> > On Thu, Feb 14, 2013 at 03:27:57PM -0500, Alon Bar-Lev wrote:
> > > 
> > > 
> > > ----- Original Message -----
> > > > From: "Muli Salem"
> > > > To: "Mike Kolesnik"
> > > > Cc: arch at ovirt.org
> > > > Sent: Sunday, February 10, 2013 6:09:30 PM
> > > > Subject: Re: Direct Host Address
> > > > 
> > > > > "Current behaviour assumes the network interface with the
> > > > > specified address is configured properly in the engine although
> > > > > this may not be the case initially"
> > > > > 
> > > > > I don't understand what this means; which interface are you
> > > > > referring to, and what does it have to do with being configured
> > > > > in the engine?
> > > > > The next line is also unclear to me:
> > > > > "The direct address allows the engine to connect to the host,
> > > > > without knowing the exact configuration of the network
> > > > > interface that has the address."
> > > > > 
> > > > 
> > > > Regarding the last two sentences you quoted:
> > > > 
> > > > I am referring to the interface that has the IP that the user
> > > > gives us (with regards to current behavior).
> > > > At the moment, we assume that the given IP is for an interface
> > > > that can communicate with the engine (when in practice, this may
> > > > not be the case).
> > > > So separating the two addresses allows us to ask the admin for an
> > > > alternate IP address that will allow communication without
> > > > needing to know the specific configuration (for example, whether
> > > > this is a VLAN network or not).
> > > > 
> > > > Perhaps the wording should be changed a bit to clarify.
> > > 
> > > I still don't get it... can you please provide a real-world use
> > > case?
> > > 
> > > When can we access the alternate address and not the management
> > > address?
> > 
> > We have customers who want to install vdsm via a native connection,
> > but manage it over a VLAN. If you want to add a fresh host, you
> > cannot use its management address (that sits inside the vlan).
> 
> So as far as I understand, the prerequisite of this feature is to
> perform host deploy (ovirt-host-deploy) without constructing the
> management bridge.
> 
> In this mode, during host deployment (ovirt-host-deploy) the engine
> uses the host name only to be able to construct a proper CN field for
> the certificate of VDSM.
> 
> Then the engine performs provisioning of the host to define the host
> name's IP address on the optional vlan id that is shared by the entire
> cluster.
> 
> Maybe there will not be a management bridge at all, which makes me
> happy!
> 
> This means that:
> 1. No management bridge name is to be provided to ovirt-host-deploy
> (this is to be done as the prerequisite of the feature).
> 2. A new parameter for the VdsDeploy at the engine side for the IP
> address to SSH to.
> 3. After VdsDeploy is finished, there is provisioning to define the
> management address on the host using standard VDSM communication.
> 
> Am I missing anything?

You haven't - but apparently I have. Your point (2) would be useful only
when we can use it to tunnel the getVdsCaps and setupNetworks verbs on
top of ssh. Without this ability, we gain hardly anything: the first
Engine-to-vdsm call would have to go to the management interface, using
its proper CN, meaning that we have to have the management interface up
and running and answering xmlrpc before we have a chance to set up a
vlan tag for it.

Thus, we decided to scrap this point (2) for now.

From dneary at redhat.com Mon May 6 22:55:19 2013
From: dneary at redhat.com (Dave Neary)
Date: Tue, 07 May 2013 00:55:19 +0200
Subject: 3.3 release engineering - awesome page!
Message-ID: <518834D7.3030708@redhat.com>

Hi all,

I came across this page today when looking for information on the 3.3
release timeline: http://www.ovirt.org/oVirt_3.3_release-management

This is awesome! I was a little surprised, because I hadn't seen a lot
of discussion on the arch list about the 3.3 release planning, but this
is a great resource to have.

Can I suggest one improvement which could be done this week, please?
There's no way of knowing from this page which features in the MUST and
SHOULD lists are done and just awaiting release, in need of QE and
testing, being actively worked on (and if so, by whom), and which
features are there aspirationally and won't be worked on unless someone
volunteers to pick them up.

This would also give an opportunity to see whether the 2013-05-31 beta
release date is at all realistic now, and whether we need a feature
bump/feature freeze meeting soon to allow us to stay on schedule.

If you're working on a feature that's on this page and it's not
finished, would you mind adding a link to the feature page (if there is
one) and adding your name to the feature, please? And if the feature is
done, please link to the feature page, or some other resource that
allows people to use and test it - and (if possible) the changelog
entry/gerrit change where the patch was included.

Thanks!
Dave.

-- 
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From masayag at redhat.com Tue May 7 11:22:19 2013
From: masayag at redhat.com (Moti Asayag)
Date: Tue, 7 May 2013 07:22:19 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <20130101124757.GI7274@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
Message-ID: <1264143158.8051712.1367925739116.JavaMail.root@redhat.com>

I stumbled upon a few issues with the current design while implementing
it:

There seems to be a requirement to reboot the host after the
installation is completed, in order to ensure the host is recoverable.

Therefore, the building blocks of the installation process of 3.3 are:
1. Host deploy, which installs the host except for configuring its
management network.
2. SetupNetwork (and CommitNetworkChanges) - for creating the management
network on the host and persisting the network configuration.
3. Reboot the host - this is a missing piece. (The engine has a FenceVds
command, but it requires the power management to be configured prior to
the installation, and might be irrelevant for hosts without PM.)

So, there are a couple of issues here:
1. How to reboot the host?
1.1. By exposing a new RebootNode verb in VDSM and invoking it from the
engine
1.2. By opening an ssh dialog to the host in order to execute the reboot

2. When to perform the reboot?
2.1. After host deploy, by utilizing host deploy to perform the reboot.
It requires configuring the network by the monitor when the host is
detected by the engine, detached from the installation flow. However, it
is a step toward the non-persistent network feature yet to be defined.
2.2. After setupNetwork is done and the network was configured and
persisted on the host. There is no special advantage from a
recoverability aspect, as setupNetwork is constantly used to persist the
network configuration (by the complementary CommitNetworkChanges
command). In case the network configuration fails, VDSM will revert to
the last known-good configuration - so connectivity with the engine
should be restored. Design-wise, it fits to configure the management
network as part of the installation sequence. If the network
configuration fails in this context, the host status will be set to
"InstallFailed" rather than "NonOperational", as might occur as a result
of a failed setupNetwork command.

Your inputs are welcome.

Thanks,
Moti
----- Original Message -----
> From: "Dan Kenigsberg"
> To: "Simon Grinberg" , "Moti Asayag"
> Cc: "arch"
> Sent: Tuesday, January 1, 2013 2:47:57 PM
> Subject: Re: feature suggestion: initial generation of management network
> 
> On Thu, Dec 27, 2012 at 07:36:40AM -0500, Simon Grinberg wrote:
> > 
> > 
> > ----- Original Message -----
> > > From: "Dan Kenigsberg"
> > > To: "Simon Grinberg"
> > > Cc: "arch"
> > > Sent: Thursday, December 27, 2012 2:14:06 PM
> > > Subject: Re: feature suggestion: initial generation of management
> > > network
> > > 
> > > On Tue, Dec 25, 2012 at 09:29:26AM -0500, Simon Grinberg wrote:
> > > > 
> > > > 
> > > > ----- Original Message -----
> > > > > From: "Dan Kenigsberg"
> > > > > To: "arch"
> > > > > Sent: Tuesday, December 25, 2012 2:27:22 PM
> > > > > Subject: feature suggestion: initial generation of management
> > > > > network
> > > > > 
> > > > > Current condition:
> > > > > ==================
> > > > > The management network, named ovirtmgmt, is created during host
> > > > > bootstrap. It consists of a bridge device, connected to the
> > > > > network device that was used to communicate with Engine (nic,
> > > > > bonding or vlan). It inherits its IP settings from the latter
> > > > > device.
> > > > > 
> > > > > Why Is the Management Network Needed?
> > > > > =====================================
> > > > > Understandably, some may ask why we need to have a management
> > > > > network - why having a host with IPv4 configured on it is not
> > > > > enough. The answer is twofold:
> > > > > 1. In oVirt, a network is an abstraction of the resources
> > > > > required for connectivity of a host for a specific usage. This
> > > > > is true for the management network just as it is for a VM
> > > > > network or a display network. The network entity is the key for
> > > > > adding/changing nics and IP addresses.
> > > > > 2. In many occasions (such as small setups) the management
> > > > > network is used as a VM/display network as well.
> > > > > 
> > > > > Problems in current connectivity:
> > > > > ================================
> > > > > According to alonbl of ovirt-host-deploy fame, and with no
> > > > > conflict to my own experience, creating the management network
> > > > > is the most fragile, error-prone step of bootstrap.
> > > > 
> > > > +1,
> > > > I've raised that repeatedly in the past: bootstrap should not
> > > > create the management network but pick up the existing
> > > > configuration, and let the engine override it later with its own
> > > > configuration if it differs. I'm glad that we finally get to
> > > > that.
> > > > 
> > > > > 
> > > > > Currently it always creates a bridged network (even if the DC
> > > > > requires a non-bridged ovirtmgmt), it knows nothing about the
> > > > > defined MTU for ovirtmgmt, it uses ping to guess on top of
> > > > > which device to build (and thus requires Vdsm-to-Engine reverse
> > > > > connectivity), and is the sole remaining user of the
> > > > > addNetwork/vdsm-store-net-conf scripts.
> > > > > 
> > > > > Suggested feature:
> > > > > ==================
> > > > > Bootstrap would avoid creating a management network. Instead,
> > > > > after bootstrapping a host, Engine would send a getVdsCaps
> > > > > probe to the installed host, receiving a complete picture of
> > > > > the network configuration on the host. Among this picture is
> > > > > the device that holds the host's management IP address.
> > > > > 
> > > > > Engine would send a setupNetworks command to generate ovirtmgmt
> > > > > with details devised from this picture, and according to the DC
> > > > > definition of ovirtmgmt. For example, if Vdsm reports:
> > > > > 
> > > > > - vlan bond4.3000 has the host's IP, configured to use dhcp.
> > > > > - bond4 comprises eth2 and eth3
> > > > > - ovirtmgmt is defined as a VM network with MTU 9000
> > > > > 
> > > > > then Engine sends the likes of:
> > > > > setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4,
> > > > > bonding=bond4: {eth2,eth3}, MTU=9000})
> > > > 
> > > > Just one comment here,
> > > > In order to save time and confusion - if ovirtmgmt is defined
> > > > with default values, meaning the user did not bother to touch it,
> > > > let it pick up the VLAN configuration from the first host added
> > > > in the Data Center.
> > > > 
> > > > Otherwise, you may override the host VLAN and lose connectivity.
> > > > 
> > > > This will also solve the situation many users encounter today:
> > > > 1. The engine is on a host that actually has a VLAN defined
> > > > 2. The ovirtmgmt network was not updated in the DC
> > > > 3. A host, with the VLAN already defined, is added - everything
> > > > works fine
> > > > 4. Any number of hosts are now added; again everything seems to
> > > > work fine.
> > > > 
> > > > But now try to use setupNetworks, and you'll find out that you
> > > > can't do much on the interface that contains ovirtmgmt, since the
> > > > definition does not match. You can't sync (since this will remove
> > > > the VLAN and cause connectivity loss), and you can't add more
> > > > networks on top, since it already has a non-VLAN network on top
> > > > according to the DC definition, etc.
> > > > 
> > > > On the other hand, you can't update the ovirtmgmt definition on
> > > > the DC, since there are clusters in the DC that use the network.
> > > > 
> > > > The only workaround not involving a DB hack to change the VLAN on
> > > > the network is to:
> > > > 1. Create a new DC
> > > > 2. Do not use the wizard that pops up to create your cluster.
> > > > 3. Modify the ovirtmgmt network to have VLANs
> > > > 4. Now create a cluster and add your hosts.
> > > > 
> > > > If you insist on using the default DC and cluster, then before
> > > > adding the first host, create an additional DC and move the
> > > > Default cluster over there. You may then change the network on
> > > > the Default cluster and then move the Default cluster back.
> > > > 
> > > > Both are ugly, and should be solved by the proposal above.
> > > > 
> > > > We do something similar for the Default cluster CPU level, where
> > > > we set the initial level based on the first host added to the
> > > > cluster.
> > > 
> > > I'm not sure what Engine has for the Default cluster CPU level. But
> > > I have reservations about the hysteresis in your proposal - after a
> > > host is added, the DC cannot forget ovirtmgmt's vlan.
> > > 
> > > How about letting the admin edit ovirtmgmt's vlan at the DC level,
> > > thus rendering all hosts out-of-sync? Then the admin could
> > > manually, or through a script, or in the future through a
> > > distributed operation, sync all the hosts to the definition.
> > 
> > Usually if you do that you will lose connectivity to the hosts.
> 
> Yes, changing the management vlan id (or ip address) is never fun, and
> requires out-of-band intervention.
> 
> > I'm not insisting on the automatic adjustment of the ovirtmgmt
> > network to match the hosts' (that is just a nice touch); we can take
> > the allow-edit approach.
> > 
> > But allowing a change of the VLAN on the ovirtmgmt network will
> > indeed solve the issue I'm trying to solve, while creating another
> > issue of the user expecting that we'll be able to re-tag the host
> > from the engine side, which is challenging to do.
> > 
> > On the other hand, if we allow changing the VLAN as long as the
> > change matches the hosts' configuration, it will solve the issue
> > while not misleading the user into thinking that we can really solve
> > the chicken-and-egg issue of re-tagging the entire system.
> > 
> > Now with the above ability you do get a flow to do the re-tag:
> > 1. Place all the hosts in maintenance
> > 2. Re-tag the ovirtmgmt on all the hosts
> > 3. Re-tag the host the engine is on
> > 4. Activate the hosts - this should work well now since connectivity
> > exists
> > 5. Change the tag on ovirtmgmt on the engine to match the hosts'
> > 
> > Simple and clear process.
> > 
> > When the workaround of creating another DC was not possible, since
> > the system was already long in use and the need was a re-tag of the
> > network, the above is what I've recommended, except that steps 4-5
> > were done as:
> > 4. Stop the engine
> > 5. Change the tag in the DB
> > 6. Start the engine
> > 7. Activate the hosts
> 
> Sounds reasonable to me - but as far as I am aware this is not tightly
> related to the $Subject, which is the post-boot ovirtmgmt definition.
> 
> I've added a few details to
> http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization#Engine
> and I would appreciate a review from someone with intimate Engine
> know-how.
> 
> Dan.
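To make the setupNetworks example from the quoted proposal concrete,
here is a minimal Python sketch of how a caller could derive the
arguments from a getVdsCaps report. This is illustrative only: the caps
dictionary layout is an assumption loosely modeled on what getVdsCaps
reports (vlans, bondings), and build_ovirtmgmt_request is a hypothetical
helper, not Engine code (the real Engine is Java).

    import pprint


    def build_ovirtmgmt_request(caps, mgmt_ip, dc_definition):
        """Shape an ovirtmgmt definition on top of the device that
        holds the host's management IP, per the DC definition."""
        attrs = {'bridged': dc_definition.get('bridged', True),
                 'mtu': dc_definition.get('mtu', 1500)}
        bondings = {}

        # Find the device that carries the management IP (a vlan here;
        # a fuller sketch would also scan nics and existing bridges).
        for vlan, info in caps.get('vlans', {}).items():
            if info.get('addr') == mgmt_ip:
                iface, tag = vlan.rsplit('.', 1)  # 'bond4.3000' -> bond4, 3000
                attrs.update({'nic': iface, 'vlan': tag,
                              'bootproto': info.get('bootproto', 'dhcp')})
                break

        # If the underlying device is a bond, carry its slaves over too.
        bond = attrs.get('nic')
        if bond in caps.get('bondings', {}):
            bondings[bond] = {'nics': caps['bondings'][bond]['slaves']}

        return {'ovirtmgmt': attrs}, bondings


    # Usage, mirroring the bond4.3000 example in the quoted mail:
    caps = {'vlans': {'bond4.3000': {'addr': '192.0.2.10',
                                     'bootproto': 'dhcp'}},
            'bondings': {'bond4': {'slaves': ['eth2', 'eth3']}}}
    networks, bondings = build_ovirtmgmt_request(
        caps, '192.0.2.10', {'bridged': True, 'mtu': 9000})
    pprint.pprint((networks, bondings))  # -> arguments for setupNetworks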
From mkolesni at redhat.com Tue May 7 11:33:15 2013
From: mkolesni at redhat.com (Mike Kolesnik)
Date: Tue, 7 May 2013 07:33:15 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
Message-ID: <1633347300.8227458.1367926395327.JavaMail.root@redhat.com>

----- Original Message -----
> I stumbled upon a few issues with the current design while
> implementing it:
> 
> There seems to be a requirement to reboot the host after the
> installation is completed, in order to ensure the host is recoverable.
> 
> Therefore, the building blocks of the installation process of 3.3 are:
> 1. Host deploy, which installs the host except for configuring its
> management network.
> 2. SetupNetwork (and CommitNetworkChanges) - for creating the
> management network on the host and persisting the network
> configuration.
> 3. Reboot the host - this is a missing piece. (The engine has a
> FenceVds command, but it requires the power management to be
> configured prior to the installation, and might be irrelevant for
> hosts without PM.)
> 
> So, there are a couple of issues here:
> 1. How to reboot the host?
> 1.1. By exposing a new RebootNode verb in VDSM and invoking it from
> the engine

This sounds like a solid and good API to me.

> 1.2. By opening an ssh dialog to the host in order to execute the
> reboot

How would you do this?

> 
> 2. When to perform the reboot?
> 2.1. After host deploy, by utilizing host deploy to perform the
> reboot. It requires configuring the network by the monitor when the
> host is detected by the engine, detached from the installation flow.
> However, it is a step toward the non-persistent network feature yet to
> be defined.

I am not sure this statement has merit; if the feature is yet to be
defined, how can we know if this is a step towards it or not?

Anyway, I'm not sure that this is a good design - should we set up the
network when the host returns from non-responsive status?

> 2.2. After setupNetwork is done and the network was configured and
> persisted on the host. There is no special advantage from a
> recoverability aspect, as setupNetwork is constantly used to persist
> the network configuration (by the complementary CommitNetworkChanges
> command). In case the network configuration fails, VDSM will revert to
> the last known-good configuration - so connectivity with the engine
> should be restored. Design-wise, it fits to configure the management
> network as part of the installation sequence. If the network
> configuration fails in this context, the host status will be set to
> "InstallFailed" rather than "NonOperational", as might occur as a
> result of a failed setupNetwork command.

This sounds like a good solution to me, design-wise. The host is
installed, and with that the communication with the management network
is configured. If this communication is not possible, the host failed
to install (also meaning it's not operational). I see no problem with
this approach.

> 
> 
> Your inputs are welcome.
> 
> Thanks,
> Moti
> ----- Original Message -----
> [...]

_______________________________________________
Arch mailing list
Arch at ovirt.org
http://lists.ovirt.org/mailman/listinfo/arch

From ofrenkel at redhat.com Tue May 7 13:11:05 2013
From: ofrenkel at redhat.com (Omer Frenkel)
Date: Tue, 7 May 2013 09:11:05 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
Message-ID: <497587874.11990711.1367932265918.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Moti Asayag"
> To: "arch"
> Cc: "Alon Bar-Lev"
> Sent: Tuesday, May 7, 2013 2:22:19 PM
> Subject: Re: feature suggestion: initial generation of management network
> 
> [...]
> 
> So, there are a couple of issues here:
> 1. How to reboot the host?
> 1.1. By exposing a new RebootNode verb in VDSM and invoking it from
> the engine
> 1.2. By opening an ssh dialog to the host in order to execute the
> reboot

Why not send a reboot flag to the CommitNetworkChanges, which is sent
anyway? One less call (or connection, if you choose ssh) and easier to
do.

> 2. When to perform the reboot?
> [...]
> 
> Your inputs are welcome.
> 
> Thanks,
> Moti

_______________________________________________
Arch mailing list
Arch at ovirt.org
http://lists.ovirt.org/mailman/listinfo/arch

From masayag at redhat.com Tue May 7 13:31:47 2013
From: masayag at redhat.com (Moti Asayag)
Date: Tue, 7 May 2013 09:31:47 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <497587874.11990711.1367932265918.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
	<497587874.11990711.1367932265918.JavaMail.root@redhat.com>
Message-ID: <1596048609.8203468.1367933507824.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Omer Frenkel"
> To: "Moti Asayag"
> Cc: "arch" , "Alon Bar-Lev"
> Sent: Tuesday, May 7, 2013 4:11:05 PM
> Subject: Re: feature suggestion: initial generation of management network
> 
> > [...]
> > 1.1. By exposing a new RebootNode verb in VDSM and invoking it from
> > the engine
> > 1.2. By opening an ssh dialog to the host in order to execute the
> > reboot
> 
> Why not send a reboot flag to the CommitNetworkChanges, which is sent
> anyway? One less call (or connection, if you choose ssh) and easier to
> do.

Adding a reboot parameter to CommitNetworkChanges (setSafeNetworkConfig
on the vdsm side) exceeds its logical scope, which is persisting the
network changes. Needless to say, if such functionality will be required
elsewhere, it couldn't be properly reused if implemented as part of that
command.

Adding Dan to comment on this as well.

> > [...]

_______________________________________________
Arch mailing list
Arch at ovirt.org
http://lists.ovirt.org/mailman/listinfo/arch
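For illustration, option 1.1 above (a dedicated reboot verb in VDSM)
might look roughly like the sketch below. This is an assumption-heavy
sketch, not an actual VDSM patch: the verb name comes from the thread,
but the class placement, response format, and use of shutdown(8) are
invented for the example.

    # Illustrative sketch only - not actual VDSM code.
    import subprocess


    class Global(object):
        """Stand-in for the object on which VDSM exposes global verbs."""

        def rebootNode(self, delayMinutes=1):
            # Delay the reboot so this call can still return a normal
            # response to the engine before connectivity drops.
            rc = subprocess.call(['shutdown', '-r', '+%d' % delayMinutes])
            if rc != 0:
                return {'status': {'code': rc,
                                   'message': 'scheduling reboot failed'}}
            return {'status': {'code': 0, 'message': 'reboot scheduled'}}

A verb of this shape would keep the resource-releasing logic on the VDSM
side, which is the argument made for option 1.1 in the message that
follows.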
From masayag at redhat.com Wed May 8 07:39:17 2013
From: masayag at redhat.com (Moti Asayag)
Date: Wed, 8 May 2013 03:39:17 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1633347300.8227458.1367926395327.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
	<1633347300.8227458.1367926395327.JavaMail.root@redhat.com>
Message-ID: <803219126.8564890.1367998757682.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Mike Kolesnik"
> To: "Moti Asayag"
> Cc: "arch" , "Alon Bar-Lev"
> Sent: Tuesday, May 7, 2013 2:33:15 PM
> Subject: Re: feature suggestion: initial generation of management network
> 
> ----- Original Message -----
> > [...]
> > 1.1. By exposing a new RebootNode verb in VDSM and invoking it from
> > the engine
> 
> This sounds like a solid and good API to me.
> 
> > 1.2. By opening an ssh dialog to the host in order to execute the
> > reboot
> 
> How would you do this?
> 

Either by reusing the same ssh dialog used by VdsDeploy or by opening a
new one. But there is a downside to this solution: vdsm is already
operational and was used to configure the network, so accessing the host
directly, bypassing vdsm, would break the engagement method from engine
to vdsm. In addition, rebooting a host might require releasing
resources, so it is better to have this logic inside VDSM.

> > 
> > 2. When to perform the reboot?
> > 2.1. After host deploy, by utilizing host deploy to perform the
> > reboot. It requires configuring the network by the monitor when the
> > host is detected by the engine, detached from the installation flow.
> > However, it is a step toward the non-persistent network feature yet
> > to be defined.
> 
> I am not sure this statement has merit; if the feature is yet to be
> defined, how can we know if this is a step towards it or not?
> > 2. When to perform the reboot?
> > 2.1. After host deploy, by utilizing the host deploy to perform the
> > reboot. It requires the monitor to configure the network when the host
> > is detected by the engine, detached from the installation flow. However,
> > it is a step toward the non-persistent network feature yet to be defined.
>
> I am not sure this statement has merit: if the feature is yet to be
> defined, how can we know whether this is a step towards it or not?
>
> Anyway, I'm not sure that this is a good design - should we set up the
> network when a host returns from non-responsive status?

I raised this option only as an alternative; I'm in favour of the second one
(2.2).

> > 2.2. After setupNetwork is done and the network was configured and
> > persisted on the host. There is no special advantage from the
> > recoverability aspect, as setupNetwork is constantly used to persist the
> > network configuration (by the complementary CommitNetworkChanges
> > command). In case the network configuration fails, VDSM will revert to
> > the last well-known configuration - so connectivity with the engine
> > should be restored. Design-wise, it fits to configure the management
> > network as part of the installation sequence. If the network
> > configuration fails in this context, the host status will be set to
> > "InstallFailed" rather than "NonOperational", as might occur as a result
> > of a failed setupNetwork command.
>
> This sounds like the good solution to me, design-wise. The host is
> installed and with that the communication with the management network is
> configured. If this communication is not possible, the host failed to
> install (also meaning it's not operational). I see no problem with this
> approach.
>
> > Your inputs are welcome.
> >
> > Thanks,
> > Moti
> >
> > [...]

From danken at redhat.com Wed May 8 13:35:49 2013
From: danken at redhat.com (Dan Kenigsberg)
Date: Wed, 8 May 2013 16:35:49 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1596048609.8203468.1367933507824.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com>
Message-ID: <20130508133549.GA17279@redhat.com>

On Tue, May 07, 2013 at 09:31:47AM -0400, Moti Asayag wrote:
> [...]
> > why not send a reboot flag to the CommitNetworkChanges which is sent
> > anyway, one less call (or connection if you choose ssh) and easier to
> > do.
>
> Adding a reboot parameter to the CommitNetworkChanges
> (setSafeNetworkConfig on the vdsm side) exceeds its logical scope, which
> is persisting the network changes.
>
> Needless to say, if such functionality is required elsewhere, it couldn't
> be properly reused if implemented as part of that command.
>
> Adding Dan to comment on this as well.

Yeah, a "reboot-after-me" flag defies my sense of cleanliness.
If reboot-after-initial-net-config is crucial, we would need to add a
special verb for that (or use the fenceNode verb if available).

However, I am not sure that this reboot is unavoidable.
Originally the reboot had two important goals:
- make sure that the updated kernel is running
- make sure that the network, which we tweak during bootstrap, is
  accessible after boot

Nowadays, the kernel does not change THAT often, for all ovirt can
matter. Running an oldish kernel is not the end of the world.

And with Moti's feature implemented, we no longer tweak net config
blindly during boot. We use a well-defined setupNetwork API, with a
well-tested rollback mechanism.

The bottom line is that, in my opinion, reboot-after-install can be
skipped these days.
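To make the intended flow concrete, here is a rough engine-side sketch over
vdsm's xmlrpc API (the verb names are the ones discussed in this thread,
but exact signatures and response fields vary per version, and
mgmt_device_of() stands in for engine logic not shown - treat all of it as
illustration only):

    import xmlrpclib  # Python 2; TLS/cert handling omitted for brevity

    def init_mgmt_network(host_address, dc_mgmt, mgmt_device_of):
        """Illustrative only: derive ovirtmgmt from the device holding
        the host's management IP, then persist it - no reboot involved."""
        server = xmlrpclib.ServerProxy('https://%s:54321' % host_address)

        # 1. Probe the fresh host for its current network picture.
        caps = server.getVdsCapabilities()

        # 2. Pick the device that holds the management IP
        #    (e.g. vlan bond4.3000 on top of bond4).
        device = mgmt_device_of(caps)

        # 3. Create ovirtmgmt on top of that device, per the DC definition,
        #    asking vdsm to roll back if connectivity is lost.
        attrs = {'bridged': dc_mgmt['bridged'], 'mtu': dc_mgmt['mtu']}
        for key in ('vlan', 'bonding', 'nic'):
            if key in device:
                attrs[key] = device[key]
        server.setupNetworks({'ovirtmgmt': attrs},
                             {},  # no bonding changes in this sketch
                             {'connectivityCheck': True})

        # 4. Persist only after the rollback-protected setup succeeded.
        server.setSafeNetworkConfig()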
> [...]

From mburns at redhat.com Wed May 8 15:26:00 2013
From: mburns at redhat.com (Mike Burns)
Date: Wed, 08 May 2013 11:26:00 -0400
Subject: Updates-Testing repo
In-Reply-To: <518A3CB4.8010501@redhat.com>
References: <518A3CB4.8010501@redhat.com>
Message-ID: <518A6E88.6020106@redhat.com>

oVirt Engine RPMs for the 3.2.2 update are uploaded to this repo.

Thanks

Mike

On 05/08/2013 07:53 AM, Mike Burns wrote:
> A new repo is now available on resources.ovirt.org. It contains packages
> that are targeted to be shipped as updates to the current stable oVirt
> release.
>
> This repo is located at:
>
> http://resources.ovirt.org/releases/updates-testing/
>
> New ovirt-release packages are also available that contain the repo
> (disabled by default).
>
> http://resources.ovirt.org/releases/ovirt-release-fedora-6-1.noarch.rpm
> http://resources.ovirt.org/releases/ovirt-release-el6-6-1.noarch.rpm
>
> Let me know if there are any questions.
>
> Thanks
>
> Mike
> _______________________________________________
> Announce mailing list
> Announce at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/announce

From lpeer at redhat.com Thu May 9 05:42:28 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Thu, 09 May 2013 08:42:28 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <20130508133549.GA17279@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com>
Message-ID: <518B3744.8020209@redhat.com>

On 05/08/2013 04:35 PM, Dan Kenigsberg wrote:
> [...]
>
> Yeah, a "reboot-after-me" flag defies my sense of cleanliness.
> If reboot-after-initial-net-config is crucial, we would need to add a
> special verb for that (or use the fenceNode verb if available).
>

+1

>
> However, I am not sure that this reboot is unavoidable.
> Originally the reboot had two important goals:
> - make sure that the updated kernel is running
> - make sure that the network, which we tweak during bootstrap, is
>   accessible after boot
>
> Nowadays, the kernel does not change THAT often, for all ovirt can
> matter. Running an oldish kernel is not the end of the world.
>
> And with Moti's feature implemented, we no longer tweak net config
> blindly during boot. We use a well-defined setupNetwork API, with a
> well-tested rollback mechanism.
>
> The bottom line is that, in my opinion, reboot-after-install can be
> skipped these days.
>

Adding Barak to the thread, as I think he had some concerns about removing
the reboot after install.

> [...]

From alonbl at redhat.com Thu May 9 14:42:09 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Thu, 9 May 2013 10:42:09 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1633347300.8227458.1367926395327.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <1633347300.8227458.1367926395327.JavaMail.root@redhat.com>
Message-ID: <868990068.5761536.1368110529578.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Mike Kolesnik"
> To: "Moti Asayag"
> Cc: "Alon Bar-Lev" , "arch"
> Sent: Tuesday, May 7, 2013 2:33:15 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
>
> Anyway, I'm not sure that this is a good design - should we set up the
> network when a host returns from non-responsive status?

Exactly.
Imagine that after a reboot only a single interface is configured to allow
communication to the engine.
Once the engine connects to the host, it re-configures anything it needs on
that host.
A completely stateless host.

> [...]

From asegurap at redhat.com Thu May 9 14:52:38 2013
From: asegurap at redhat.com (Antoni Segura Puimedon)
Date: Thu, 9 May 2013 10:52:38 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <868990068.5761536.1368110529578.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <1633347300.8227458.1367926395327.JavaMail.root@redhat.com> <868990068.5761536.1368110529578.JavaMail.root@redhat.com>
Message-ID: <708444892.5769916.1368111158149.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Mike Kolesnik"
> Cc: "arch"
> Sent: Thursday, May 9, 2013 4:42:09 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
>
> Exactly.
> Imagine that after a reboot only a single interface is configured to allow
> communication to the engine.
> Once the engine connects to the host, it re-configures anything it needs
> on that host.
> A completely stateless host.

Not completely stateless; we'd have to keep the configuration for this
privileged interface in case of reboots. Even so, the less state the better,
of course.
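As an illustration, the state that must survive a reboot could be as small
as one record describing how to bring the privileged interface up; the file
location and helpers below are hypothetical, not existing vdsm code:

    import json

    STATE_FILE = '/var/lib/vdsm/mgmt-iface.json'  # hypothetical path

    def save_privileged_iface(device, vlan=None, bootproto='dhcp'):
        """Persist the one interface that must come up by itself at boot."""
        with open(STATE_FILE, 'w') as f:
            json.dump({'device': device, 'vlan': vlan,
                       'bootproto': bootproto}, f)

    def load_privileged_iface():
        """Read the record back at boot; the engine reconfigures the rest."""
        with open(STATE_FILE) as f:
            return json.load(f)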
> [...]

From alonbl at redhat.com Thu May 9 14:55:00 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Thu, 9 May 2013 10:55:00 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <20130508133549.GA17279@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com>
Message-ID: <1556816649.5772576.1368111300185.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Dan Kenigsberg"
> To: "Moti Asayag"
> Cc: "arch"
> Sent: Wednesday, May 8, 2013 4:35:49 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> On Tue, May 07, 2013 at 09:31:47AM -0400, Moti Asayag wrote:
> > [...]
> > > why not send a reboot flag to the CommitNetworkChanges which is sent
> > > anyway, one less call (or connection if you choose ssh) and easier
> > > to do.
> >
> > Adding a reboot parameter to the CommitNetworkChanges
> > (setSafeNetworkConfig on the vdsm side) exceeds its logical scope, which
> > is persisting the network changes.
> >
> > Needless to say, if such functionality is required elsewhere, it
> > couldn't be properly reused if implemented as part of that command.
> >
> > Adding Dan to comment on this as well.
>
> Yeah, a "reboot-after-me" flag defies my sense of cleanliness.

Yes.

> If reboot-after-initial-net-config is crucial, we would need to add a
> special verb for that (or use the fenceNode verb if available).
>
> However, I am not sure that this reboot is unavoidable.
> Originally the reboot had two important goals:
> - make sure that the updated kernel is running
> - make sure that the network, which we tweak during bootstrap, is
>   accessible after boot
>
> Nowadays, the kernel does not change THAT often, for all ovirt can
> matter. Running an oldish kernel is not the end of the world.
>
> And with Moti's feature implemented, we no longer tweak net config
> blindly during boot. We use a well-defined setupNetwork API, with a
> well-tested rollback mechanism.
>
> The bottom line is that, in my opinion, reboot-after-install can be
> skipped these days.

I agree.

The current design of ovirt-host-deploy fully supports a deployment cycle
without requiring a reboot. We no longer update the kernel command line, nor
perform changes without activating them at runtime. The bridge setup was the
one bit that was risky and could not be rolled back cleanly without
re-implementing a large chunk of engine logic.

The risk of a deployed system (without the bridge) being unresponsive after
reboot is minimal:

1. iptables rules are already active.
2. udev rules are active.
3. vdsm is up.

The major risk is basically that some dependency package was UPDATED during
the installation of vdsm while its service/consumer is already running; then
we are running a host with the old software, and there is a chance that
after a reboot with the new software the host will fail.

I think that the decision to reboot a host should be delegated to the
administrator, so adding a vdsm verb to reboot is usable. This way the admin
will be able to take a host into maintenance mode and reboot it, and we can
add a checkbox to the add-host dialog '[x] reboot', rebooting the host at
the end of the sequence. I think the default should be off.
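In engine terms, the tail of the add-host flow would then look roughly like
the sketch below; it assumes the reboot verb discussed in this thread
exists, and every name in it is illustrative (the real engine is Java):

    def finish_add_host(vds, reboot_requested=False):
        """Illustrative tail of the add-host flow; 'vds' wraps the vdsm
        calls, and reboot_requested mirrors the proposed '[x] reboot'
        checkbox (default off)."""
        vds.deploy()              # ovirt-host-deploy, no bridge creation
        vds.setup_mgmt_network()  # setupNetworks + setSafeNetworkConfig
        if reboot_requested:
            vds.reboot()          # the proposed vdsm reboot verb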
> > > > 2. When to perform the reboot?
> > > > 2.1. After host deploy, by utilizing the host deploy to perform the
> > > > reboot.
> > > > It requires the network to be configured by the monitor when the
> > > > host is detected by the engine, detached from the installation
> > > > flow. However, it is a step toward the non-persistent network
> > > > feature yet to be defined.
> > > > 2.2. After setupNetwork is done and the network was configured and
> > > > persisted on the host.
> > > > There is no special advantage from the recoverability aspect, as
> > > > setupNetwork is constantly used to persist the network
> > > > configuration (by the complementary CommitNetworkChanges command).
> > > > In case the network configuration fails, VDSM will revert to the
> > > > last well-known configuration - so connectivity with the engine
> > > > should be restored. Design-wise, it fits to configure the
> > > > management network as part of the installation sequence.
> > > > If the network configuration fails in this context, the host
> > > > status will be set to "InstallFailed" rather than
> > > > "NonOperational", as might occur as a result of a failed
> > > > setupNetwork command.
> > > >
> > > > Your inputs are welcome.
> > > >
> > > > Thanks,
> > > > Moti

From asegurap at redhat.com  Thu May  9 14:57:16 2013
From: asegurap at redhat.com (Antoni Segura Puimedon)
Date: Thu, 9 May 2013 10:57:16 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1556816649.5772576.1368111300185.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com>
Message-ID: <941280855.5773599.1368111436878.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Dan Kenigsberg"
> Cc: "arch"
> Sent: Thursday, May 9, 2013 4:55:00 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
> I think that the decision to reboot a host should be delegated to the
> administrator; adding a vdsm verb to reboot is useful. This way the
> admin will be able to take a host to maintenance mode and reboot it,
> and we can add a checkbox to the add-host dialog '[x] reboot',
> rebooting the host at the end of the sequence. I think the default
> should be off.

I'm also in agreement with the addition of a reboot verb. It could be a
nice addition regardless of this specific use case.

> [...]
From bazulay at redhat.com  Thu May  9 15:12:38 2013
From: bazulay at redhat.com (Barak Azulay)
Date: Thu, 9 May 2013 11:12:38 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <518B3744.8020209@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <518B3744.8020209@redhat.com>
Message-ID: <1441971488.13645005.1368112358854.JavaMail.root@redhat.com>

After reading the thread, top-posting to give my perspective on what I
think would be the best approach.

Some background:
- The reboot at the end of host deployment came after issues in host
  deployment that were discovered only after fencing/reboot (and had
  nothing to do with the reason for the reboot).
- Host deployment takes care of constructing the bridge (when no bonding
  exists on the host).

So, bottom line:
- I think that both handling the network configuration and the reboot
  should be removed from host deployment (I think we all agree on that).
- Host deployment should leave VDSM up and running with everything
  configured (including the SSL and SSH keys), and listening on all
  networks.
- About the reboot - I'm not sure we still need a reboot (this is a
  question even without this discussion).
- Anyway, about how to reboot:
  - I don't think it should be done through a VDSM command.
  - There is an open issue already with enabling a few stages of fencing
    with no Power Management module but based on SSH, so a new
    engine-internal command should be introduced (runSSHCmdOnVds ... and
    this command can have 2 variants ... one of them is reboot ... (the
    other is restart VDSM)).
  - Such a command will not require a password, as the SSH keys should
    already be in place.
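For illustration only, the core of such a command could look roughly like
the following (a sketch, not actual engine code -- the engine's SSH
implementation is Java, paramiko merely stands in to show the idea, and
the key location is an assumption):

    import paramiko

    ENGINE_KEY = '/etc/pki/ovirt-engine/keys/engine_id_rsa'  # assumed path

    def run_ssh_cmd_on_vds(address, command, user='root'):
        """Run one command on a host over SSH, e.g. 'reboot' or
        'service vdsmd restart'."""
        client = paramiko.SSHClient()
        client.set_missing_host_key_policy(paramiko.AutoAddPolicy())
        client.connect(address, username=user, key_filename=ENGINE_KEY)
        try:
            # A rebooting host may drop the connection before replying;
            # we only care that the command was sent.
            client.exec_command(command)
        finally:
            client.close()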
Thoughts/implications on the host lifecycle:
- The easiest approach will be to leave the host in the Installing phase
  throughout this process (all under InstallVdsCommand), and eventually
  move it to reboot.
- If one skips the reboot, then assuming the cluster has more than one
  network, the host will immediately move to Non-Operational (like today).
- We can take a different approach and add a new status to the host -
  PendingNetworkConfig - which could be resolved automatically in the
  future with network labels, or today simply pop up a todo dialog in the
  UI.

Thanks
Barak Azulay

----- Original Message -----
> From: "Livnat Peer"
> To: "Dan Kenigsberg" , "Barak Azulay"
> Cc: "arch"
> Sent: Thursday, May 9, 2013 8:42:28 AM
> Subject: Re: feature suggestion: initial generation of management network
>
> On 05/08/2013 04:35 PM, Dan Kenigsberg wrote:
> > [...]
> > Yeah, a "reboot-after-me" flag defies my sense of cleanliness.
> > If reboot-after-initial-net-config is crucial, we would need to add a
> > special verb for that (or use the fenceNode verb if available).
>
> +1
> > [...]
> > The bottom line is that, in my opinion, reboot-after-install can be
> > skipped these days.
>
> Adding Barak to the thread as I think he had some concern about removing
> the reboot after install.
>
> [...]
From alonbl at redhat.com  Thu May  9 15:36:30 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Thu, 9 May 2013 11:36:30 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1441971488.13645005.1368112358854.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <518B3744.8020209@redhat.com> <1441971488.13645005.1368112358854.JavaMail.root@redhat.com>
Message-ID: <2095276319.5812204.1368113790742.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Barak Azulay"
> To: "Livnat Peer"
> Cc: "arch"
> Sent: Thursday, May 9, 2013 6:12:38 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
> - Anyway, about how to reboot:
>   - I don't think it should be done through a VDSM command.
>   - Such a command will not require a password, as the SSH keys should
>     already be in place.

I disagree.
After the management agent (VDSM in our case) is installed, all
communications should be done via that agent.
Using multi-protocol layers is not wise in this architecture.
For example, if we switch the direction of engine->vdsm communications we
cannot use SSH.

> [...]
From dougsland at redhat.com  Thu May  9 19:58:27 2013
From: dougsland at redhat.com (Douglas Schilling Landgraf)
Date: Thu, 09 May 2013 15:58:27 -0400
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1556816649.5772576.1368111300185.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com>
Message-ID: <518BFFE3.8030308@redhat.com>

On 05/09/2013 10:55 AM, Alon Bar-Lev wrote:
> [...]
Asayag" >>>>> To: "arch" >>>>> Cc: "Alon Bar-Lev" >>>>> Sent: Tuesday, May 7, 2013 2:22:19 PM >>>>> Subject: Re: feature suggestion: initial generation of management >>>>> network >>>>> >>>>> I stumbled upon few issues with the current design while implementing >>>>> it: >>>>> >>>>> There seems to be a requirement to reboot the host after the >>>>> installation >>>>> is completed in order to assure the host is recoverable. >>>>> >>>>> Therefore, the building blocks of the installation process of 3.3 are: >>>>> 1. host deploy which installs the host expect configuring its >>>>> management >>>>> network. >>>>> 2. SetupNetwork (and CommitNetworkChanges) - for creating the >>>>> management >>>>> network >>>>> on the host and persisting the network configuration. >>>>> 3. Reboot the host - This is a missing piece. (engine has FenceVds >>>>> command, >>>>> but it >>>>> requires the power management to be configured prior to the >>>>> installation >>>>> and >>>>> might >>>>> be irrelevant for hosts without PM.) >>>>> >>>>> So, there are couple of issues here: >>>>> 1. How to reboot the host? >>>>> 1.1. By exposing new RebootNode verb in VDSM and invoking it from the >>>>> engine >>>>> 1.2. By opening ssh dialog to the host in order to execute the reboot >>>>> >>>> >>>> why not send a reboot flag to the CommitNetworkChanges which is sent >>>> anyway, >>>> one less call (or connection if you choose ssh) and easier to do. >>>> >>> >>> Adding a reboot parameter to the CommitNetworkChanges (setSafeNetworkConfig >>> on vdsm side) >>> exceeds its logical scope which is persisting the network changes. >>> >>> Needless to say if such functionally will be required elsewhere, it >>> couldn't be >>> properly reused if implemented as part of that command. >>> >>> Adding Dan to comment on this as well. >> >> Yeah, a "reboot-after-me" flag defies my sense of cleanliness. > > Yes. > >> If reboot-after-initial-net-config is crucial, we would need to add a >> special verb for that (or use the fenceNode verb if available). >> >> >> However, I am not sure that this reboot is unavoidable. >> Originally the reboot had two important goal: >> - make sure that the updated kernel is running >> - make sure that the network, which we tweak during bootstrap, is >> accessible after boot >> >> Nowadays, the kernels does not change THAT often, for all ovirt can >> matter. running an oldish kernel is not the end of the world. >> >> And with Moti's feature implemented, we no longer tweak net config >> blindly during boot. We use a well-define setupNetwork API, with a >> well-tested rollback mechanism. >> >> The bottom line is, that in my opinion, reboot-after-install can be >> skipped these days. > > I agree. > > The current design of ovirt-host-deploy fully supports a deployment cycle without requiring a reboot. > We no longer update kernel command-line, nor performing changes without activating them at runtime. > The bridge setup was the one bit that was risky and could not be rolled back cleanly without re-implementation of large chunk of engine logic. > > The risk of a deployed system (without bridge) to be unresponsive after reboot is minimum. > > 1. iptables rules are already active. > 2. udev rules are active. > 3. vdsm is up. > > The major risk is basically if some dependency package was UPDATED during the installation of vdsm, while its service/consumer is already running, then we are running a host with the old software and there is a chance that after reboot with the new software the host will fail. 
> I think that the decision to reboot a host should be delegated to the
> administrator; adding a vdsm verb to reboot is useful. This way the
> admin will be able to take a host to maintenance mode and reboot it,
> and we can add a checkbox to the add-host dialog '[x] reboot',
> rebooting the host at the end of the sequence. I think the default
> should be off.

+1

-- 
Cheers
Douglas

From bazulay at redhat.com  Thu May  9 19:59:59 2013
From: bazulay at redhat.com (Barak Azulay)
Date: Thu, 9 May 2013 15:59:59 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <2095276319.5812204.1368113790742.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <518B3744.8020209@redhat.com> <1441971488.13645005.1368112358854.JavaMail.root@redhat.com> <2095276319.5812204.1368113790742.JavaMail.root@redhat.com>
Message-ID: <1367250542.13826069.1368129599726.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Barak Azulay"
> Cc: "Livnat Peer" , "arch"
> Sent: Thursday, May 9, 2013 6:36:30 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
> I disagree.
> After the management agent (VDSM in our case) is installed, all
> communications should be done via that agent.

First I must say that the reboot may not be a must for host deploy; this
is a legitimate point to discuss.

However, on the other hand, a reboot might still be needed for various
reasons. If it were that simple we wouldn't have to:
- use Power Management cards
- build safety belts around services we depend on

In addition, the fact is that VDSM can turn unresponsive due to many
unexpected reasons, and we might still need the ability to force a host to
reboot (and not through PM).
On the other hand, SSH is very reliable, is used anyway by the engine for
various flows (deploy/upgrade/log collection), and for that reason will be
configured anyway.

> Using multi-protocol layers is not wise in this architecture.

We already do, and it looks like that is here to stay - unless we remove
the host deployment part from ovirt.

> For example, if we switch the direction of engine->vdsm communications
> we cannot use SSH.

The current plans are to move to a bi-directional communication transport
on top of AMQP/TCP; see the work done by Saggi M.

And even if we do - SSH will still be required.

> [...]
From alonbl at redhat.com  Thu May  9 20:24:26 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Thu, 9 May 2013 16:24:26 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1367250542.13826069.1368129599726.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <518B3744.8020209@redhat.com> <1441971488.13645005.1368112358854.JavaMail.root@redhat.com> <2095276319.5812204.1368113790742.JavaMail.root@redhat.com> <1367250542.13826069.1368129599726.JavaMail.root@redhat.com>
Message-ID: <1932720504.6022857.1368131066950.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Barak Azulay"
> To: "Alon Bar-Lev"
> Cc: "Livnat Peer" , "arch"
> Sent: Thursday, May 9, 2013 10:59:59 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
> On the other hand, SSH is very reliable, is used anyway by the engine
> for various flows (deploy/upgrade/log collection), and for that reason
> will be configured anyway.
> [...]
> And even if we do - SSH will still be required.

I think it would be a mistake to require SSH or to rely on the
communication direction (initiated by the engine, for example). Also, the
requirement of a 'root' user at the host is something that should be
avoided.

If you do not trust your own agent, then we need to provide a fail-safe
mechanism for our own core functionality (such as reboot).

SSH is used now only to provision the host, as you wrote:
1. Host-deploy.
2. ovirt-node upgrade.

Log collection is irrelevant, as it is not exactly part of the product,
and can easily be replaced with any other method of log collection.

Both of the above are host provisioning, which can easily be automated
outside of the ovirt domain.

2. Node upgrade - for sure... it can be done manually or via pxe; we
provide this within ovirt-engine just for the sake of friendliness.
1. Host-deploy can be done via an external provisioning framework, such as
Foreman.

Introducing an application dependency on SSH is a new introduction, and
should be considered carefully.

Thanks!

> [...]
From lpeer at redhat.com  Sun May 12 06:59:07 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Sun, 12 May 2013 09:59:07 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
Message-ID: <518F3DBB.6080606@redhat.com>

Thread summary -

1. We all agree the automatic reboot after host installation is not needed
anymore and can be removed.

2. There is vast agreement that we need to add a new VDSM verb for reboot.

3. There was a suggestion to add a checkbox, when adding a host, to reboot
the host after installation; the default would be not to reboot (leaving
the option to reboot to the administrator).
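Put together, the resulting installation flow would look roughly like this
(an illustrative Python sketch only -- the engine is Java;
setupNetworks/setSafeNetworkConfig are the vdsm verbs discussed in this
thread, though their exact signatures are approximated here, and the
'vdsm' client object, the 'eth0' choice and the 'reboot' verb are
assumptions):

    def finish_host_install(vdsm, reboot_requested=False):
        """Post-deploy steps, assuming ovirt-host-deploy already ran and
        'vdsm' is a connected client (e.g. an xmlrpc proxy) for the host."""
        # Create ovirtmgmt; the device would really come from getVdsCaps.
        vdsm.setupNetworks({'ovirtmgmt': {'bridged': True, 'nic': 'eth0'}},
                           {}, {})
        # Persist it - engine's CommitNetworkChanges maps to this verb.
        vdsm.setSafeNetworkConfig()
        # Optional reboot, per the '[x] reboot' checkbox (default off).
        if reboot_requested:
            vdsm.reboot()  # the new verb proposed above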
If there is no objection we'll go with the above.

Thanks, Livnat


On 05/07/2013 02:22 PM, Moti Asayag wrote:
> [...]
>
> ----- Original Message -----
>> From: "Dan Kenigsberg"
>> To: "Simon Grinberg" , "Moti Asayag"
>> Cc: "arch"
>> Sent: Tuesday, January 1, 2013 2:47:57 PM
>> Subject: Re: feature suggestion: initial generation of management network
>>
>> On Thu, Dec 27, 2012 at 07:36:40AM -0500, Simon Grinberg wrote:
>>>
>>> ----- Original Message -----
>>>> From: "Dan Kenigsberg"
>>>> To: "Simon Grinberg"
>>>> Cc: "arch"
>>>> Sent: Thursday, December 27, 2012 2:14:06 PM
>>>> Subject: Re: feature suggestion: initial generation of management
>>>> network
>>>>
>>>> On Tue, Dec 25, 2012 at 09:29:26AM -0500, Simon Grinberg wrote:
>>>>>
>>>>> ----- Original Message -----
>>>>>> From: "Dan Kenigsberg"
>>>>>> To: "arch"
>>>>>> Sent: Tuesday, December 25, 2012 2:27:22 PM
>>>>>> Subject: feature suggestion: initial generation of management
>>>>>> network
>>>>>>
>>>>>> Current condition:
>>>>>> ==================
>>>>>> The management network, named ovirtmgmt, is created during host
>>>>>> bootstrap. It consists of a bridge device, connected to the
>>>>>> network device that was used to communicate with Engine (nic,
>>>>>> bonding or vlan). It inherits its ip settings from the latter
>>>>>> device.
>>>>>>
>>>>>> Why Is the Management Network Needed?
>>>>>> =====================================
>>>>>> Understandably, some may ask why do we need to have a management
>>>>>> network - why having a host with IPv4 configured on it is not
>>>>>> enough. The answer is twofold:
>>>>>> 1. In oVirt, a network is an abstraction of the resources
>>>>>> required for connectivity of a host for a specific usage. This is
>>>>>> true for the management network just as it is for a VM network or
>>>>>> a display network. The network entity is the key for
>>>>>> adding/changing nics and IP addresses.
>>>>>> 2. On many occasions (such as small setups) the management
>>>>>> network is used as a VM/display network as well.
>>>>>>
>>>>>> Problems in current connectivity:
>>>>>> ================================
>>>>>> According to alonbl of ovirt-host-deploy fame, and with no
>>>>>> conflict to my own experience, creating the management network is
>>>>>> the most fragile, error-prone step of bootstrap.
>>>>>
>>>>> +1,
>>>>> I've raised that repeatedly in the past: bootstrap should not
>>>>> create the management network but pick up the existing
>>>>> configuration, and let the engine override it later with its own
>>>>> configuration if it differs. I'm glad that we finally get to that.
>>>>>
>>>>>> Currently it always creates a bridged network (even if the DC
>>>>>> requires a non-bridged ovirtmgmt), it knows nothing about the
>>>>>> defined MTU for ovirtmgmt, it uses ping to guess on top of which
>>>>>> device to build (and thus requires Vdsm-to-Engine reverse
>>>>>> connectivity), and is the sole remaining user of the
>>>>>> addNetwork/vdsm-store-net-conf scripts.
>>>>>>
>>>>>> Suggested feature:
>>>>>> ==================
>>>>>> Bootstrap would avoid creating a management network. Instead,
>>>>>> after bootstrapping a host, Engine would send a getVdsCaps probe
>>>>>> to the installed host, receiving a complete picture of the
>>>>>> network configuration on the host. Among this picture is the
>>>>>> device that holds the host's management IP address.
>>>>>>
>>>>>> Engine would send a setupNetworks command to generate ovirtmgmt
>>>>>> with details devised from this picture, and according to the DC
>>>>>> definition of ovirtmgmt. For example, if Vdsm reports:
>>>>>>
>>>>>> - vlan bond4.3000 has the host's IP, configured to use dhcp.
>>>>>> - bond4 comprises eth2 and eth3
>>>>>> - ovirtmgmt is defined as a VM network with MTU 9000
>>>>>>
>>>>>> then Engine sends the likes of:
>>>>>> setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4,
>>>>>> bonding=bond4: {eth2,eth3}, MTU=9000)
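(A rough sketch of that derivation, in illustrative Python -- the
capabilities field names 'vlans', 'addr', 'bondings' and 'slaves' are
simplified assumptions, and only the vlan-on-bond case of the example is
handled:)

    def build_ovirtmgmt(caps, mgmt_ip, mtu=9000, bridged=True):
        """Derive setupNetworks parameters for ovirtmgmt from a
        getVdsCaps-style report and the host's management IP."""
        for name, attrs in caps.get('vlans', {}).items():
            if attrs.get('addr') == mgmt_ip:
                iface, tag = name.rsplit('.', 1)
                net = {'bridged': bridged, 'MTU': mtu,
                       'vlan': int(tag), 'iface': iface}
                bond = caps.get('bondings', {}).get(iface)
                if bond is not None:
                    net['bonding'] = {iface: bond['slaves']}
                return net
        raise LookupError('no vlan device holds %s' % mgmt_ip)

    # For the example above this would return:
    # {'bridged': True, 'MTU': 9000, 'vlan': 3000, 'iface': 'bond4',
    #  'bonding': {'bond4': ['eth2', 'eth3']}}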
Among this picture is the device that >>>>>> holds >>>>>> the host's management IP address. >>>>>> >>>>>> Engine would send setupNetwork command to generate ovirtmgmt with >>>>>> details devised from this picture, and according to the DC >>>>>> definition >>>>>> of >>>>>> ovirtmgmt. For example, if Vdsm reports: >>>>>> >>>>>> - vlan bond4.3000 has the host's IP, configured to use dhcp. >>>>>> - bond4 is comprises eth2 and eth3 >>>>>> - ovirtmgmt is defined as a VM network with MTU 9000 >>>>>> >>>>>> then Engine sends the likes of: >>>>>> setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4, >>>>>> bonding=bond4: {eth2,eth3}, MTU=9000) >>>>> >>>>> Just one comment here, >>>>> In order to save time and confusion - if the ovirtmgmt is defined >>>>> with default values meaning the user did not bother to touch it, >>>>> let it pick up the VLAN configuration from the first host added in >>>>> the Data Center. >>>>> >>>>> Otherwise, you may override the host VLAN and loose connectivity. >>>>> >>>>> This will also solve the situation many users encounter today. >>>>> 1. The engine in on a host that actually has VLAN defined >>>>> 2. The ovirtmgmt network was not updated in the DC >>>>> 3. A host, with VLAN already defined is added - everything works >>>>> fine >>>>> 4. Any number of hosts are now added, again everything seems to >>>>> work fine. >>>>> >>>>> But, now try to use setupNetworks, and you'll find out that you >>>>> can't do much on the interface that contains the ovirtmgmt since >>>>> the definition does not match. You can't sync (Since this will >>>>> remove the VLAN and cause connectivity lose) you can't add more >>>>> networks on top since it already has non-VLAN network on top >>>>> according to the DC definition, etc. >>>>> >>>>> On the other hand you can't update the ovirtmgmt definition on the >>>>> DC since there are clusters in the DC that use the network. >>>>> >>>>> The only workaround not involving DB hack to change the VLAN on the >>>>> network is to: >>>>> 1. Create new DC >>>>> 2. Do not use the wizard that pops up to create your cluster. >>>>> 3. Modify the ovirtmgmt network to have VLANs >>>>> 4. Now create a cluster and add your hosts. >>>>> >>>>> If you insist on using the default DC and cluster then before >>>>> adding the first host, create an additional DC and move the >>>>> Default cluster over there. You may then change the network on the >>>>> Default cluster and then move the Default cluster back >>>>> >>>>> Both are ugly. And should be solved by the proposal above. >>>>> >>>>> We do something similar for the Default cluster CPU level, where we >>>>> set the intial level based on the first host added to the cluster. >>>> >>>> I'm not sure what Engine has for Default cluster CPU level. But I >>>> have >>>> reservation of the hysteresis in your proposal - after a host is >>>> added, >>>> the DC cannot forget ovirtmgmt's vlan. >>>> >>>> How about letting the admin edit ovirtmgmt's vlan in the DC level, >>>> thus >>>> rendering all hosts out-of-sync. The the admin could manually, or >>>> through a script, or in the future through a distributed operation, >>>> sync >>>> all the hosts to the definition? >>> >>> Usually if you do that you will loose connectivity to the hosts. >> >> Yes, changing the management vlan id (or ip address) is never fun, and >> requires out-of-band intervention. >> >>> I'm not insisting on the automatic adjustment of the ovirtmgmt network to >>> match the hosts' (that is just a nice touch) we can take the allow edit >>> approach. 
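
A side note on Dan's bond4.3000 example above - devising the setupNetworks
arguments from the getVdsCaps picture could look roughly like this (the
caps layout below is a simplification of what getVdsCaps really reports,
and the helper is illustrative, not actual engine code):

    # Simplified capabilities picture for Dan's example.
    caps = {
        'vlans': {'bond4.3000': {'iface': 'bond4', 'vlanid': 3000,
                                 'addr': '192.0.2.10', 'bootproto': 'dhcp'}},
        'bondings': {'bond4': {'slaves': ['eth2', 'eth3']}},
    }
    dc_definition = {'bridged': True, 'mtu': 9000}  # ovirtmgmt in the DC

    def devise_ovirtmgmt(caps, dc_definition, mgmt_ip):
        # Find the device holding the management IP, then merge its
        # topology with the DC-level definition of ovirtmgmt.
        for name, vlan in caps['vlans'].items():
            if vlan['addr'] != mgmt_ip:
                continue
            bond = vlan['iface']
            networks = {'ovirtmgmt': {'bonding': bond,
                                      'vlan': str(vlan['vlanid']),
                                      'bridged': dc_definition['bridged'],
                                      'mtu': dc_definition['mtu'],
                                      'bootproto': vlan['bootproto']}}
            bondings = {bond: {'nics': caps['bondings'][bond]['slaves']}}
            return networks, bondings
        raise LookupError('management IP not found on any vlan device')

    networks, bondings = devise_ovirtmgmt(caps, dc_definition, '192.0.2.10')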
>>>
>>> But allowing to change the VLAN on the ovirtmgmt network will indeed
>>> solve the issue I'm trying to solve, while creating another issue of
>>> users expecting that we'll be able to re-tag the host from the engine
>>> side, which is challenging to do.
>>>
>>> On the other hand, if we allow changing the VLAN as long as the change
>>> matches the hosts' configuration, it will solve the issue while not
>>> misleading the user into thinking that we can really solve the
>>> chicken-and-egg issue of re-tagging the entire system.
>>>
>>> Now with the above ability you do get a flow to do the re-tag:
>>> 1. Place all the hosts in maintenance
>>> 2. Re-tag ovirtmgmt on all the hosts
>>> 3. Re-tag the host that the engine is on
>>> 4. Activate the hosts - this should work well now since connectivity
>>> exists
>>> 5. Change the tag on ovirtmgmt on the engine to match the hosts'
>>>
>>> Simple and clear process.
>>>
>>> When the workaround of creating another DC was not possible, since the
>>> system was already long in use and the need was to re-tag the network,
>>> the above is what I've recommended, except that steps 4-5 were done as:
>>> 4. Stop the engine
>>> 5. Change the tag in the DB
>>> 6. Start the engine
>>> 7. Activate the hosts
>>
>> Sounds reasonable to me - but as far as I am aware this is not tightly
>> related to the $Subject, which is the post-boot ovirtmgmt definition.
>>
>> I've added a few details to
>> http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization#Engine
>> and I would appreciate a review from someone with intimate Engine
>> know-how.
>>
>> Dan.
>>
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch
>

From bazulay at redhat.com  Sun May 12 08:15:20 2013
From: bazulay at redhat.com (Barak Azulay)
Date: Sun, 12 May 2013 04:15:20 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <518F3DBB.6080606@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
	<518F3DBB.6080606@redhat.com>
Message-ID: <557908336.127528.1368346520205.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Livnat Peer"
> To: "Moti Asayag"
> Cc: "arch" , "Alon Bar-Lev" , "Barak Azulay" , "Simon Grinberg"
> Sent: Sunday, May 12, 2013 9:59:07 AM
> Subject: Re: feature suggestion: initial generation of management network
>
> Thread Summary -
>
> 1. We all agree the automatic reboot after host installation is not
> needed anymore and can be removed.
>
> 2. There is broad agreement that we need to add a new VDSM verb for
> reboot.

I disagree with the above, in addition to the fact that it will not work
when VDSM is not responsive (when this action will be needed the most).

>
> 3. There was a suggestion to add a checkbox when adding a host to reboot
> the host after installation; the default would be not to reboot (leaving
> the option to reboot to the administrator).
>
> If there is no objection we'll go with the above.
>
> Thanks, Livnat
>
> On 05/07/2013 02:22 PM, Moti Asayag wrote:
> [...]
From alonbl at redhat.com  Sun May 12 08:25:45 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Sun, 12 May 2013 04:25:45 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <557908336.127528.1368346520205.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
	<518F3DBB.6080606@redhat.com>
	<557908336.127528.1368346520205.JavaMail.root@redhat.com>
Message-ID: <1435740579.270682.1368347145591.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Barak Azulay"
> To: "Livnat Peer"
> Cc: "Alon Bar-Lev" , "arch" , "Simon Grinberg"
> Sent: Sunday, May 12, 2013 11:15:20 AM
> Subject: Re: feature suggestion: initial generation of management network
>
> > Thread Summary -
> >
> > 1. We all agree the automatic reboot after host installation is not
> > needed anymore and can be removed.
> >
> > 2. There is broad agreement that we need to add a new VDSM verb for
> > reboot.
>
> I disagree with the above, in addition to the fact that it will not work
> when VDSM is not responsive (when this action will be needed the most).

If vdsm is unresponsive because of a fault in vdsm, we can add a fail-safe
mechanism for critical commands within vdsm. And we can always fall back to
the standard fencing in such cases.

Can you please describe a scenario in which host-deploy succeeds and vdsm
is unresponsive?

Current sequence:
1. host-deploy + reboot - all via a single ssh session.

New sequence:
1. host-deploy - via ssh.
2. network setup - via vdsm.
3. optional reboot - via vdsm.

In the new sequence, vdsm must be responsive to accomplish (2), and if (2)
succeeds then vdsm, again, must be responsive.

Thanks!

> [...]
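
For the optional reboot in (3), something as small as this would do on the
engine side (a sketch only: rebootNode does not exist today, and fence_host
stands in for the engine's existing fencing flow):

    import xmlrpclib

    def reboot_host(host, fence_host):
        vdsm = xmlrpclib.ServerProxy('https://%s:54321' % host)
        try:
            res = vdsm.rebootNode()  # the proposed verb
            if res['status']['code'] != 0:
                raise RuntimeError(res['status']['message'])
        except Exception:
            # vdsm unresponsive or the verb failed: fall back to the
            # standard fencing path, as with any non-responsive host.
            fence_host(host)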
From masayag at redhat.com  Sun May 12 08:37:01 2013
From: masayag at redhat.com (Moti Asayag)
Date: Sun, 12 May 2013 04:37:01 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1435740579.270682.1368347145591.JavaMail.root@redhat.com>
References: <20121227121406.GD8915@redhat.com>
	<11330867.78.1356611742783.JavaMail.javamailuser@localhost>
	<20130101124757.GI7274@redhat.com>
	<1264143158.8051712.1367925739116.JavaMail.root@redhat.com>
	<518F3DBB.6080606@redhat.com>
	<557908336.127528.1368346520205.JavaMail.root@redhat.com>
	<1435740579.270682.1368347145591.JavaMail.root@redhat.com>
Message-ID: <35164694.135768.1368347821291.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Barak Azulay"
> Cc: "arch" , "Simon Grinberg"
> Sent: Sunday, May 12, 2013 11:25:45 AM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
>
> If vdsm is unresponsive because of a fault in vdsm, we can add a fail-safe
> mechanism for critical commands within vdsm. And we can always fall back
> to the standard fencing in such cases.
>
> Can you please describe a scenario in which host-deploy succeeds and vdsm
> is unresponsive?
>
> Current sequence:
> 1. host-deploy + reboot - all via a single ssh session.
>
> New sequence:
> 1. host-deploy - via ssh.
> 2. network setup - via vdsm.

I'd like to add that if step 2 fails, VDSM should roll back to the last
known network configuration; therefore it shouldn't remain non-responsive
in case the setup network command causes a loss of communication.

> 3. optional reboot - via vdsm.
>
> In the new sequence, vdsm must be responsive to accomplish (2), and if (2)
> succeeds then vdsm, again, must be responsive.
>
> Thanks!
>
> [...]
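
To illustrate the rollback I have in mind (purely a sketch, not vdsm's
actual implementation; the three callables are placeholders):

    import time

    def setup_networks_with_rollback(apply_new_config, restore_last_good,
                                     engine_reconnected, timeout=120):
        # Apply the candidate configuration, then wait for proof of
        # engine connectivity; if none arrives in time, revert so the
        # host never stays unreachable.
        apply_new_config()
        deadline = time.time() + timeout
        while time.time() < deadline:
            if engine_reconnected():
                return True  # engine will persist the config afterwards
            time.sleep(2)
        restore_last_good()
        return False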
> > > >> > > > > _______________________________________________ > > > > Arch mailing list > > > > Arch at ovirt.org > > > > http://lists.ovirt.org/mailman/listinfo/arch > > > > > > > > > > > > > > > > _______________________________________________ > > Arch mailing list > > Arch at ovirt.org > > http://lists.ovirt.org/mailman/listinfo/arch > > > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > From lpeer at redhat.com Sun May 12 08:46:06 2013 From: lpeer at redhat.com (Livnat Peer) Date: Sun, 12 May 2013 11:46:06 +0300 Subject: feature suggestion: initial generation of management network In-Reply-To: <557908336.127528.1368346520205.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <518F3DBB.6080606@redhat.com> <557908336.127528.1368346520205.JavaMail.root@redhat.com> Message-ID: <518F56CE.3010802@redhat.com> On 05/12/2013 11:15 AM, Barak Azulay wrote: > > > ----- Original Message ----- >> From: "Livnat Peer" >> To: "Moti Asayag" >> Cc: "arch" , "Alon Bar-Lev" , "Barak Azulay" , "Simon >> Grinberg" >> Sent: Sunday, May 12, 2013 9:59:07 AM >> Subject: Re: feature suggestion: initial generation of management network >> >> Thread Summary - >> >> 1. We all agree the automatic reboot after host installation is not >> needed anymore and can be removed. >> >> 2. There is a vast agreement that we need to add a new VDSM verb for reboot. > > I disagree with the above > > In addition to the fact that it will not work when VDSM is not responsive (when this action will be needed the most) > you can fence the node if VDSM is non responsive, that's the mechanism we use today to deal with such cases. > >> >> 3. There was a suggestion to add a checkbox when adding a host to reboot >> the host after installation, default would be not to reboot. (leaving >> the option to reboot to the administrator). >> >> >> If there is no objection we'll go with the above. >> >> Thanks, Livnat >> >> >> On 05/07/2013 02:22 PM, Moti Asayag wrote: >>> I stumbled upon few issues with the current design while implementing it: >>> >>> There seems to be a requirement to reboot the host after the installation >>> is completed in order to assure the host is recoverable. >>> >>> Therefore, the building blocks of the installation process of 3.3 are: >>> 1. host deploy which installs the host expect configuring its management >>> network. >>> 2. SetupNetwork (and CommitNetworkChanges) - for creating the management >>> network >>> on the host and persisting the network configuration. >>> 3. Reboot the host - This is a missing piece. (engine has FenceVds command, >>> but it >>> requires the power management to be configured prior to the installation >>> and might >>> be irrelevant for hosts without PM.) >>> >>> So, there are couple of issues here: >>> 1. How to reboot the host? >>> 1.1. By exposing new RebootNode verb in VDSM and invoking it from the >>> engine >>> 1.2. By opening ssh dialog to the host in order to execute the reboot >>> >>> 2. When to perform the reboot? >>> 2.1. After host deploy, by utilizing the host deploy to perform the reboot. >>> It requires to configure the network by the monitor when the host is >>> detected by the engine, >>> detached from the installation flow. 
However it is a step toward the >>> non-persistent network feature >>> yet to be defined. >>> 2.2. After setupNetwork is done and network was configured and persisted on >>> the host. >>> There is no special advantage from recoverable aspect, as setupNetwork is >>> constantly >>> used to persist the network configuration (by the complementary >>> CommitNetworkChanges command). >>> In case and network configuration fails, VDSM will revert to the last well >>> known configuration >>> - so connectivity with engine should be restored. Design wise, it fits to >>> configure the management >>> network as part of the installation sequence. >>> If the network configuration fails in this context, the host status will be >>> set to "InstallFailed" rather than "NonOperational", >>> as might occur as a result of a failed setupNetwork command. >>> >>> >>> Your inputs are welcome. >>> >>> Thanks, >>> Moti >>> ----- Original Message ----- >>>> From: "Dan Kenigsberg" >>>> To: "Simon Grinberg" , "Moti Asayag" >>>> >>>> Cc: "arch" >>>> Sent: Tuesday, January 1, 2013 2:47:57 PM >>>> Subject: Re: feature suggestion: initial generation of management network >>>> >>>> On Thu, Dec 27, 2012 at 07:36:40AM -0500, Simon Grinberg wrote: >>>>> >>>>> >>>>> ----- Original Message ----- >>>>>> From: "Dan Kenigsberg" >>>>>> To: "Simon Grinberg" >>>>>> Cc: "arch" >>>>>> Sent: Thursday, December 27, 2012 2:14:06 PM >>>>>> Subject: Re: feature suggestion: initial generation of management >>>>>> network >>>>>> >>>>>> On Tue, Dec 25, 2012 at 09:29:26AM -0500, Simon Grinberg wrote: >>>>>>> >>>>>>> >>>>>>> ----- Original Message ----- >>>>>>>> From: "Dan Kenigsberg" >>>>>>>> To: "arch" >>>>>>>> Sent: Tuesday, December 25, 2012 2:27:22 PM >>>>>>>> Subject: feature suggestion: initial generation of management >>>>>>>> network >>>>>>>> >>>>>>>> Current condition: >>>>>>>> ================== >>>>>>>> The management network, named ovirtmgmt, is created during host >>>>>>>> bootstrap. It consists of a bridge device, connected to the >>>>>>>> network >>>>>>>> device that was used to communicate with Engine (nic, bonding or >>>>>>>> vlan). >>>>>>>> It inherits its ip settings from the latter device. >>>>>>>> >>>>>>>> Why Is the Management Network Needed? >>>>>>>> ===================================== >>>>>>>> Understandably, some may ask why do we need to have a management >>>>>>>> network - why having a host with IPv4 configured on it is not >>>>>>>> enough. >>>>>>>> The answer is twofold: >>>>>>>> 1. In oVirt, a network is an abstraction of the resources >>>>>>>> required >>>>>>>> for >>>>>>>> connectivity of a host for a specific usage. This is true for >>>>>>>> the >>>>>>>> management network just as it is for VM network or a display >>>>>>>> network. >>>>>>>> The network entity is the key for adding/changing nics and IP >>>>>>>> address. >>>>>>>> 2. In many occasions (such as small setups) the management >>>>>>>> network is >>>>>>>> used as a VM/display network as well. >>>>>>>> >>>>>>>> Problems in current connectivity: >>>>>>>> ================================ >>>>>>>> According to alonbl of ovirt-host-deploy fame, and with no >>>>>>>> conflict >>>>>>>> to >>>>>>>> my own experience, creating the management network is the most >>>>>>>> fragile, >>>>>>>> error-prone step of bootstrap. 
>>>>>>> >>>>>>> +1, >>>>>>> I've raise that repeatedly in the past, bootstrap should not create >>>>>>> the management network but pick up the existing configuration and >>>>>>> let the engine override later with it's own configuration if it >>>>>>> differs , I'm glad that we finally get to that. >>>>>>> >>>>>>>> >>>>>>>> Currently it always creates a bridged network (even if the DC >>>>>>>> requires a >>>>>>>> non-bridged ovirtmgmt), it knows nothing about the defined MTU >>>>>>>> for >>>>>>>> ovirtmgmt, it uses ping to guess on top of which device to build >>>>>>>> (and >>>>>>>> thus requires Vdsm-to-Engine reverse connectivity), and is the >>>>>>>> sole >>>>>>>> remaining user of the addNetwork/vdsm-store-net-conf scripts. >>>>>>>> >>>>>>>> Suggested feature: >>>>>>>> ================== >>>>>>>> Bootstrap would avoid creating a management network. Instead, >>>>>>>> after >>>>>>>> bootstrapping a host, Engine would send a getVdsCaps probe to the >>>>>>>> installed host, receiving a complete picture of the network >>>>>>>> configuration on the host. Among this picture is the device that >>>>>>>> holds >>>>>>>> the host's management IP address. >>>>>>>> >>>>>>>> Engine would send setupNetwork command to generate ovirtmgmt with >>>>>>>> details devised from this picture, and according to the DC >>>>>>>> definition >>>>>>>> of >>>>>>>> ovirtmgmt. For example, if Vdsm reports: >>>>>>>> >>>>>>>> - vlan bond4.3000 has the host's IP, configured to use dhcp. >>>>>>>> - bond4 is comprises eth2 and eth3 >>>>>>>> - ovirtmgmt is defined as a VM network with MTU 9000 >>>>>>>> >>>>>>>> then Engine sends the likes of: >>>>>>>> setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4, >>>>>>>> bonding=bond4: {eth2,eth3}, MTU=9000) >>>>>>> >>>>>>> Just one comment here, >>>>>>> In order to save time and confusion - if the ovirtmgmt is defined >>>>>>> with default values meaning the user did not bother to touch it, >>>>>>> let it pick up the VLAN configuration from the first host added in >>>>>>> the Data Center. >>>>>>> >>>>>>> Otherwise, you may override the host VLAN and loose connectivity. >>>>>>> >>>>>>> This will also solve the situation many users encounter today. >>>>>>> 1. The engine in on a host that actually has VLAN defined >>>>>>> 2. The ovirtmgmt network was not updated in the DC >>>>>>> 3. A host, with VLAN already defined is added - everything works >>>>>>> fine >>>>>>> 4. Any number of hosts are now added, again everything seems to >>>>>>> work fine. >>>>>>> >>>>>>> But, now try to use setupNetworks, and you'll find out that you >>>>>>> can't do much on the interface that contains the ovirtmgmt since >>>>>>> the definition does not match. You can't sync (Since this will >>>>>>> remove the VLAN and cause connectivity lose) you can't add more >>>>>>> networks on top since it already has non-VLAN network on top >>>>>>> according to the DC definition, etc. >>>>>>> >>>>>>> On the other hand you can't update the ovirtmgmt definition on the >>>>>>> DC since there are clusters in the DC that use the network. >>>>>>> >>>>>>> The only workaround not involving DB hack to change the VLAN on the >>>>>>> network is to: >>>>>>> 1. Create new DC >>>>>>> 2. Do not use the wizard that pops up to create your cluster. >>>>>>> 3. Modify the ovirtmgmt network to have VLANs >>>>>>> 4. Now create a cluster and add your hosts. >>>>>>> >>>>>>> If you insist on using the default DC and cluster then before >>>>>>> adding the first host, create an additional DC and move the >>>>>>> Default cluster over there. 
You may then change the network on the >>>>>>> Default cluster and then move the Default cluster back. >>>>>>> >>>>>>> Both are ugly, and should be solved by the proposal above. >>>>>>> >>>>>>> We do something similar for the Default cluster CPU level, where we >>>>>>> set the initial level based on the first host added to the cluster. >>>>>> >>>>>> I'm not sure what Engine has for Default cluster CPU level. But I >>>>>> have >>>>>> reservations about the hysteresis in your proposal - after a host is >>>>>> added, >>>>>> the DC cannot forget ovirtmgmt's vlan. >>>>>> >>>>>> How about letting the admin edit ovirtmgmt's vlan at the DC level, >>>>>> thus >>>>>> rendering all hosts out-of-sync. Then the admin could manually, or >>>>>> through a script, or in the future through a distributed operation, >>>>>> sync >>>>>> all the hosts to the definition? >>>>> >>>>> Usually if you do that you will lose connectivity to the hosts. >>>> >>>> Yes, changing the management vlan id (or ip address) is never fun, and >>>> requires out-of-band intervention. >>>> >>>>> I'm not insisting on the automatic adjustment of the ovirtmgmt network to >>>>> match the hosts' (that is just a nice touch), we can take the allow-edit >>>>> approach. >>>>> >>>>> But allowing a VLAN change on the ovirtmgmt network will indeed solve the >>>>> issue I'm trying to solve, while creating another issue of users expecting >>>>> that we'll be able to re-tag the host from the engine side, which is >>>>> challenging to do. >>>>> >>>>> On the other hand, if we allow changing the VLAN as long as the change >>>>> matches the hosts' configuration, it will solve the issue while not >>>>> deluding the user into thinking that we really can solve the chicken-and-egg >>>>> issue of re-tagging the entire system. >>>>> >>>>> Now with the above ability you do get a flow to do the re-tag. >>>>> 1. Place all the hosts in maintenance >>>>> 2. Re-tag the ovirtmgmt on all the hosts >>>>> 3. Re-tag the host on which the engine is >>>>> 4. Activate the hosts - this should work well now since connectivity >>>>> exists >>>>> 5. Change the tag on ovirtmgmt on the engine to match the hosts' >>>>> >>>>> Simple and clear process. >>>>> >>>>> When the workaround of creating another DC was not possible, since the >>>>> system was already long in use and the need was to re-tag the network, the >>>>> above is what I've recommended, except that steps 4-5 were done >>>>> as: >>>>> 4. Stop the engine >>>>> 5. Change the tag in the DB >>>>> 6. Start the engine >>>>> 7. Activate the hosts >>>> >>>> Sounds reasonable to me - but as far as I am aware this is not tightly >>>> related to the $Subject, which is the post-boot ovirtmgmt definition. >>>> >>>> I've added a few details to >>>> http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization#Engine >>>> and I would appreciate a review from someone with intimate Engine >>>> know-how. >>>> >>>> Dan.
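For illustration only, the probe-and-configure flow proposed above might look roughly like this from the engine side. This is a sketch, not Engine code: the real engine is Java and talks SSL, the host address is made up, and the capability keys and network attribute spellings are assumptions based on the example quoted above.

    import xmlrpclib

    # Sketch only: assumes vdsm's xmlrpc binding on its usual port 54321;
    # plain http for brevity (production uses SSL with the vdsm certificate).
    vdsm = xmlrpclib.ServerProxy('http://host.example.com:54321')

    # 1. Probe the freshly deployed host; the reply describes nics, bondings
    #    and vlans - among them the device holding the management IP.
    caps = vdsm.getVdsCapabilities()['info']
    print caps['vlans'], caps['bondings']

    # 2. Build ovirtmgmt from that picture plus the DC definition,
    #    mirroring the example above (bond4.3000, dhcp, MTU 9000).
    networks = {'ovirtmgmt': {'bonding': 'bond4',
                              'vlan': '3000',
                              'bridged': True,
                              'mtu': '9000',
                              'bootproto': 'dhcp'}}
    bondings = {'bond4': {'nics': ['eth2', 'eth3']}}
    # Ask vdsm to roll back if the new configuration breaks connectivity.
    options = {'connectivityCheck': True, 'connectivityTimeout': 60}
    print vdsm.setupNetworks(networks, bondings, options)

    # 3. Persist the configuration once the engine regains connectivity
    #    (this is what the CommitNetworkChanges step maps to).
    print vdsm.setSafeNetworkConfig()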
>>>> >>> _______________________________________________ >>> Arch mailing list >>> Arch at ovirt.org >>> http://lists.ovirt.org/mailman/listinfo/arch >>> >>> >> >> > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > > From lpeer at redhat.com Sun May 12 08:46:46 2013 From: lpeer at redhat.com (Livnat Peer) Date: Sun, 12 May 2013 11:46:46 +0300 Subject: feature suggestion: initial generation of management network In-Reply-To: <1435740579.270682.1368347145591.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <518F3DBB.6080606@redhat.com> <557908336.127528.1368346520205.JavaMail.root@redhat.com> <1435740579.270682.1368347145591.JavaMail.root@redhat.com> Message-ID: <518F56F6.4080405@redhat.com> On 05/12/2013 11:25 AM, Alon Bar-Lev wrote: > > > ----- Original Message ----- >> From: "Barak Azulay" >> To: "Livnat Peer" >> Cc: "Alon Bar-Lev" , "arch" , "Simon Grinberg" >> Sent: Sunday, May 12, 2013 11:15:20 AM >> Subject: Re: feature suggestion: initial generation of management network >> >> >> >> ----- Original Message ----- >>> From: "Livnat Peer" >>> To: "Moti Asayag" >>> Cc: "arch" , "Alon Bar-Lev" , "Barak >>> Azulay" , "Simon >>> Grinberg" >>> Sent: Sunday, May 12, 2013 9:59:07 AM >>> Subject: Re: feature suggestion: initial generation of management network >>> >>> Thread Summary - >>> >>> 1. We all agree the automatic reboot after host installation is not >>> needed anymore and can be removed. >>> >>> 2. There is a vast agreement that we need to add a new VDSM verb for >>> reboot. >> >> I disagree with the above >> >> In addition to the fact that it will not work when VDSM is not responsive >> (when this action will be needed the most) > > If vdsm is unresponsive because of a fault in vdsm we can add a fail-safe mechanism for critical commands within vdsm. > And we can always fall back to the standard fencing in such cases. > > Can you please describe the scenario in which host-deploy succeeds and vdsm is unresponsive? > > Current sequence: > 1. host-deploy + reboot - all via a single ssh session. > > New sequence: > 1. host-deploy - via ssh. > 2. network setup - via vdsm. > 3. optional reboot - via vdsm. > > In the new sequence, vdsm must be responsive to accomplish (2), and if (2) succeeds vdsm, again, must be responsive. > +1, fully agree with the above. > Thanks! > >> >> >>> >>> 3. There was a suggestion to add a checkbox when adding a host to reboot >>> the host after installation, default would be not to reboot. (leaving >>> the option to reboot to the administrator). >>> >>> >>> If there is no objection we'll go with the above. >>> >>> Thanks, Livnat >>> >>> >>> On 05/07/2013 02:22 PM, Moti Asayag wrote: >>>> I stumbled upon a few issues with the current design while implementing it: >>>> >>>> There seems to be a requirement to reboot the host after the installation >>>> is completed in order to ensure the host is recoverable. >>>> >>>> Therefore, the building blocks of the installation process of 3.3 are: >>>> 1. host deploy which installs the host except configuring its management >>>> network. >>>> 2. SetupNetwork (and CommitNetworkChanges) - for creating the management >>>> network >>>> on the host and persisting the network configuration. >>>> 3. Reboot the host - This is a missing piece.
(engine has FenceVds >>>> command, >>>> but it >>>> requires the power management to be configured prior to the installation >>>> and might >>>> be irrelevant for hosts without PM.) >>>> >>>> So, there are a couple of issues here: >>>> 1. How to reboot the host? >>>> 1.1. By exposing a new RebootNode verb in VDSM and invoking it from the >>>> engine >>>> 1.2. By opening an ssh dialog to the host in order to execute the reboot >>>> >>>> 2. When to perform the reboot? >>>> 2.1. After host deploy, by utilizing the host deploy to perform the >>>> reboot. >>>> It requires the monitor to configure the network when the host is >>>> detected by the engine, >>>> detached from the installation flow. However it is a step toward the >>>> non-persistent network feature >>>> yet to be defined. >>>> 2.2. After setupNetwork is done and network was configured and persisted >>>> on >>>> the host. >>>> There is no special advantage from the recoverability aspect, as setupNetwork is >>>> constantly >>>> used to persist the network configuration (by the complementary >>>> CommitNetworkChanges command). >>>> In case the network configuration fails, VDSM will revert to the last >>>> well >>>> known configuration >>>> - so connectivity with engine should be restored. Design-wise, it fits to >>>> configure the management >>>> network as part of the installation sequence. >>>> If the network configuration fails in this context, the host status will >>>> be >>>> set to "InstallFailed" rather than "NonOperational", >>>> as might occur as a result of a failed setupNetwork command. >>>> >>>> >>>> Your inputs are welcome. >>>> >>>> Thanks, >>>> Moti >>>> ----- Original Message ----- >>>>> From: "Dan Kenigsberg" >>>>> To: "Simon Grinberg" , "Moti Asayag" >>>>> >>>>> Cc: "arch" >>>>> Sent: Tuesday, January 1, 2013 2:47:57 PM >>>>> Subject: Re: feature suggestion: initial generation of management >>>>> network
>>>>> [...]
>>>>> >>>>> I've added a few details to >>>>> http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization#Engine >>>>> and I would appreciate a review from someone with intimate Engine >>>>> know-how. >>>>> >>>>> Dan. >>>>> >>>> _______________________________________________ >>>> Arch mailing list >>>> Arch at ovirt.org >>>> http://lists.ovirt.org/mailman/listinfo/arch >>>> >>>> >>> >>> >> _______________________________________________ >> Arch mailing list >> Arch at ovirt.org >> http://lists.ovirt.org/mailman/listinfo/arch >> From alonbl at redhat.com Sun May 12 11:52:51 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Sun, 12 May 2013 07:52:51 -0400 (EDT) Subject: [ANN] New development environment for ovirt-engine In-Reply-To: <1534961780.272884.1368354892411.JavaMail.root@redhat.com> Message-ID: <9423451.273926.1368359571790.JavaMail.root@redhat.com> Hello all ovirt-engine developers, When I first joined the ovirt project, it took me about two weeks to set up a development environment. I needed to work on a bug related to host-deploy, so I needed an environment that could use the ssh, PKI, vdsm-bootstrap and communicate with vdsm using SSL; this was virtually impossible to do without tweaking the product in a way that is so different from production use that I cannot guarantee that whatever is tested in development will actually work in production. I peeked at the installation script in the hope that I could create a partial environment similar to production, but I found that the packaging implementation makes too many assumptions and is very difficult to adapt. The fact that I do not use fedora/rhel for my development made it even worse. I had no other option than to create rpms after each of my changes and test each in a real production-like setup. It was obvious to me that the manual customization done by developers to achieve a working product will eventually break as the product grows and moves away from being developer-friendly to production-friendly. For example, product defaults cannot be those which serve developers, but those which serve production best, and having a valid PKI setup cannot be optional any more as components do need to use it. Same for location of files and configuration; for example, if we write a pluggable infrastructure for branding, we cannot damage the interface just because developers run the product in their own manual customization. I took the opportunity handed to me to port the ovirt-engine to other distributions in order to provide a development environment that is similar to a production setup. Together with Sandro Bonazzola and Alex Lourie we re-wrote the whole installation of the product, which can also be used to set up the desired development environment. Within this environment the product is set up using the same tools and configuration as in production, while the process does not require special privileges nor changes the state of the developer machine. Complete documentation is available[1]; I preferred to use a README within the source tree, as wikis tend to become obsolete quickly, while documentation within the source tree can be modified by the commit that introduces a change. I will redirect to this file from the current wiki once the site is up.
In a nutshell, after installing prerequisites, build and install the product using: $ make clean install-dev PREFIX=$HOME/ovirt-engine This will run maven and create a product installation at $HOME/ovirt-engine Next, a setup phase is required just like in production, to initialize configuration and database: $ $HOME/ovirt-engine/bin/engine-setup-2 You now have a fully functional product, including PKI, SSL, host-deploy, tools. No manual database updates are required, no loss of functionality. All that is left is to start the engine service: $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start Access the application at: http://localhost:8080 https://localhost:8443 A debugging port is open at port 8787. Further information exists in the documentation[1]. There are several inherent benefits of the new environment; the major one is the ability to manage several environments in parallel on the same host. For example, if we develop two separate features on two branches we can install the product into $HOME/ovirt-engine-feature1 and $HOME/ovirt-engine-feature2 and have a separate database for each; if we modify the ports jboss is listening to we can run two instances of engine at the same time! We will be happy to work with all developers to assist in porting into the new development environment; the simplest is to create a new database for this effort. Moti has a sequence of converting the existing database owned by postgres to be owned by the engine. Moti, can you please share that? We are sure there are missing bits; we will be happy to know of these so we can improve. I am aware that developers (especially java) are conservative, but I ask you to give us a chance, so that we make it easy for developers to join the project, and to allow us to drop the parallel effort of packaging to production and fixing the broken development environment. A special thanks to developers who took the time to test and provide feedback before the merge: - Yaniv Bronheim - Moti Asayag - Limor Gavish - Sharad Mishra - Ofer Schreiber We are hoping that after migration you will find this environment useful and friendly, Sandro Bonazzola, Alex Lourie, Alon Bar-Lev. [1] http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD From danken at redhat.com Sun May 12 12:39:54 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Sun, 12 May 2013 15:39:54 +0300 Subject: feature suggestion: initial generation of management network In-Reply-To: <941280855.5773599.1368111436878.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> Message-ID: <20130512123954.GF26216@redhat.com> On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: > > > > The risk of a deployed system (without bridge) being unresponsive after > > reboot is minimal. > > > > 1. iptables rules are already active. > > 2. udev rules are active. > > 3. vdsm is up.
> > > > The major risk is basically if some dependency package was UPDATED during the > > installation of vdsm, while its service/consumer is already running; then we > > are running a host with the old software and there is a chance that after > > reboot with the new software the host will fail. > > > > I think that the decision to reboot the host should be delegated to the > > administrator; adding a vdsm verb to reboot is useful. This way the admin will be > > able to take a host to maintenance mode and reboot, and we can add a checkbox > > to the add host dialog '[x] reboot', rebooting the host at the end of the > > sequence. I think the default should be off. > > I'm also in agreement with the addition of a reboot verb. It could be a nice > addition regardless of this specific use case. A "reboot" verb is nice, but I am not yet sure that it is actually needed. Above, Alon gives one argument for it - to make sure that vdsm (and its dependencies, and other updated packages) works smoothly after boot. That's a good argument - but it may be achieved by a post-deploy boot as done today - without an additional frighteningly-named verb. Note that vdsm, or any other package, may be upgraded by yum asynchronously to Engine's operation, so we may face a surprise cannot-start-after-boot later in the host life cycle. Not only post-install. As I said in my first comment to this thread - I do not think that reboot-after-install is desperately needed, and find that it does not deserve the Engine-side complexity of calling a new verb. Dan. From alonbl at redhat.com Sun May 12 12:58:53 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Sun, 12 May 2013 08:58:53 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <20130512123954.GF26216@redhat.com> References: <20121227121406.GD8915@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> <20130512123954.GF26216@redhat.com> Message-ID: <2047117021.311691.1368363533872.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Dan Kenigsberg" > To: "Antoni Segura Puimedon" > Cc: "Alon Bar-Lev" , "arch" > Sent: Sunday, May 12, 2013 3:39:54 PM > Subject: Re: feature suggestion: initial generation of management network > > On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: > > > > > > The risk of a deployed system (without bridge) to be unresponsive after > > > reboot is minimum. > > > > > > 1. iptables rules are already active. > > > 2. udev rules are active. > > > 3. vdsm is up.
> > > > I'm also in agreement with the addition of a reboot verb. It could be a > > nice > > addition regardless of this specific use case. > > A "reboot" verb is nice, but I am not yet sure that it is actually > needed. Above, Alon give one argument for it - to make sure that vdsm > (and its dependencies, and other updated packages) works smoothely after > boot. That's a good argument - but it may be acheived by post-deploy > boot as done today - without an additional frighteningly-named verb. > > Note that vdsm, or any other package, may be upgraded by yum > asynchronous to Engine's operation, so we may face a surprise > cannot-start-after-boot later in the host life cycle. Not only > post-install. > > As I said in my first comment to this thread - I do not think that > reboot-after-install is desperately needed, and find that it does not > deserve the Engine-side complexity of calling a new verb. > > Dan. What we are trying to say, product wise, is that the requirement to remotely reboot a host (cooperate reboot) may be available regardless of the host-deploy sequence. Administrator may decide to reboot a host right after host-deploy or once a week. Adding the ability to perform reboot is different independent discussion. The only reason we discuss it here is because we currently force reboot after host-deploy (although in the API it is optional). Having the bridge created by the engine is, in my opinion, far more important than keeping the reboot feature. We can discuss if remote reboot feature should and to which version, regardless. Regards, Alon From bazulay at redhat.com Sun May 12 18:27:01 2013 From: bazulay at redhat.com (Barak Azulay) Date: Sun, 12 May 2013 14:27:01 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <2047117021.311691.1368363533872.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> <20130512123954.GF26216@redhat.com> <2047117021.311691.1368363533872.JavaMail.root@redhat.com> Message-ID: <1837929436.302264.1368383221970.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Alon Bar-Lev" > To: "Dan Kenigsberg" > Cc: "arch" > Sent: Sunday, May 12, 2013 3:58:53 PM > Subject: Re: feature suggestion: initial generation of management network > > > > ----- Original Message ----- > > From: "Dan Kenigsberg" > > To: "Antoni Segura Puimedon" > > Cc: "Alon Bar-Lev" , "arch" > > Sent: Sunday, May 12, 2013 3:39:54 PM > > Subject: Re: feature suggestion: initial generation of management network > > > > On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: > > > > > > > > The risk of a deployed system (without bridge) to be unresponsive after > > > > reboot is minimum. > > > > > > > > 1. iptables rules are already active. > > > > 2. udev rules are active. > > > > 3. vdsm is up. > > > > > > > > The major risk is basically if some dependency package was UPDATED > > > > during > > > > the > > > > installation of vdsm, while its service/consumer is already running, > > > > then > > > > we > > > > are running a host with the old software and there is a chance that > > > > after > > > > reboot with the new software the host will fail. 
> > > > > > > > I think that the decision to reboot host should be delegated to > > > > administrator, adding vdsm verb to reboot is usable. This way admin > > > > will > > > > be > > > > able to take a host to maintenance mode and reboot, and we can add > > > > checkbox > > > > to the add host dialog '[x] reboot', rebooting the host at the end of > > > > the > > > > sequence. I think the default should be off. > > > > > > I'm also in agreement with the addition of a reboot verb. It could be a > > > nice > > > addition regardless of this specific use case. > > > > A "reboot" verb is nice, but I am not yet sure that it is actually > > needed. Above, Alon give one argument for it - to make sure that vdsm > > (and its dependencies, and other updated packages) works smoothely after > > boot. That's a good argument - but it may be acheived by post-deploy > > boot as done today - without an additional frighteningly-named verb. > > > > Note that vdsm, or any other package, may be upgraded by yum > > asynchronous to Engine's operation, so we may face a surprise > > cannot-start-after-boot later in the host life cycle. Not only > > post-install. > > > > As I said in my first comment to this thread - I do not think that > > reboot-after-install is desperately needed, and find that it does not > > deserve the Engine-side complexity of calling a new verb. > > > > Dan. > > What we are trying to say, product wise, is that the requirement to remotely > reboot a host (cooperate reboot) may be available regardless of the > host-deploy sequence. Administrator may decide to reboot a host right after > host-deploy or once a week. > > Adding the ability to perform reboot is different independent discussion. > > The only reason we discuss it here is because we currently force reboot after > host-deploy (although in the API it is optional). > > Having the bridge created by the engine is, in my opinion, far more important > than keeping the reboot feature. We can discuss if remote reboot feature > should and to which version, regardless. 
> > Regards, > Alon > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > > > From bazulay at redhat.com Sun May 12 18:40:41 2013 From: bazulay at redhat.com (Barak Azulay) Date: Sun, 12 May 2013 14:40:41 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <2047117021.311691.1368363533872.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <497587874.11990711.1367932265918.JavaMail.root@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> <20130512123954.GF26216@redhat.com> <2047117021.311691.1368363533872.JavaMail.root@redhat.com> Message-ID: <239113646.302830.1368384041645.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Alon Bar-Lev" > To: "Dan Kenigsberg" > Cc: "arch" > Sent: Sunday, May 12, 2013 3:58:53 PM > Subject: Re: feature suggestion: initial generation of management network > > > > ----- Original Message ----- > > From: "Dan Kenigsberg" > > To: "Antoni Segura Puimedon" > > Cc: "Alon Bar-Lev" , "arch" > > Sent: Sunday, May 12, 2013 3:39:54 PM > > Subject: Re: feature suggestion: initial generation of management network > > > > On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: > > > > > > > > The risk of a deployed system (without bridge) to be unresponsive after > > > > reboot is minimum. > > > > > > > > 1. iptables rules are already active. > > > > 2. udev rules are active. > > > > 3. vdsm is up. > > > > > > > > The major risk is basically if some dependency package was UPDATED > > > > during > > > > the > > > > installation of vdsm, while its service/consumer is already running, > > > > then > > > > we > > > > are running a host with the old software and there is a chance that > > > > after > > > > reboot with the new software the host will fail. > > > > > > > > I think that the decision to reboot host should be delegated to > > > > administrator, adding vdsm verb to reboot is usable. This way admin > > > > will > > > > be > > > > able to take a host to maintenance mode and reboot, and we can add > > > > checkbox > > > > to the add host dialog '[x] reboot', rebooting the host at the end of > > > > the > > > > sequence. I think the default should be off. > > > > > > I'm also in agreement with the addition of a reboot verb. It could be a > > > nice > > > addition regardless of this specific use case. > > > > A "reboot" verb is nice, but I am not yet sure that it is actually > > needed. Above, Alon give one argument for it - to make sure that vdsm > > (and its dependencies, and other updated packages) works smoothely after > > boot. That's a good argument - but it may be acheived by post-deploy > > boot as done today - without an additional frighteningly-named verb. > > > > Note that vdsm, or any other package, may be upgraded by yum > > asynchronous to Engine's operation, so we may face a surprise > > cannot-start-after-boot later in the host life cycle. Not only > > post-install. > > > > As I said in my first comment to this thread - I do not think that > > reboot-after-install is desperately needed, and find that it does not > > deserve the Engine-side complexity of calling a new verb. > > > > Dan. 
> > What we are trying to say, product wise, is that the requirement to remotely > reboot a host No it is not a requirement - it is here because in the past it proved to be a necessity (on the early days), i don't think it's a must today. > (cooperate reboot) may be available regardless of the > host-deploy sequence. Administrator may decide to reboot a host right after > host-deploy or once a week. Correct, and the right way to do it is first move to maintenance mode. > > Adding the ability to perform reboot is different independent discussion. > > The only reason we discuss it here is because we currently force reboot after > host-deploy (although in the API it is optional). > > Having the bridge created by the engine is, in my opinion, far more important > than keeping the reboot feature. We can discuss if remote reboot feature > should and to which version, regardless. I agree with you that the creation of the bridge engine is more important from the reboot, However when discussing new reboot API for VDSM, I prefer to not do reboot at all on host-deploy, and do the bridge config by engine. The reboot API suggested is a general purpose API which in this discussion is focused around a specific use case (host-deploy), If we had a way to enforce call for the reboot API only in the deployment scenario, I would have been o.k. with it, But the weird thing about APIs is that people end up using them ... and not always as we intended, and we might end up finding ourselves in tough situations due to a stray reboot call from X ??? This is the reason I have suggested reboot over SSH which is different. And I would argue that host deploy is here to stay hence the dependency in SSH is here to stay. Thanks Barak Azulay > > Regards, > Alon > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > > > From alonbl at redhat.com Sun May 12 18:59:22 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Sun, 12 May 2013 14:59:22 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <239113646.302830.1368384041645.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> <20130512123954.GF26216@redhat.com> <2047117021.311691.1368363533872.JavaMail.root@redhat.com> <239113646.302830.1368384041645.JavaMail.root@redhat.com> Message-ID: <1212694039.320712.1368385162654.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Barak Azulay" > To: "Alon Bar-Lev" > Cc: "Dan Kenigsberg" , "arch" > Sent: Sunday, May 12, 2013 9:40:41 PM > Subject: Re: feature suggestion: initial generation of management network > > > > ----- Original Message ----- > > From: "Alon Bar-Lev" > > To: "Dan Kenigsberg" > > Cc: "arch" > > Sent: Sunday, May 12, 2013 3:58:53 PM > > Subject: Re: feature suggestion: initial generation of management network > > > > > > > > ----- Original Message ----- > > > From: "Dan Kenigsberg" > > > To: "Antoni Segura Puimedon" > > > Cc: "Alon Bar-Lev" , "arch" > > > Sent: Sunday, May 12, 2013 3:39:54 PM > > > Subject: Re: feature suggestion: initial generation of management network > > > > > > On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: > > > > > > > > > > The risk of a deployed system (without bridge) to be unresponsive > > > > > 
after > > > > > reboot is minimum. > > > > > > > > > > 1. iptables rules are already active. > > > > > 2. udev rules are active. > > > > > 3. vdsm is up. > > > > > > > > > > The major risk is basically if some dependency package was UPDATED > > > > > during > > > > > the > > > > > installation of vdsm, while its service/consumer is already running, > > > > > then > > > > > we > > > > > are running a host with the old software and there is a chance that > > > > > after > > > > > reboot with the new software the host will fail. > > > > > > > > > > I think that the decision to reboot host should be delegated to > > > > > administrator, adding vdsm verb to reboot is usable. This way admin > > > > > will > > > > > be > > > > > able to take a host to maintenance mode and reboot, and we can add > > > > > checkbox > > > > > to the add host dialog '[x] reboot', rebooting the host at the end of > > > > > the > > > > > sequence. I think the default should be off. > > > > > > > > I'm also in agreement with the addition of a reboot verb. It could be a > > > > nice > > > > addition regardless of this specific use case. > > > > > > A "reboot" verb is nice, but I am not yet sure that it is actually > > > needed. Above, Alon give one argument for it - to make sure that vdsm > > > (and its dependencies, and other updated packages) works smoothely after > > > boot. That's a good argument - but it may be acheived by post-deploy > > > boot as done today - without an additional frighteningly-named verb. > > > > > > Note that vdsm, or any other package, may be upgraded by yum > > > asynchronous to Engine's operation, so we may face a surprise > > > cannot-start-after-boot later in the host life cycle. Not only > > > post-install. > > > > > > As I said in my first comment to this thread - I do not think that > > > reboot-after-install is desperately needed, and find that it does not > > > deserve the Engine-side complexity of calling a new verb. > > > > > > Dan. > > > > What we are trying to say, product wise, is that the requirement to > > remotely > > reboot a host > > No it is not a requirement - it is here because in the past it proved to be a > necessity (on the early days), > i don't think it's a must today. > > > (cooperate reboot) may be available regardless of the > > host-deploy sequence. Administrator may decide to reboot a host right after > > host-deploy or once a week. > > Correct, and the right way to do it is first move to maintenance mode. > > > > > Adding the ability to perform reboot is different independent discussion. > > > > The only reason we discuss it here is because we currently force reboot > > after > > host-deploy (although in the API it is optional). > > > > Having the bridge created by the engine is, in my opinion, far more > > important > > than keeping the reboot feature. We can discuss if remote reboot feature > > should and to which version, regardless. > > > I agree with you that the creation of the bridge engine is more important > from the reboot, > However when discussing new reboot API for VDSM, I prefer to not do reboot at > all on host-deploy, and do the bridge config by engine. > > The reboot API suggested is a general purpose API which in this discussion is > focused around a specific use case (host-deploy), > If we had a way to enforce call for the reboot API only in the deployment > scenario, I would have been o.k. with it, > But the weird thing about APIs is that people end up using them ... 
and not > always as we intended, > and we might end up finding ourselves in tough situations due to a stray > reboot call from X ??? I must reply that, cannot help it... :))) What if the admin just login to the server and just rebooted? What if there is power failure? What if there is provisioning infrastructure that manages the servers in parallel of ovirt and it decides to reboot? We should deal with that in any case... And doing this via VDSM has only advantages, as VDSM is aware of the reboot request and can take safety measures to complete it successfully without losing information nor state. > This is the reason I have suggested reboot over SSH which is different. Right, but we can do about 40% (I think closer to 80%) of VDSM functionality via SSH, right? > And I would argue that host deploy is here to stay hence the dependency in > SSH is here to stay. There are talks to move to standard provisioning framework such as puppet or even foreman... not sure what the future is. > > Thanks > Barak Azulay > > > > > > > > Regards, > > Alon > > _______________________________________________ > > Arch mailing list > > Arch at ovirt.org > > http://lists.ovirt.org/mailman/listinfo/arch > > > > > > > From bazulay at redhat.com Sun May 12 20:29:20 2013 From: bazulay at redhat.com (Barak Azulay) Date: Sun, 12 May 2013 16:29:20 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <1212694039.320712.1368385162654.JavaMail.root@redhat.com> References: <20121227121406.GD8915@redhat.com> <1596048609.8203468.1367933507824.JavaMail.root@redhat.com> <20130508133549.GA17279@redhat.com> <1556816649.5772576.1368111300185.JavaMail.root@redhat.com> <941280855.5773599.1368111436878.JavaMail.root@redhat.com> <20130512123954.GF26216@redhat.com> <2047117021.311691.1368363533872.JavaMail.root@redhat.com> <239113646.302830.1368384041645.JavaMail.root@redhat.com> <1212694039.320712.1368385162654.JavaMail.root@redhat.com> Message-ID: <8B96DE8C-7C04-4E76-91C1-C518A9C9857B@redhat.com> On May 12, 2013, at 21:59, Alon Bar-Lev wrote: > > > ----- Original Message ----- >> From: "Barak Azulay" >> To: "Alon Bar-Lev" >> Cc: "Dan Kenigsberg" , "arch" >> Sent: Sunday, May 12, 2013 9:40:41 PM >> Subject: Re: feature suggestion: initial generation of management network >> >> >> >> ----- Original Message ----- >>> From: "Alon Bar-Lev" >>> To: "Dan Kenigsberg" >>> Cc: "arch" >>> Sent: Sunday, May 12, 2013 3:58:53 PM >>> Subject: Re: feature suggestion: initial generation of management network >>> >>> >>> >>> ----- Original Message ----- >>>> From: "Dan Kenigsberg" >>>> To: "Antoni Segura Puimedon" >>>> Cc: "Alon Bar-Lev" , "arch" >>>> Sent: Sunday, May 12, 2013 3:39:54 PM >>>> Subject: Re: feature suggestion: initial generation of management network >>>> >>>> On Thu, May 09, 2013 at 10:57:16AM -0400, Antoni Segura Puimedon wrote: >>>>>> >>>>>> The risk of a deployed system (without bridge) to be unresponsive >>>>>> after >>>>>> reboot is minimum. >>>>>> >>>>>> 1. iptables rules are already active. >>>>>> 2. udev rules are active. >>>>>> 3. vdsm is up. >>>>>> >>>>>> The major risk is basically if some dependency package was UPDATED >>>>>> during >>>>>> the >>>>>> installation of vdsm, while its service/consumer is already running, >>>>>> then >>>>>> we >>>>>> are running a host with the old software and there is a chance that >>>>>> after >>>>>> reboot with the new software the host will fail. 
>>>>>> >>>>>> I think that the decision to reboot host should be delegated to >>>>>> administrator, adding vdsm verb to reboot is usable. This way admin >>>>>> will >>>>>> be >>>>>> able to take a host to maintenance mode and reboot, and we can add >>>>>> checkbox >>>>>> to the add host dialog '[x] reboot', rebooting the host at the end of >>>>>> the >>>>>> sequence. I think the default should be off. >>>>> >>>>> I'm also in agreement with the addition of a reboot verb. It could be a >>>>> nice >>>>> addition regardless of this specific use case. >>>> >>>> A "reboot" verb is nice, but I am not yet sure that it is actually >>>> needed. Above, Alon give one argument for it - to make sure that vdsm >>>> (and its dependencies, and other updated packages) works smoothely after >>>> boot. That's a good argument - but it may be acheived by post-deploy >>>> boot as done today - without an additional frighteningly-named verb. >>>> >>>> Note that vdsm, or any other package, may be upgraded by yum >>>> asynchronous to Engine's operation, so we may face a surprise >>>> cannot-start-after-boot later in the host life cycle. Not only >>>> post-install. >>>> >>>> As I said in my first comment to this thread - I do not think that >>>> reboot-after-install is desperately needed, and find that it does not >>>> deserve the Engine-side complexity of calling a new verb. >>>> >>>> Dan. >>> >>> What we are trying to say, product wise, is that the requirement to >>> remotely >>> reboot a host >> >> No it is not a requirement - it is here because in the past it proved to be a >> necessity (on the early days), >> i don't think it's a must today. >> >>> (cooperate reboot) may be available regardless of the >>> host-deploy sequence. Administrator may decide to reboot a host right after >>> host-deploy or once a week. >> >> Correct, and the right way to do it is first move to maintenance mode. >> >>> >>> Adding the ability to perform reboot is different independent discussion. >>> >>> The only reason we discuss it here is because we currently force reboot >>> after >>> host-deploy (although in the API it is optional). >>> >>> Having the bridge created by the engine is, in my opinion, far more >>> important >>> than keeping the reboot feature. We can discuss if remote reboot feature >>> should and to which version, regardless. >> >> >> I agree with you that the creation of the bridge engine is more important >> from the reboot, >> However when discussing new reboot API for VDSM, I prefer to not do reboot at >> all on host-deploy, and do the bridge config by engine. >> >> The reboot API suggested is a general purpose API which in this discussion is >> focused around a specific use case (host-deploy), >> If we had a way to enforce call for the reboot API only in the deployment >> scenario, I would have been o.k. with it, >> But the weird thing about APIs is that people end up using them ... and not >> always as we intended, >> and we might end up finding ourselves in tough situations due to a stray >> reboot call from X ??? > > I must reply that, cannot help it... :))) I know you can't ;-) > What if the admin just login to the server and just rebooted? > What if there is power failure? > What if there is provisioning infrastructure that manages the servers in parallel of ovirt and it decides to reboot? > We should deal with that in any case... > And doing this via VDSM has only advantages, as VDSM is aware of the reboot request and can take safety measures to complete it successfully without losing information nor state. 
As I said above, if we could limit the reboot API call for host deploy I was o.k with it. Since we can't than it is likely that it will be used in some other flow, which might cause a mess. It is one thing when an admin is doing it manually (and is aware he is loosing all the running VMs), and between having the engine call it in some other corner case flow (just because the API was there and it looked like he right thing when reviewed), and all running VMs die (just like case we have had with the lib virt crash .... Fencing race) Eventually general purpose APIs are being used, And here you are arguing about the most destructive API in the context of a distant corner case. I find it hard to believe such a patch will pass vdsm code review ... > >> This is the reason I have suggested reboot over SSH which is different. > > Right, but we can do about 40% (I think closer to 80%) of VDSM functionality via SSH, right? 35 ? ;-) > >> And I would argue that host deploy is here to stay hence the dependency in >> SSH is here to stay. > > There are talks to move to standard provisioning framework such as puppet or even foreman... not sure what the future is. And who will install and provision those systems ? .... I assume Otopi ... Using SSH > >> >> Thanks >> Barak Azulay >> >> >> >> >>> >>> Regards, >>> Alon >>> _______________________________________________ >>> Arch mailing list >>> Arch at ovirt.org >>> http://lists.ovirt.org/mailman/listinfo/arch >>> >>> >>> >> From alonbl at redhat.com Sun May 12 21:12:33 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Sun, 12 May 2013 17:12:33 -0400 (EDT) Subject: [ANN] New development environment for ovirt-engine In-Reply-To: <9423451.273926.1368359571790.JavaMail.root@redhat.com> References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> Message-ID: <1692979816.341418.1368393153722.JavaMail.root@redhat.com> Hello, As promised, I updated the wiki pages of engine developer environment to refer to this[1] single new page, I hope in time we can merge all non-trivial contributions into the README.developer. Feel free to contribute/fix as you experience issues. Regards, Alon Bar-Lev. [1] http://www.ovirt.org/OVirt_Engine_Development_Environment ----- Original Message ----- > From: "Alon Bar-Lev" > To: "engine-devel" > Cc: "Yaniv Bronheim" , "Moti Asayag" , "Limor Gavish" , > "Sharad Mishra" , "Alex Lourie" , "Sandro Bonazzola" , > "arch" , "Ofer Schreiber" > Sent: Sunday, May 12, 2013 2:52:51 PM > Subject: [ANN] New development environment for ovirt-engine > > Hello all ovirt-engine developers, > > When I first joined the ovirt project, it took me about two weeks to setup a > development environment, I needed to work on a bug related to host-deploy so > I needed an environment that could use the ssh, PKI, vdsm-bootstrap and > communicate with vdsm using SSL, this was virtually impossible to do so > without tweaking the product in a way that it is so different from > production use, that I cannot guarantee that whatever tested in development > will actually work in production. > > I peeked at the installation script in a hope that I can create partial > environment similar to production, but I found that the packaging > implementation makes to much assumption and is very difficult to adopt. The > fact that I do not use fedora/rhel for my development made it even worse. > > I had no other option than to create rpms after each of my changes and test > each in real production like setup. 
> > It was obvious to me that the manual customization of developers to achieve > working product will eventually break as product grow and move away from > being developer friendly to production friendly. For example, product > defaults cannot be these which serve developers, but these which serve > production the best, or having a valid PKI setup cannot be optional any more > as components do need to use it. Same for location of files and > configuration, for example, if we write a pluggable infrastructure for > branding, we cannot damage the interface just because developers runs the > product in their own manual customization. > > I took the opportunity handed to me to port the ovirt-engine to other > distributions in order to provide a development environment that is similar > to production setup. Together with Sandro Bonazzola and Alex Lourie we > re-wrote the whole installation of the product which can also be used to > setup the desired development environment. > > Within this environment the product is set up using the same tools and > configuration as in production, while the process does not require special > privileges nor changes the state of the developer machine. > > A complete documentation is available[1], I preferred to use README within > the source tree as wiki tend to quickly become obsolete, while documentation > within source tree can be modified by the commit that introduces a change. I > will redirect to this file from the current wiki once the site will be up. > > In a nut shell, after installing prerequisites, build and install the product > using: > > $ make clean install-dev PREFIX=$HOME/ovirt-engine > > This will run maven and create product installation at $HOME/ovirt-engine > Next, a setup phase is required just like in production, to initialize > configuration and database: > > $ $HOME/ovirt-engine/bin/engine-setup-2 > > You have now fully functional product, including PKI, SSL, host-deploy, > tools. > No manual database updates are required, no lose of functionality. > > All that is left is to start the engine service: > > $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start > > Access to application: > http://localhost:8080 > https://localhost:8443 > Debugging port is opened at port 8787. > > Farther information exists in the documentation[1]. > > There are several inherit benefits of the new environment, the major one is > the ability to manage several environments in parallel on the same host. For > example, if we develop two separate features on two branches we can install > the product into $HOME/ovirt-engine-feature1 and > $HOME/ovirt-engine-feature-2 and have a separate database for each, if we > modify the ports jboss is listening to we can run two instances of engine at > the same time! > > We will be happy to work with all developers to assist in porting into the > new development environment, the simplest is to create a new database for > this effort. Moti has a sequence of converting the existing database owned > by postgres to be owned by the engine, Moti, can you please share that? > > We are sure there are missing bits, we will be happy to know these so we can > improve. > > I am aware that developers (especially java) are conservative, but I ask you > to give us a chance, so that we make it easy for developers to join the > project, and to allow us to drop the parallel effort of packaging to > production and fixing the broken development environment. 
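(To make the parallel-environments point above concrete: under the assumption of two hypothetical feature branches, the workflow described might look like the following - the tree names are made up, and the second instance needs its jboss ports changed before starting; see README.developer for the actual knobs.)

    $ make clean install-dev PREFIX=$HOME/ovirt-engine-feature1
    $ make clean install-dev PREFIX=$HOME/ovirt-engine-feature2
    $ $HOME/ovirt-engine-feature1/bin/engine-setup-2
    $ $HOME/ovirt-engine-feature2/bin/engine-setup-2
    $ $HOME/ovirt-engine-feature1/share/ovirt-engine/services/ovirt-engine.py start
    $ $HOME/ovirt-engine-feature2/share/ovirt-engine/services/ovirt-engine.py start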
>
> A special thanks to the developers who took the time to test and provide
> feedback before the merge:
> - Yaniv Bronheim
> - Moti Asayag
> - Limor Gavish
> - Sharad Mishra
> - Ofer Schreiber
>
> We are hoping that after migration you will find this environment useful
> and friendly,
>
> Sandro Bonazzola,
> Alex Lourie,
> Alon Bar-Lev.
>
> [1]
> http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD

From alonbl at redhat.com Mon May 13 05:31:18 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 13 May 2013 01:31:18 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <3F4704D7-F7EA-4DE2-AA0B-255E117B28DB@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <518F3DBB.6080606@redhat.com> <557908336.127528.1368346520205.JavaMail.root@redhat.com> <518F56CE.3010802@redhat.com> <3F4704D7-F7EA-4DE2-AA0B-255E117B28DB@redhat.com>
Message-ID: <281632927.397694.1368423078891.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Barak Azulay"
> To: "Livnat Peer"
> Cc: "arch" , "Alon Bar-Lev" , "Simon Grinberg"
> Sent: Monday, May 13, 2013 8:21:33 AM
> Subject: Re: feature suggestion: initial generation of management network
>
> > you can fence the node if VDSM is non responsive, that's the mechanism
> > we use today to deal with such cases.
>
> There are already requests to enable:
> - ability to fence a host with no PM
> - ability to do less destructive fencing (= restart vdsm) when the host is
> non-responsive but accessible using SSH. This way the VMs running on the
> host will not get lost.
>
> Both of the above can be achieved, as mentioned, using SSH, and can't be
> achieved with a restart API in vdsm.

Why? I truly don't understand... we should make sure VDSM is responsive to accept new commands, and VDSM can either reboot the machine or restart itself.

Alon

From bazulay at redhat.com Mon May 13 05:21:33 2013
From: bazulay at redhat.com (Barak Azulay)
Date: Mon, 13 May 2013 01:21:33 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <518F56CE.3010802@redhat.com>
References: <20121227121406.GD8915@redhat.com> <11330867.78.1356611742783.JavaMail.javamailuser@localhost> <20130101124757.GI7274@redhat.com> <1264143158.8051712.1367925739116.JavaMail.root@redhat.com> <518F3DBB.6080606@redhat.com> <557908336.127528.1368346520205.JavaMail.root@redhat.com> <518F56CE.3010802@redhat.com>
Message-ID: <3F4704D7-F7EA-4DE2-AA0B-255E117B28DB@redhat.com>

On May 12, 2013, at 11:46, Livnat Peer wrote:
> On 05/12/2013 11:15 AM, Barak Azulay wrote:
>>
>> ----- Original Message -----
>>> From: "Livnat Peer"
>>> To: "Moti Asayag"
>>> Cc: "arch" , "Alon Bar-Lev" , "Barak Azulay" , "Simon
>>> Grinberg"
>>> Sent: Sunday, May 12, 2013 9:59:07 AM
>>> Subject: Re: feature suggestion: initial generation of management network
>>>
>>> Thread Summary -
>>>
>>> 1. We all agree the automatic reboot after host installation is not
>>> needed anymore and can be removed.
>>>
>>> 2. There is a vast agreement that we need to add a new VDSM verb for reboot.
>>
>> I disagree with the above.
>>
>> In addition to the fact that it will not work when VDSM is not responsive
>> (when this action will be needed the most)
>>
>
> you can fence the node if VDSM is non responsive, that's the mechanism
> we use today to deal with such cases.
There are already requests to enable:
- ability to fence a host with no PM
- ability to do less destructive fencing (= restart vdsm) when the host is
non-responsive but accessible using SSH. This way the VMs running on the
host will not get lost.

Both of the above can be achieved, as mentioned, using SSH, and can't be
achieved with a restart API in vdsm.
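To make the "less destructive fencing" request concrete, here is a minimal sketch in the same hedged spirit as the reboot sketch earlier; vdsmd is the vdsm service name on Fedora/RHEL hosts, and everything else (user, host, escalation policy) is illustrative:

    import subprocess

    def soft_fence(host, user="root"):
        # Restart only vdsm over SSH instead of power-cycling the host,
        # so the VMs running on the host survive the recovery attempt.
        rc = subprocess.call([
            "ssh", "-o", "BatchMode=yes",
            "{0}@{1}".format(user, host),
            "service vdsmd restart",
        ])
        return rc == 0  # on failure, the engine would escalate to real fencing
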
>
>>
>>>
>>> 3. There was a suggestion to add a checkbox when adding a host to reboot
>>> the host after installation; the default would be not to reboot (leaving
>>> the option to reboot to the administrator).
>>>
>>> If there is no objection we'll go with the above.
>>>
>>> Thanks, Livnat
>>>
>>> On 05/07/2013 02:22 PM, Moti Asayag wrote:
>>>> I stumbled upon a few issues with the current design while implementing it:
>>>>
>>>> There seems to be a requirement to reboot the host after the installation
>>>> is completed, in order to assure the host is recoverable.
>>>>
>>>> Therefore, the building blocks of the installation process of 3.3 are:
>>>> 1. host deploy, which installs the host except for configuring its
>>>> management network.
>>>> 2. SetupNetwork (and CommitNetworkChanges) - for creating the management
>>>> network on the host and persisting the network configuration.
>>>> 3. Reboot the host - this is a missing piece. (The engine has a FenceVds
>>>> command, but it requires the power management to be configured prior to
>>>> the installation and might be irrelevant for hosts without PM.)
>>>>
>>>> So, there are a couple of issues here:
>>>> 1. How to reboot the host?
>>>> 1.1. By exposing a new RebootNode verb in VDSM and invoking it from the
>>>> engine
>>>> 1.2. By opening an SSH dialog to the host in order to execute the reboot
>>>>
>>>> 2. When to perform the reboot?
>>>> 2.1. After host deploy, by utilizing host deploy to perform the reboot.
>>>> This requires the network to be configured by the monitor when the host
>>>> is detected by the engine, detached from the installation flow. However,
>>>> it is a step toward the non-persistent network feature yet to be defined.
>>>> 2.2. After setupNetwork is done and the network was configured and
>>>> persisted on the host.
>>>> There is no special advantage from the recoverability aspect, as
>>>> setupNetwork is constantly used to persist the network configuration (by
>>>> the complementary CommitNetworkChanges command). In case the network
>>>> configuration fails, VDSM will revert to the last known-good
>>>> configuration - so connectivity with the engine should be restored.
>>>> Design-wise, it fits to configure the management network as part of the
>>>> installation sequence.
>>>> If the network configuration fails in this context, the host status will
>>>> be set to "InstallFailed" rather than "NonOperational", as might occur as
>>>> a result of a failed setupNetwork command.
>>>>
>>>> Your inputs are welcome.
>>>>
>>>> Thanks,
>>>> Moti
>>>> ----- Original Message -----
>>>>> From: "Dan Kenigsberg"
>>>>> To: "Simon Grinberg" , "Moti Asayag"
>>>>> Cc: "arch"
>>>>> Sent: Tuesday, January 1, 2013 2:47:57 PM
>>>>> Subject: Re: feature suggestion: initial generation of management network
>>>>>
>>>>> On Thu, Dec 27, 2012 at 07:36:40AM -0500, Simon Grinberg wrote:
>>>>>>
>>>>>> ----- Original Message -----
>>>>>>> From: "Dan Kenigsberg"
>>>>>>> To: "Simon Grinberg"
>>>>>>> Cc: "arch"
>>>>>>> Sent: Thursday, December 27, 2012 2:14:06 PM
>>>>>>> Subject: Re: feature suggestion: initial generation of management
>>>>>>> network
>>>>>>>
>>>>>>> On Tue, Dec 25, 2012 at 09:29:26AM -0500, Simon Grinberg wrote:
>>>>>>>>
>>>>>>>> ----- Original Message -----
>>>>>>>>> From: "Dan Kenigsberg"
>>>>>>>>> To: "arch"
>>>>>>>>> Sent: Tuesday, December 25, 2012 2:27:22 PM
>>>>>>>>> Subject: feature suggestion: initial generation of management
>>>>>>>>> network
>>>>>>>>>
>>>>>>>>> Current condition:
>>>>>>>>> ==================
>>>>>>>>> The management network, named ovirtmgmt, is created during host
>>>>>>>>> bootstrap. It consists of a bridge device, connected to the network
>>>>>>>>> device that was used to communicate with Engine (nic, bonding or
>>>>>>>>> vlan). It inherits its ip settings from the latter device.
>>>>>>>>>
>>>>>>>>> Why Is the Management Network Needed?
>>>>>>>>> =====================================
>>>>>>>>> Understandably, some may ask why we need to have a management
>>>>>>>>> network - why having a host with IPv4 configured on it is not
>>>>>>>>> enough. The answer is twofold:
>>>>>>>>> 1. In oVirt, a network is an abstraction of the resources required
>>>>>>>>> for connectivity of a host for a specific usage. This is true for
>>>>>>>>> the management network just as it is for a VM network or a display
>>>>>>>>> network. The network entity is the key for adding/changing nics and
>>>>>>>>> IP addresses.
>>>>>>>>> 2. On many occasions (such as small setups) the management network
>>>>>>>>> is used as a VM/display network as well.
>>>>>>>>>
>>>>>>>>> Problems in current connectivity:
>>>>>>>>> ================================
>>>>>>>>> According to alonbl of ovirt-host-deploy fame, and with no conflict
>>>>>>>>> to my own experience, creating the management network is the most
>>>>>>>>> fragile, error-prone step of bootstrap.
>>>>>>>>
>>>>>>>> +1,
>>>>>>>> I've raised that repeatedly in the past: bootstrap should not create
>>>>>>>> the management network but pick up the existing configuration and
>>>>>>>> let the engine override it later with its own configuration if it
>>>>>>>> differs. I'm glad that we finally get to that.
>>>>>>>>
>>>>>>>>>
>>>>>>>>> Currently it always creates a bridged network (even if the DC
>>>>>>>>> requires a non-bridged ovirtmgmt), it knows nothing about the
>>>>>>>>> defined MTU for ovirtmgmt, it uses ping to guess on top of which
>>>>>>>>> device to build (and thus requires Vdsm-to-Engine reverse
>>>>>>>>> connectivity), and is the sole remaining user of the
>>>>>>>>> addNetwork/vdsm-store-net-conf scripts.
>>>>>>>>>
>>>>>>>>> Suggested feature:
>>>>>>>>> ==================
>>>>>>>>> Bootstrap would avoid creating a management network.
>>>>>>>>> Instead, after bootstrapping a host, Engine would send a getVdsCaps
>>>>>>>>> probe to the installed host, receiving a complete picture of the
>>>>>>>>> network configuration on the host. Among this picture is the device
>>>>>>>>> that holds the host's management IP address.
>>>>>>>>>
>>>>>>>>> Engine would send a setupNetworks command to generate ovirtmgmt with
>>>>>>>>> details devised from this picture, and according to the DC
>>>>>>>>> definition of ovirtmgmt. For example, if Vdsm reports:
>>>>>>>>>
>>>>>>>>> - vlan bond4.3000 has the host's IP, configured to use dhcp.
>>>>>>>>> - bond4 comprises eth2 and eth3
>>>>>>>>> - ovirtmgmt is defined as a VM network with MTU 9000
>>>>>>>>>
>>>>>>>>> then Engine sends the likes of:
>>>>>>>>> setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4,
>>>>>>>>> bonding=bond4: {eth2,eth3}, MTU=9000)
>>>>>>>>
>>>>>>>> Just one comment here,
>>>>>>>> In order to save time and confusion - if ovirtmgmt is defined with
>>>>>>>> default values, meaning the user did not bother to touch it, let it
>>>>>>>> pick up the VLAN configuration from the first host added in the Data
>>>>>>>> Center.
>>>>>>>>
>>>>>>>> Otherwise, you may override the host's VLAN and lose connectivity.
>>>>>>>>
>>>>>>>> This will also solve the situation many users encounter today:
>>>>>>>> 1. The engine is on a host that actually has a VLAN defined
>>>>>>>> 2. The ovirtmgmt network was not updated in the DC
>>>>>>>> 3. A host with a VLAN already defined is added - everything works
>>>>>>>> fine
>>>>>>>> 4. Any number of hosts are now added, and again everything seems to
>>>>>>>> work fine.
>>>>>>>>
>>>>>>>> But now try to use setupNetworks, and you'll find out that you can't
>>>>>>>> do much on the interface that contains ovirtmgmt, since the
>>>>>>>> definition does not match. You can't sync (since this would remove
>>>>>>>> the VLAN and cause connectivity loss), and you can't add more
>>>>>>>> networks on top, since it already has a non-VLAN network on top
>>>>>>>> according to the DC definition, etc.
>>>>>>>>
>>>>>>>> On the other hand, you can't update the ovirtmgmt definition on the
>>>>>>>> DC, since there are clusters in the DC that use the network.
>>>>>>>>
>>>>>>>> The only workaround not involving a DB hack to change the VLAN on the
>>>>>>>> network is to:
>>>>>>>> 1. Create a new DC
>>>>>>>> 2. Do not use the wizard that pops up to create your cluster.
>>>>>>>> 3. Modify the ovirtmgmt network to have VLANs
>>>>>>>> 4. Now create a cluster and add your hosts.
>>>>>>>>
>>>>>>>> If you insist on using the default DC and cluster, then before adding
>>>>>>>> the first host, create an additional DC and move the Default cluster
>>>>>>>> over there. You may then change the network on the Default cluster
>>>>>>>> and then move the Default cluster back.
>>>>>>>>
>>>>>>>> Both are ugly, and should be solved by the proposal above.
>>>>>>>>
>>>>>>>> We do something similar for the Default cluster CPU level, where we
>>>>>>>> set the initial level based on the first host added to the cluster.
>>>>>>>
>>>>>>> I'm not sure what Engine has for the Default cluster CPU level. But I
>>>>>>> have reservations about the hysteresis in your proposal - after a host
>>>>>>> is added, the DC cannot forget ovirtmgmt's vlan.
>>>>>>>
>>>>>>> How about letting the admin edit ovirtmgmt's vlan at the DC level,
>>>>>>> thus rendering all hosts out-of-sync.
>>>>>>> Then the admin could manually, or through a script, or in the future
>>>>>>> through a distributed operation, sync all the hosts to the definition?
>>>>>>
>>>>>> Usually if you do that you will lose connectivity to the hosts.
>>>>>
>>>>> Yes, changing the management vlan id (or ip address) is never fun, and
>>>>> requires out-of-band intervention.
>>>>>
>>>>>> I'm not insisting on the automatic adjustment of the ovirtmgmt network
>>>>>> to match the hosts' (that is just a nice touch); we can take the
>>>>>> allow-edit approach.
>>>>>>
>>>>>> But allowing the VLAN to be changed on the ovirtmgmt network will
>>>>>> indeed solve the issue I'm trying to solve, while creating another
>>>>>> issue of the user expecting that we'll be able to re-tag the host from
>>>>>> the engine side, which is challenging to do.
>>>>>>
>>>>>> On the other hand, if we allow changing the VLAN as long as the change
>>>>>> matches the hosts' configuration, it will solve the issue while not
>>>>>> deluding the user into thinking that we can really solve the
>>>>>> chicken-and-egg issue of re-tagging the entire system.
>>>>>>
>>>>>> Now, with the above ability, you do get a flow to do the re-tag:
>>>>>> 1. Place all the hosts in maintenance
>>>>>> 2. Re-tag ovirtmgmt on all the hosts
>>>>>> 3. Re-tag the host the engine is on
>>>>>> 4. Activate the hosts - this should work well now since connectivity
>>>>>> exists
>>>>>> 5. Change the tag on ovirtmgmt on the engine to match the hosts'
>>>>>>
>>>>>> A simple and clear process.
>>>>>>
>>>>>> When the workaround of creating another DC was not possible, since the
>>>>>> system was already long in use and the need was a re-tag of the
>>>>>> network, the above is what I've recommended, except that steps 4-5 were
>>>>>> done as:
>>>>>> 4. Stop the engine
>>>>>> 5. Change the tag in the DB
>>>>>> 6. Start the engine
>>>>>> 7. Activate the hosts
>>>>>
>>>>> Sounds reasonable to me - but as far as I am aware this is not tightly
>>>>> related to the $Subject, which is the post-boot ovirtmgmt definition.
>>>>>
>>>>> I've added a few details to
>>>>> http://www.ovirt.org/Features/Normalized_ovirtmgmt_Initialization#Engine
>>>>> and I would appreciate a review from someone with intimate Engine
>>>>> know-how.
>>>>>
>>>>> Dan.
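As an aside, the setupNetworks example quoted in the thread above maps fairly directly onto vdsm's API. A minimal sketch of such a call, assuming vdsm's xmlrpc endpoint on its usual port 54321 and the setupNetworks(networks, bondings, options) verb layout; the exact parameter keys are an assumption modelled on the quoted example, and a production vdsm would require SSL rather than plain http:

    import xmlrpc.client

    # plain http only to keep the sketch short; real deployments use SSL
    server = xmlrpc.client.ServerProxy("http://host01.example.com:54321")

    networks = {"ovirtmgmt": {"bridged": True, "vlan": "3000",
                              "bonding": "bond4", "mtu": "9000"}}
    bondings = {"bond4": {"nics": ["eth2", "eth3"]}}
    # connectivityCheck asks vdsm to roll back if the engine is lost,
    # which is the recovery safety net discussed in the thread above.
    options = {"connectivityCheck": True, "connectivityTimeout": 60}

    print(server.setupNetworks(networks, bondings, options))
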
From bazulay at redhat.com Mon May 13 19:34:02 2013
From: bazulay at redhat.com (Barak Azulay)
Date: Mon, 13 May 2013 15:34:02 -0400 (EDT)
Subject: [ANN] New development environment for ovirt-engine
In-Reply-To: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
Message-ID: <545E4A23-4E49-478F-B8E8-FBB7488C22ED@redhat.com>

Good work guys,

Thanks
Barak Azulay

On May 12, 2013, at 14:52, Alon Bar-Lev wrote:
> Hello all ovirt-engine developers,
> [...]
From emesika at redhat.com Tue May 14 00:45:41 2013
From: emesika at redhat.com (Eli Mesika)
Date: Mon, 13 May 2013 20:45:41 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
Message-ID: <188337855.1067044.1368492341205.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "engine-devel"
> Cc: "arch" , "Sharad Mishra" , "Limor Gavish"
> Sent: Sunday, May 12, 2013 2:52:51 PM
> Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
>
> > [...]
> > There are several inherent benefits of the new environment; the major one
> > is the ability to manage several environments in parallel on the same
> > host. For example, if we develop two separate features on two branches,
> > we can install the product into $HOME/ovirt-engine-feature-1 and
> > $HOME/ovirt-engine-feature-2 and have a separate database for each; if we
> > modify the ports jboss is listening to, we can run two instances of the
> > engine at the same time!

It is not clear to me why working on 2 bugs needs 2 installations of the
development environment.
If you have 2 different git branches and a separate database for each, it's
enough, am I missing something?
I used to create a git branch named after the BZ# and use the create_db.sh
script to create a new database with the BZ# name.
Is this possible in the new method?
Also, does this mean that I will have to create/configure a new workspace for
eclipse each time I am starting to work on a new bug?

Thanks

> > [...]
From yzaslavs at redhat.com Tue May 14 02:39:19 2013
From: yzaslavs at redhat.com (Yair Zaslavsky)
Date: Mon, 13 May 2013 22:39:19 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <188337855.1067044.1368492341205.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com>
Message-ID: <975472192.855606.1368499159787.JavaMail.root@redhat.com>

Alon,
I have FC17 and followed the steps at the wiki. I defined the ovirt nightly
repo

[ovirt-nightly]
name=ovirt-nightly
baseurl=http://resources.ovirt.org/releases/nightly/rpm/Fedora/17/
enabled=1
gpgcheck=0
priority=1
protect=1

and performed yum install according to your guidelines.
It fails to find python-m2crypto.

Can you please advise on the matter?

Many thanks,
Yair

----- Original Message -----
> From: "Eli Mesika"
> To: "Alon Bar-Lev"
> Cc: "engine-devel" , "arch"
> Sent: Tuesday, May 14, 2013 3:45:41 AM
> Subject: Re: [Engine-devel] [ANN] New development environment for
> ovirt-engine
>
> [...]
Together with Sandro Bonazzola and Alex Lourie we > > re-wrote the whole installation of the product which can also be used to > > setup the desired development environment. > > > > Within this environment the product is set up using the same tools and > > configuration as in production, while the process does not require special > > privileges nor changes the state of the developer machine. > > > > A complete documentation is available[1], I preferred to use README within > > the source tree as wiki tend to quickly become obsolete, while > > documentation > > within source tree can be modified by the commit that introduces a change. > > I > > will redirect to this file from the current wiki once the site will be up. > > > > In a nut shell, after installing prerequisites, build and install the > > product > > using: > > > > $ make clean install-dev PREFIX=$HOME/ovirt-engine > > > > This will run maven and create product installation at $HOME/ovirt-engine > > Next, a setup phase is required just like in production, to initialize > > configuration and database: > > > > $ $HOME/ovirt-engine/bin/engine-setup-2 > > > > You have now fully functional product, including PKI, SSL, host-deploy, > > tools. > > No manual database updates are required, no lose of functionality. > > > > All that is left is to start the engine service: > > > > $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start > > > > Access to application: > > http://localhost:8080 > > https://localhost:8443 > > Debugging port is opened at port 8787. > > > > Farther information exists in the documentation[1]. > > > > There are several inherit benefits of the new environment, the major one is > > the ability to manage several environments in parallel on the same host. > > For > > example, if we develop two separate features on two branches we can install > > the product into $HOME/ovirt-engine-feature1 and > > $HOME/ovirt-engine-feature-2 and have a separate database for each, if we > > modify the ports jboss is listening to we can run two instances of engine > > at > > the same time! > > It is not clear to me why working on 2 bugs needs 2 installations of the > development environment. > If you have 2 different git branches and a separate database for each, its > enough , am I missing something ? > I was used to create a git branch with the name of the BZ# and use > create_db.sh script to create a new database with the BZ# name. > Is this possible in the new method? > Also, does this mean that I will have to create/configure a new workspace for > eclipse each time I am starting to work on a new bug? > > > Thanks > > > > > > We will be happy to work with all developers to assist in porting into the > > new development environment, the simplest is to create a new database for > > this effort. Moti has a sequence of converting the existing database owned > > by postgres to be owned by the engine, Moti, can you please share that? > > > > We are sure there are missing bits, we will be happy to know these so we > > can > > improve. > > > > I am aware that developers (especially java) are conservative, but I ask > > you > > to give us a chance, so that we make it easy for developers to join the > > project, and to allow us to drop the parallel effort of packaging to > > production and fixing the broken development environment. 
From alonbl at redhat.com Tue May 14 05:58:06 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Tue, 14 May 2013 01:58:06 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <975472192.855606.1368499159787.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com> <975472192.855606.1368499159787.JavaMail.root@redhat.com>
Message-ID: <998811568.829527.1368511086124.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Yair Zaslavsky"
> To: "Eli Mesika"
> Cc: "Alon Bar-Lev" , "engine-devel" , "arch"
> Sent: Tuesday, May 14, 2013 5:39:19 AM
> Subject: Re: [Engine-devel] [ANN] New development environment for
> ovirt-engine
>
> [...]
> and performed yum install according to your guidelines.
> It fails to find python-m2crypto.

Has nothing to do with ovirt :)
Try m2crypto, please.

> [...]
From alonbl at redhat.com Tue May 14 06:18:44 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Tue, 14 May 2013 02:18:44 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <188337855.1067044.1368492341205.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com>
Message-ID: <1732269511.831688.1368512324398.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Eli Mesika"
> To: "Alon Bar-Lev"
> Cc: "engine-devel" , "arch"
> Sent: Tuesday, May 14, 2013 3:45:41 AM
> Subject: Re: [Engine-devel] [ANN] New development environment for
> ovirt-engine
>
> > [...]
> > There are several inherent benefits of the new environment; the major one
> > is the ability to manage several environments in parallel on the same
> > host. For example, if we develop two separate features on two branches,
> > we can install the product into $HOME/ovirt-engine-feature-1 and
> > $HOME/ovirt-engine-feature-2 and have a separate database for each; if we
> > modify the ports jboss is listening to, we can run two instances of the
> > engine at the same time!
>
> It is not clear to me why working on 2 bugs needs 2 installations of the
> development environment.
> If you have 2 different git branches and a separate database for each, it's
> enough, am I missing something?

You have two different git branches, so you have two different binaries out of
the build, right? These binaries should reside somewhere.

Currently you probably use maven deploy or something similar into a single
jboss, without a proper environment, overriding the other branch's binaries
each time. What if there is a change in configuration between the two? More
prerequisites? A different service configuration? A different database schema?

To make it easy, you can just duplicate the development environment and be
sure you are up and running. If you still want to take the chance, you can...
just make install-dev PREFIX=$HOME/same-place and you will have the behavior
of overriding the same environment, hoping for the best.

> I used to create a git branch named after the BZ# and use the create_db.sh
> script to create a new database with the BZ# name.
> Is this possible in the new method?

Yes.

You create an empty database manually, let's say engine-branch1. Then you
install the environment to its own location, say:

 make install-dev PREFIX=$HOME/ovirt-engine-branch1

When running engine-setup-2 you instruct the setup to use the engine-branch1
database. This database will be created out of the new branch's create_db.sh;
create_db.sh should not be run manually now.

However, if you insist on creating a database manually and modifying an
existing environment to use the new database, that is possible as well. Edit
$PREFIX/etc/ovirt-engine/engine.conf.d/10-setup-database.conf, specify where
the database is, and then start the engine service (a sketch of such a file
follows below this message).

> Also, does this mean that I will have to create/configure a new workspace for
> eclipse each time I am starting to work on a new bug?

This I do not know. How does it work now when you switch branches?
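For reference, a sketch of what such a database override file might contain. The variable names here are an assumption about the engine's conf.d layout, not taken from the thread, so verify them against the file your own engine-setup-2 run generated:

    # $HOME/ovirt-engine-branch1/etc/ovirt-engine/engine.conf.d/10-setup-database.conf
    # (illustrative only -- variable names are assumptions, check your generated file)
    ENGINE_DB_HOST=localhost
    ENGINE_DB_PORT=5432
    ENGINE_DB_USER=engine
    ENGINE_DB_PASSWORD=engine
    ENGINE_DB_DATABASE=engine-branch1
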
From yzaslavs at redhat.com Tue May 14 06:55:52 2013
From: yzaslavs at redhat.com (Yair Zaslavsky)
Date: Tue, 14 May 2013 02:55:52 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <998811568.829527.1368511086124.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com> <975472192.855606.1368499159787.JavaMail.root@redhat.com> <998811568.829527.1368511086124.JavaMail.root@redhat.com>
Message-ID: <1540245909.911838.1368514552129.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Yair Zaslavsky"
> Cc: "Eli Mesika" , "engine-devel" , "arch"
> Sent: Tuesday, May 14, 2013 8:58:06 AM
> Subject: Re: [Engine-devel] [ANN] New development environment for
> ovirt-engine
>
> > [...]
> > It fails to find python-m2crypto.
>
> Has nothing to do with ovirt :)
> Try m2crypto, please.

Worked! I would suggest updating the wiki.

However, now I fail at webadmin - commit hash
cda607c80a19dd08585fc0271ea7d57e03f9a43f

[INFO] javac option: -s
[INFO] javac option: /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/webadmin/target/generated-sources/annotations
[INFO] diagnostic /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/gwt-common/target/gwt-common-3.3.0-SNAPSHOT.jar(org/ovirt/engine/ui/common/presenter/AbstractPopupPresenterWidget.java):27: error: method getCloseButton() is already defined in interface org.ovirt.engine.ui.common.presenter.AbstractPopupPresenterWidget.ViewDef
HasClickHandlers getCloseButton();
^
[INFO] diagnostic /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/gwt-common/target/gwt-common-3.3.0-SNAPSHOT.jar(org/ovirt/engine/ui/common/presenter/AbstractPopupPresenterWidget.java):32: error: method getCloseIconButton() is already defined in interface org.ovirt.engine.ui.common.presenter.AbstractPopupPresenterWidget.ViewDef
HasClickHandlers getCloseIconButton();

Anyone got a clue?

> [...]
> > > > Many thanks, > > Yair > > > > > > > > ----- Original Message ----- > > > From: "Eli Mesika" > > > To: "Alon Bar-Lev" > > > Cc: "engine-devel" , "arch" > > > Sent: Tuesday, May 14, 2013 3:45:41 AM > > > Subject: Re: [Engine-devel] [ANN] New development environment for > > > ovirt-engine > > > > > > > > > > > > ----- Original Message ----- > > > > From: "Alon Bar-Lev" > > > > To: "engine-devel" > > > > Cc: "arch" , "Sharad Mishra" , > > > > "Limor > > > > Gavish" > > > > Sent: Sunday, May 12, 2013 2:52:51 PM > > > > Subject: [Engine-devel] [ANN] New development environment for > > > > ovirt-engine > > > > > > > > Hello all ovirt-engine developers, > > > > > > > > When I first joined the ovirt project, it took me about two weeks to > > > > setup > > > > a > > > > development environment, I needed to work on a bug related to > > > > host-deploy > > > > so > > > > I needed an environment that could use the ssh, PKI, vdsm-bootstrap and > > > > communicate with vdsm using SSL, this was virtually impossible to do so > > > > without tweaking the product in a way that it is so different from > > > > production use, that I cannot guarantee that whatever tested in > > > > development > > > > will actually work in production. > > > > > > > > I peeked at the installation script in a hope that I can create partial > > > > environment similar to production, but I found that the packaging > > > > implementation makes to much assumption and is very difficult to adopt. > > > > The > > > > fact that I do not use fedora/rhel for my development made it even > > > > worse. > > > > > > > > I had no other option than to create rpms after each of my changes and > > > > test > > > > each in real production like setup. > > > > > > > > It was obvious to me that the manual customization of developers to > > > > achieve > > > > working product will eventually break as product grow and move away > > > > from > > > > being developer friendly to production friendly. For example, product > > > > defaults cannot be these which serve developers, but these which serve > > > > production the best, or having a valid PKI setup cannot be optional any > > > > more > > > > as components do need to use it. Same for location of files and > > > > configuration, for example, if we write a pluggable infrastructure for > > > > branding, we cannot damage the interface just because developers runs > > > > the > > > > product in their own manual customization. > > > > > > > > I took the opportunity handed to me to port the ovirt-engine to other > > > > distributions in order to provide a development environment that is > > > > similar > > > > to production setup. Together with Sandro Bonazzola and Alex Lourie we > > > > re-wrote the whole installation of the product which can also be used > > > > to > > > > setup the desired development environment. > > > > > > > > Within this environment the product is set up using the same tools and > > > > configuration as in production, while the process does not require > > > > special > > > > privileges nor changes the state of the developer machine. > > > > > > > > A complete documentation is available[1], I preferred to use README > > > > within > > > > the source tree as wiki tend to quickly become obsolete, while > > > > documentation > > > > within source tree can be modified by the commit that introduces a > > > > change. > > > > I > > > > will redirect to this file from the current wiki once the site will be > > > > up. 
> > > >
> > > > In a nutshell, after installing prerequisites, build and install the product using:
> > > >
> > > > $ make clean install-dev PREFIX=$HOME/ovirt-engine
> > > >
> > > > This will run maven and create a product installation at $HOME/ovirt-engine
> > > > Next, a setup phase is required just like in production, to initialize configuration and database:
> > > >
> > > > $ $HOME/ovirt-engine/bin/engine-setup-2
> > > >
> > > > You now have a fully functional product, including PKI, SSL, host-deploy, tools.
> > > > No manual database updates are required, no loss of functionality.
> > > >
> > > > All that is left is to start the engine service:
> > > >
> > > > $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start
> > > >
> > > > Access to application:
> > > > http://localhost:8080
> > > > https://localhost:8443
> > > > Debugging port is opened at port 8787.
> > > >
> > > > Further information exists in the documentation[1].
> > > >
> > > > There are several inherent benefits of the new environment; the major one is the ability to manage several environments in parallel on the same host. For example, if we develop two separate features on two branches we can install the product into $HOME/ovirt-engine-feature1 and $HOME/ovirt-engine-feature2 and have a separate database for each; if we modify the ports jboss is listening to, we can run two instances of the engine at the same time! (A concrete command sequence for this workflow is sketched below.)
> > >
> > > It is not clear to me why working on 2 bugs needs 2 installations of the development environment.
> > > If you have 2 different git branches and a separate database for each, it's enough; am I missing something?
> > > I used to create a git branch with the name of the BZ# and use the create_db.sh script to create a new database with the BZ# name.
> > > Is this possible in the new method?
> > > Also, does this mean that I will have to create/configure a new workspace for eclipse each time I am starting to work on a new bug?
> > >
> > > Thanks
> > >
> > > > We will be happy to work with all developers to assist in porting into the new development environment; the simplest is to create a new database for this effort. Moti has a sequence for converting the existing database owned by postgres to be owned by the engine. Moti, can you please share that?
> > > >
> > > > We are sure there are missing bits; we will be happy to know these so we can improve.
> > > >
> > > > I am aware that developers (especially java) are conservative, but I ask you to give us a chance, so that we make it easy for developers to join the project, and to allow us to drop the parallel effort of packaging to production and fixing the broken development environment.
> > > >
> > > > A special thanks to developers who took the time to test and provide feedback before the merge:
> > > > - Yaniv Bronheim
> > > > - Moti Asayag
> > > > - Limor Gavish
> > > > - Sharad Mishra
> > > > - Ofer Schreiber
> > > >
> > > > We are hoping that after migration you will find this environment useful and friendly,
> > > >
> > > > Sandro Bonazzola,
> > > > Alex Lourie,
> > > > Alon Bar-Lev.
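[Editorial note: to make the parallel-environments workflow above concrete, here is a short shell sketch. The branch names are hypothetical; the commands themselves are the ones documented in the announcement, and each environment is pointed at its own database during engine-setup-2:]

$ git checkout feature1
$ make clean install-dev PREFIX=$HOME/ovirt-engine-feature1
$ $HOME/ovirt-engine-feature1/bin/engine-setup-2    # point this env at database #1
$ git checkout feature2
$ make clean install-dev PREFIX=$HOME/ovirt-engine-feature2
$ $HOME/ovirt-engine-feature2/bin/engine-setup-2    # point this env at database #2

To run both engines concurrently, the jboss listening ports of one environment have to be changed first, as the announcement notes.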
> > > > > > > > [1] > > > > http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD > > > > _______________________________________________ > > > > Engine-devel mailing list > > > > Engine-devel at ovirt.org > > > > http://lists.ovirt.org/mailman/listinfo/engine-devel > > > > > > > _______________________________________________ > > > Engine-devel mailing list > > > Engine-devel at ovirt.org > > > http://lists.ovirt.org/mailman/listinfo/engine-devel > > > > > > From alonbl at redhat.com Tue May 14 06:58:46 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Tue, 14 May 2013 02:58:46 -0400 (EDT) Subject: [Engine-devel] [ANN] New development environment for ovirt-engine In-Reply-To: <1540245909.911838.1368514552129.JavaMail.root@redhat.com> References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com> <975472192.855606.1368499159787.JavaMail.root@redhat.com> <998811568.829527.1368511086124.JavaMail.root@redhat.com> <1540245909.911838.1368514552129.JavaMail.root@redhat.com> Message-ID: <40963367.836876.1368514726698.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Yair Zaslavsky" > To: "Alon Bar-Lev" , "Daniel Erez" , "Gilad Chaplik" > Cc: "Eli Mesika" , "engine-devel" , "arch" > Sent: Tuesday, May 14, 2013 9:55:52 AM > Subject: Re: [Engine-devel] [ANN] New development environment for ovirt-engine > > > > ----- Original Message ----- > > From: "Alon Bar-Lev" > > To: "Yair Zaslavsky" > > Cc: "Eli Mesika" , "engine-devel" > > , "arch" > > Sent: Tuesday, May 14, 2013 8:58:06 AM > > Subject: Re: [Engine-devel] [ANN] New development environment for > > ovirt-engine > > > > > > > > ----- Original Message ----- > > > From: "Yair Zaslavsky" > > > To: "Eli Mesika" > > > Cc: "Alon Bar-Lev" , "engine-devel" > > > , "arch" > > > Sent: Tuesday, May 14, 2013 5:39:19 AM > > > Subject: Re: [Engine-devel] [ANN] New development environment for > > > ovirt-engine > > > > > > Alon, > > > I have FC17, and followed the steps at the wiki , i defined the ovirt > > > nightly > > > repo > > > > > > [ovirt-nightly] > > > name=ovirt-nightly > > > baseurl=http://resources.ovirt.org/releases/nightly/rpm/Fedora/17/ > > > enabled=1 > > > gpgcheck=0 > > > priority=1 > > > protect=1 > > > > > > And performed yum install according to your guidelines. > > > It fails to find python-m2crypto > > > > Has nothing to do with ovirt :) > > Try m2crypto please. > > > Worked ! I would suggest updating the WIKI > However, now I fail at webadmin - commit hash > cda607c80a19dd08585fc0271ea7d57e03f9a43f Please open a new thread, this is unrelated. 
>
> [INFO] javac option: -s
> [INFO] javac option: /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/webadmin/target/generated-sources/annotations
> [INFO] diagnostic /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/gwt-common/target/gwt-common-3.3.0-SNAPSHOT.jar(org/ovirt/engine/ui/common/presenter/AbstractPopupPresenterWidget.java):27: error: method getCloseButton() is already defined in interface org.ovirt.engine.ui.common.presenter.AbstractPopupPresenterWidget.ViewDef
> HasClickHandlers getCloseButton();
> ^
> [INFO] diagnostic /home/yzaslavs/work/ovirt_git/ovirt-engine/frontend/webadmin/modules/gwt-common/target/gwt-common-3.3.0-SNAPSHOT.jar(org/ovirt/engine/ui/common/presenter/AbstractPopupPresenterWidget.java):32: error: method getCloseIconButton() is already defined in interface org.ovirt.engine.ui.common.presenter.AbstractPopupPresenterWidget.ViewDef
> HasClickHandlers getCloseIconButton();
>
> Anyone got a clue?
>
> > >
> > > Can you please advise on the matter?
> > >
> > > Many thanks,
> > > Yair
> > >
> > > ----- Original Message -----
> > > > From: "Eli Mesika"
> > > > To: "Alon Bar-Lev"
> > > > Cc: "engine-devel" , "arch"
> > > > Sent: Tuesday, May 14, 2013 3:45:41 AM
> > > > Subject: Re: [Engine-devel] [ANN] New development environment for ovirt-engine
> > > >
> > > > ----- Original Message -----
> > > > > From: "Alon Bar-Lev"
> > > > > To: "engine-devel"
> > > > > Cc: "arch" , "Sharad Mishra" , "Limor Gavish"
> > > > > Sent: Sunday, May 12, 2013 2:52:51 PM
> > > > > Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
> > > > >
> > > > > Hello all ovirt-engine developers,
> > > > >
> > > > > When I first joined the ovirt project, it took me about two weeks to set up a development environment. I needed to work on a bug related to host-deploy, so I needed an environment that could use the ssh, PKI, vdsm-bootstrap and communicate with vdsm using SSL; this was virtually impossible to do without tweaking the product in a way that is so different from production use that I cannot guarantee that whatever is tested in development will actually work in production.
> > > > >
> > > > > I peeked at the installation script in the hope that I could create a partial environment similar to production, but I found that the packaging implementation makes too many assumptions and is very difficult to adapt. The fact that I do not use fedora/rhel for my development made it even worse.
> > > > >
> > > > > I had no other option than to create rpms after each of my changes and test each in a real production-like setup.
> > > > >
> > > > > It was obvious to me that the manual customization of developers to achieve a working product will eventually break as the product grows and moves away from being developer friendly to production friendly. For example, product defaults cannot be those which serve developers, but those which serve production best, and having a valid PKI setup cannot be optional any more as components do need to use it.
> > > > > Same for the location of files and configuration; for example, if we write a pluggable infrastructure for branding, we cannot damage the interface just because developers run the product in their own manual customization.
> > > > >
> > > > > I took the opportunity handed to me to port the ovirt-engine to other distributions in order to provide a development environment that is similar to a production setup. Together with Sandro Bonazzola and Alex Lourie we re-wrote the whole installation of the product, which can also be used to set up the desired development environment.
> > > > >
> > > > > Within this environment the product is set up using the same tools and configuration as in production, while the process does not require special privileges nor changes the state of the developer machine.
> > > > >
> > > > > A complete documentation is available[1]. I preferred to use a README within the source tree as wikis tend to quickly become obsolete, while documentation within the source tree can be modified by the commit that introduces a change. I will redirect to this file from the current wiki once the site is up.
> > > > >
> > > > > In a nutshell, after installing prerequisites, build and install the product using:
> > > > >
> > > > > $ make clean install-dev PREFIX=$HOME/ovirt-engine
> > > > >
> > > > > This will run maven and create a product installation at $HOME/ovirt-engine
> > > > > Next, a setup phase is required just like in production, to initialize configuration and database:
> > > > >
> > > > > $ $HOME/ovirt-engine/bin/engine-setup-2
> > > > >
> > > > > You now have a fully functional product, including PKI, SSL, host-deploy, tools.
> > > > > No manual database updates are required, no loss of functionality.
> > > > >
> > > > > All that is left is to start the engine service:
> > > > >
> > > > > $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start
> > > > >
> > > > > Access to application:
> > > > > http://localhost:8080
> > > > > https://localhost:8443
> > > > > Debugging port is opened at port 8787.
> > > > >
> > > > > Further information exists in the documentation[1].
> > > > >
> > > > > There are several inherent benefits of the new environment; the major one is the ability to manage several environments in parallel on the same host. For example, if we develop two separate features on two branches we can install the product into $HOME/ovirt-engine-feature1 and $HOME/ovirt-engine-feature2 and have a separate database for each; if we modify the ports jboss is listening to, we can run two instances of the engine at the same time!
> > > >
> > > > It is not clear to me why working on 2 bugs needs 2 installations of the development environment.
> > > > If you have 2 different git branches and a separate database for each, it's enough; am I missing something?
> > > > I used to create a git branch with the name of the BZ# and use the create_db.sh script to create a new database with the BZ# name.
> > > > Is this possible in the new method?
> > > > Also, does this mean that I will have to create/configure a new workspace for eclipse each time I am starting to work on a new bug?
> > > >
> > > > Thanks
> > > >
> > > > > We will be happy to work with all developers to assist in porting into the new development environment; the simplest is to create a new database for this effort. Moti has a sequence for converting the existing database owned by postgres to be owned by the engine. Moti, can you please share that?
> > > > >
> > > > > We are sure there are missing bits; we will be happy to know these so we can improve.
> > > > >
> > > > > I am aware that developers (especially java) are conservative, but I ask you to give us a chance, so that we make it easy for developers to join the project, and to allow us to drop the parallel effort of packaging to production and fixing the broken development environment.
> > > > >
> > > > > A special thanks to developers who took the time to test and provide feedback before the merge:
> > > > > - Yaniv Bronheim
> > > > > - Moti Asayag
> > > > > - Limor Gavish
> > > > > - Sharad Mishra
> > > > > - Ofer Schreiber
> > > > >
> > > > > We are hoping that after migration you will find this environment useful and friendly,
> > > > >
> > > > > Sandro Bonazzola,
> > > > > Alex Lourie,
> > > > > Alon Bar-Lev.
> > > > >
> > > > > [1]
> > > > > http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD
> > > > > _______________________________________________
> > > > > Engine-devel mailing list
> > > > > Engine-devel at ovirt.org
> > > > > http://lists.ovirt.org/mailman/listinfo/engine-devel
> > > > >
> > > > _______________________________________________
> > > > Engine-devel mailing list
> > > > Engine-devel at ovirt.org
> > > > http://lists.ovirt.org/mailman/listinfo/engine-devel
> > > >

From emesika at redhat.com  Tue May 14 10:32:39 2013
From: emesika at redhat.com (Eli Mesika)
Date: Tue, 14 May 2013 06:32:39 -0400 (EDT)
Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
In-Reply-To: <1732269511.831688.1368512324398.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <188337855.1067044.1368492341205.JavaMail.root@redhat.com> <1732269511.831688.1368512324398.JavaMail.root@redhat.com>
Message-ID: <1515903557.1242775.1368527559390.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "Eli Mesika"
> Cc: "engine-devel" , "arch"
> Sent: Tuesday, May 14, 2013 9:18:44 AM
> Subject: Re: [Engine-devel] [ANN] New development environment for ovirt-engine
>
> ----- Original Message -----
> > From: "Eli Mesika"
> > To: "Alon Bar-Lev"
> > Cc: "engine-devel" , "arch"
> > Sent: Tuesday, May 14, 2013 3:45:41 AM
> > Subject: Re: [Engine-devel] [ANN] New development environment for ovirt-engine
> >
> > ----- Original Message -----
> > > From: "Alon Bar-Lev"
> > > To: "engine-devel"
> > > Cc: "arch" , "Sharad Mishra" , "Limor Gavish"
> > > Sent: Sunday, May 12, 2013 2:52:51 PM
> > > Subject: [Engine-devel] [ANN] New development environment for ovirt-engine
> > >
> > > Hello all ovirt-engine developers,
> > >
> > > When I first joined the ovirt project, it took me about two weeks to set up
> > > a development environment. I needed to work on a bug related to host-deploy, so I needed an environment that could use the ssh, PKI, vdsm-bootstrap and communicate with vdsm using SSL; this was virtually impossible to do without tweaking the product in a way that is so different from production use that I cannot guarantee that whatever is tested in development will actually work in production.
> > >
> > > I peeked at the installation script in the hope that I could create a partial environment similar to production, but I found that the packaging implementation makes too many assumptions and is very difficult to adapt. The fact that I do not use fedora/rhel for my development made it even worse.
> > >
> > > I had no other option than to create rpms after each of my changes and test each in a real production-like setup.
> > >
> > > It was obvious to me that the manual customization of developers to achieve a working product will eventually break as the product grows and moves away from being developer friendly to production friendly. For example, product defaults cannot be those which serve developers, but those which serve production best, and having a valid PKI setup cannot be optional any more as components do need to use it. Same for the location of files and configuration; for example, if we write a pluggable infrastructure for branding, we cannot damage the interface just because developers run the product in their own manual customization.
> > >
> > > I took the opportunity handed to me to port the ovirt-engine to other distributions in order to provide a development environment that is similar to a production setup. Together with Sandro Bonazzola and Alex Lourie we re-wrote the whole installation of the product, which can also be used to set up the desired development environment.
> > >
> > > Within this environment the product is set up using the same tools and configuration as in production, while the process does not require special privileges nor changes the state of the developer machine.
> > >
> > > A complete documentation is available[1]. I preferred to use a README within the source tree as wikis tend to quickly become obsolete, while documentation within the source tree can be modified by the commit that introduces a change. I will redirect to this file from the current wiki once the site is up.
> > >
> > > In a nutshell, after installing prerequisites, build and install the product using:
> > >
> > > $ make clean install-dev PREFIX=$HOME/ovirt-engine
> > >
> > > This will run maven and create a product installation at $HOME/ovirt-engine
> > > Next, a setup phase is required just like in production, to initialize configuration and database:
> > >
> > > $ $HOME/ovirt-engine/bin/engine-setup-2
> > >
> > > You now have a fully functional product, including PKI, SSL, host-deploy, tools.
> > > No manual database updates are required, no loss of functionality.
> > >
> > > All that is left is to start the engine service:
> > >
> > > $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start
> > >
> > > Access to application:
> > > http://localhost:8080
> > > https://localhost:8443
> > > Debugging port is opened at port 8787.
> > >
> > > Further information exists in the documentation[1].
> > >
> > > There are several inherent benefits of the new environment; the major one is the ability to manage several environments in parallel on the same host. For example, if we develop two separate features on two branches we can install the product into $HOME/ovirt-engine-feature1 and $HOME/ovirt-engine-feature2 and have a separate database for each; if we modify the ports jboss is listening to, we can run two instances of the engine at the same time!
> >
> > It is not clear to me why working on 2 bugs needs 2 installations of the development environment.
> > If you have 2 different git branches and a separate database for each, it's enough; am I missing something?
>
> You have two different git branches, so you have two different binaries out of the build, right?
> These binaries should reside somewhere.
> Currently you probably use maven deploy or something similar into a single jboss, without a proper environment, overriding the other branch's binaries each time.
> What if there is a change in configuration between the two? More prerequisites, a different service configuration, a different database schema?
>
> To make it easy, you can just duplicate the development environment, and be sure you are up and running.
>
> If you still want to take the chance you can... just make install-dev PREFIX=$HOME/same-place and you will have the behavior of overriding the same environment, and hoping for the best.

That's good, thanks

> > I used to create a git branch with the name of the BZ# and use the create_db.sh script to create a new database with the BZ# name.
> > Is this possible in the new method?
>
> Yes.
> You create an empty database manually, let's say engine-branch1.
> Then you install the environment to its own location, let's say make install-dev PREFIX=$HOME/ovirt-engine-branch1
> When running engine-setup-2 you instruct the setup to use the engine-branch1 database.
> This database will be created out of the new branch's create_db.sh; create_db.sh should not be run manually now.
>
> However, if you insist on creating a database manually and modifying an existing environment to use the new database, it is possible as well.
> Edit $PREFIX/etc/ovirt-engine/engine.conf.d/10-setup-database.conf, specify where the database is, then start the engine service (a sketch of such a file appears below).

Great, thanks

> > Also, does this mean that I will have to create/configure a new workspace for eclipse each time I am starting to work on a new bug?
>
> This I do not know. How does it work now when you switch branches?

I will check that, maybe this is unrelated ...
Thanks again, will test that ASAP

> > >
> > > We will be happy to work with all developers to assist in porting into the new development environment; the simplest is to create a new database for this effort. Moti has a sequence for converting the existing database owned by postgres to be owned by the engine. Moti, can you please share that?
> > >
> > > We are sure there are missing bits; we will be happy to know these so we can improve.
> > >
> > > I am aware that developers (especially java) are conservative, but I ask you to give us a chance, so that we make it easy for developers to join the project, and to allow us to drop the parallel effort of packaging to production and fixing the broken development environment.
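[Editorial note: a minimal sketch of what such a 10-setup-database.conf could contain. The key names here are an assumption based on the engine's ENGINE_DB_* convention; verify against the file that engine-setup-2 actually generates before relying on them:]

# $PREFIX/etc/ovirt-engine/engine.conf.d/10-setup-database.conf
# Key names are an assumption - check the generated file.
ENGINE_DB_HOST=localhost
ENGINE_DB_PORT=5432
ENGINE_DB_USER=engine
ENGINE_DB_PASSWORD=engine
ENGINE_DB_DATABASE=engine-branch1

Pointing an environment at a different database is then just a matter of editing ENGINE_DB_DATABASE and restarting the engine service.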
> > >
> > > A special thanks to developers who took the time to test and provide feedback before the merge:
> > > - Yaniv Bronheim
> > > - Moti Asayag
> > > - Limor Gavish
> > > - Sharad Mishra
> > > - Ofer Schreiber
> > >
> > > We are hoping that after migration you will find this environment useful and friendly,
> > >
> > > Sandro Bonazzola,
> > > Alex Lourie,
> > > Alon Bar-Lev.
> > >
> > > [1]
> > > http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD
> > > _______________________________________________
> > > Engine-devel mailing list
> > > Engine-devel at ovirt.org
> > > http://lists.ovirt.org/mailman/listinfo/engine-devel
> > >

From masayag at redhat.com  Tue May 14 10:54:21 2013
From: masayag at redhat.com (Moti Asayag)
Date: Tue, 14 May 2013 06:54:21 -0400 (EDT)
Subject: [ANN] New development environment for ovirt-engine
In-Reply-To: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
References: <9423451.273926.1368359571790.JavaMail.root@redhat.com>
Message-ID: <248944163.1356174.1368528861600.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Alon Bar-Lev"
> To: "engine-devel"
> Cc: "Yaniv Bronheim" , "Moti Asayag" , "Limor Gavish" , "Sharad Mishra" , "Alex Lourie" , "Sandro Bonazzola" , "arch" , "Ofer Schreiber"
> Sent: Sunday, May 12, 2013 2:52:51 PM
> Subject: [ANN] New development environment for ovirt-engine
>
> Hello all ovirt-engine developers,
>
> When I first joined the ovirt project, it took me about two weeks to set up a development environment. I needed to work on a bug related to host-deploy, so I needed an environment that could use the ssh, PKI, vdsm-bootstrap and communicate with vdsm using SSL; this was virtually impossible to do without tweaking the product in a way that is so different from production use that I cannot guarantee that whatever is tested in development will actually work in production.
>
> I peeked at the installation script in the hope that I could create a partial environment similar to production, but I found that the packaging implementation makes too many assumptions and is very difficult to adapt. The fact that I do not use fedora/rhel for my development made it even worse.
>
> I had no other option than to create rpms after each of my changes and test each in a real production-like setup.
>
> It was obvious to me that the manual customization of developers to achieve a working product will eventually break as the product grows and moves away from being developer friendly to production friendly. For example, product defaults cannot be those which serve developers, but those which serve production best, and having a valid PKI setup cannot be optional any more as components do need to use it. Same for the location of files and configuration; for example, if we write a pluggable infrastructure for branding, we cannot damage the interface just because developers run the product in their own manual customization.
>
> I took the opportunity handed to me to port the ovirt-engine to other distributions in order to provide a development environment that is similar to a production setup. Together with Sandro Bonazzola and Alex Lourie we re-wrote the whole installation of the product, which can also be used to set up the desired development environment.
>
> Within this environment the product is set up using the same tools and configuration as in production, while the process does not require special privileges nor changes the state of the developer machine.
>
> A complete documentation is available[1]. I preferred to use a README within the source tree as wikis tend to quickly become obsolete, while documentation within the source tree can be modified by the commit that introduces a change. I will redirect to this file from the current wiki once the site is up.
>
> In a nutshell, after installing prerequisites, build and install the product using:
>
> $ make clean install-dev PREFIX=$HOME/ovirt-engine
>
> This will run maven and create a product installation at $HOME/ovirt-engine
> Next, a setup phase is required just like in production, to initialize configuration and database:
>
> $ $HOME/ovirt-engine/bin/engine-setup-2
>
> You now have a fully functional product, including PKI, SSL, host-deploy, tools.
> No manual database updates are required, no loss of functionality.
>
> All that is left is to start the engine service:
>
> $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py start
>
> Access to application:
> http://localhost:8080
> https://localhost:8443
> Debugging port is opened at port 8787.
>
> Further information exists in the documentation[1].
>
> There are several inherent benefits of the new environment; the major one is the ability to manage several environments in parallel on the same host. For example, if we develop two separate features on two branches we can install the product into $HOME/ovirt-engine-feature1 and $HOME/ovirt-engine-feature2 and have a separate database for each; if we modify the ports jboss is listening to, we can run two instances of the engine at the same time!
>
> We will be happy to work with all developers to assist in porting into the new development environment; the simplest is to create a new database for this effort. Moti has a sequence for converting the existing database owned by postgres to be owned by the engine. Moti, can you please share that?
>

Reusing an existing DB schema requires a bit more work, since the dev-env installation advises using the database as a regular user, not a superuser like the 'postgres' user that originally created the database. Therefore, if one wishes to use the user 'engine' as in the instructions, the current schema owner and the ownership of all of its objects need to be changed.

The easiest path I found for that purpose is:
1. Create a dump of the database using the script in [1].
2. Rename the owner in the dump file to the new owner (s/OWNER TO postgres/OWNER TO engine/g).
3. Import the dump file to the new DB owned by the engine user using [2] (provide the -r flag to drop the former db).

(A short shell sketch of this sequence appears below, after the quoted text.)

[1] ovirt-engine/backend/manager/dbscripts/backup.sh
[2] ovirt-engine/backend/manager/dbscripts/restore.sh

> We are sure there are missing bits; we will be happy to know these so we can improve.
>
> I am aware that developers (especially java) are conservative, but I ask you to give us a chance, so that we make it easy for developers to join the project, and to allow us to drop the parallel effort of packaging to production and fixing the broken development environment.
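[Editorial note: a minimal shell sketch of Moti's three steps above. The backup/restore flags are assumptions for illustration; only the sed substitution and the -r flag come from the description itself, so check each script's usage output before running:]

$ # 1. dump the existing postgres-owned database (flags assumed)
$ ovirt-engine/backend/manager/dbscripts/backup.sh -d engine -f /tmp/engine.dump.sql
$ # 2. rename the owner in the dump
$ sed -i 's/OWNER TO postgres/OWNER TO engine/g' /tmp/engine.dump.sql
$ # 3. import into the engine-owned database, dropping the former db
$ ovirt-engine/backend/manager/dbscripts/restore.sh -u engine -d engine -f /tmp/engine.dump.sql -r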
>
> A special thanks to developers who took the time to test and provide feedback before the merge:
> - Yaniv Bronheim
> - Moti Asayag
> - Limor Gavish
> - Sharad Mishra
> - Ofer Schreiber
>
> We are hoping that after migration you will find this environment useful and friendly,
>
> Sandro Bonazzola,
> Alex Lourie,
> Alon Bar-Lev.
>
> [1]
> http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD
>

From leonardo.bianconi at eldorado.org.br  Tue May 14 18:05:01 2013
From: leonardo.bianconi at eldorado.org.br (Leonardo Bianconi)
Date: Tue, 14 May 2013 18:05:01 +0000
Subject: oVirt on IBM POWER (PPC64) - new feature contributors
Message-ID: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br>

Dear all,

We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima.

We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM.

We would be happy to hear opinions and comments.

About libosinfo:
=============

In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), which occurred in November 2012, it was suggested that integrating libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future.

Is this approach still valid? If so, when will it be available? It seems to be a dependency for the oVirt "Engine_support_for_PPC64" feature implementation.

Best regards,

Leonardo Bianconi / Vitor Lima

From vfeenstr at redhat.com  Wed May 15 04:53:25 2013
From: vfeenstr at redhat.com (Vinzenz Feenstra)
Date: Wed, 15 May 2013 06:53:25 +0200
Subject: oVirt on IBM POWER (PPC64) - new feature contributors
In-Reply-To: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br>
References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br>
Message-ID: <519314C5.20706@redhat.com>

On 05/14/2013 08:05 PM, Leonardo Bianconi wrote:
> Dear all,
>
> We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima.
>
> We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM.
>
> We would be happy to hear opinions and comments.
>
> About libosinfo:
> =============
>
> In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), which occurred in November 2012, it was suggested that integrating libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future.
>
> Is this approach still valid? If so, when will it be available? It seems to be a dependency for the oVirt "Engine_support_for_PPC64" feature implementation.
Roy is the best to answer this I guess :-)
>
> Best regards,
>
> Leonardo Bianconi / Vitor Lima
>
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch

--
Regards,

Vinzenz Feenstra | Senior Software Engineer
RedHat Engineering Virtualization R & D
Phone: +420 532 294 625
IRC: vfeenstr or evilissimo

Better technology. Faster innovation. Powered by community collaboration.
See how it works at redhat.com From mburns at redhat.com Wed May 15 14:23:10 2013 From: mburns at redhat.com (Mike Burns) Date: Wed, 15 May 2013 10:23:10 -0400 Subject: oVirt Weekly Meeting Minutes -- 2013-05-15 Message-ID: <51939A4E.7090507@redhat.com> Minutes: http://ovirt.org/meetings/ovirt/2013/ovirt.2013-05-15-14.04.html Minutes (text): http://ovirt.org/meetings/ovirt/2013/ovirt.2013-05-15-14.04.txt Log: http://ovirt.org/meetings/ovirt/2013/ovirt.2013-05-15-14.04.log.html ============================ #ovirt: oVirt Weekly Meeting ============================ Meeting started by mburns at 14:04:53 UTC. The full logs are available at http://ovirt.org/meetings/ovirt/2013/ovirt.2013-05-15-14.04.log.html . Meeting summary --------------- * agenda and roll call (mburns, 14:05:06) * most of the maintainers aren't available today, so a slightly ad-hoc agenda... (mburns, 14:05:41) * need better status communication on features (mburns, 14:14:00) * need to review features for must/should release criteria (mburns, 14:14:12) * ACTION: mburns to update the release management page with as many details as he knows (mburns, 14:14:34) * ACTION: mburns to reach out to other maintainers to make sure their pages are updated as well (mburns, 14:15:19) * (mburns, 14:16:14) * Other Topics (mburns, 14:16:20) Meeting ended at 14:22:15 UTC. Action Items ------------ * mburns to update the release management page with as many details as he knows * mburns to reach out to other maintainers to make sure their pages are updated as well Action Items, by person ----------------------- * mburns * mburns to update the release management page with as many details as he knows * mburns to reach out to other maintainers to make sure their pages are updated as well * **UNASSIGNED** * (none) People Present (lines said) --------------------------- * mburns (27) * dneary (19) * ovirtbot (2) * jb_netapp (1) Generated by `MeetBot`_ 0.1.4 .. _`MeetBot`: http://wiki.debian.org/MeetBot From dneary at redhat.com Wed May 15 15:21:46 2013 From: dneary at redhat.com (Dave Neary) Date: Wed, 15 May 2013 17:21:46 +0200 Subject: 3.3 release engineering In-Reply-To: <518834D7.3030708@redhat.com> References: <518834D7.3030708@redhat.com> Message-ID: <5193A80A.4020909@redhat.com> Hi, On 05/07/2013 12:55 AM, Dave Neary wrote: > http://www.ovirt.org/oVirt_3.3_release-management > Can I suggest one improvement which could be done this week, please? > There's no way of knowing from this page which features in the MUST, > SHOULD lists are done and just awaiting release, in need of QE and > testing, being actively worked on (and if so, by who?) and which > features are there aspirationally and won't be worked on unless someone > volunteers to pick them up. > If you're working on a feature that's in this page and it's not > finished, would you mind adding a link to the feature page (if there is > one) and adding your name to the feature, please? And if the feature is > done, please link to the feature page, or some other resource that > allows people to use and test it - and (if possible) the changelog > entry/gerrit where the patch was included. It's been a week since I sent this email, and in that time there have been 3 edits to the page - 2 adding new features to the infra section, and one linking to a series of UX patches for the dashboard. However, there has been no progress towards indicating whether the features listed as "MUST" or "SHOULD" are actually ready for release/testing at this point. 
We're less than 2 weeks from feature freeze, 3 weeks from a test day, with no clear test plan, and no indication at this stage of whether features listed in the "Features" section are ready to go, or are still in progress.

If you have added a feature here in the past, please go back and add a sub-list with any known bugs, and a line with the feature's status (planned, in progress, complete but untested, ready to go). This will greatly assist in understanding whether features are going into the 3.3 release, or are instead candidates for bumping to 3.4.

Underneath the "No blocker bugs" and similar items, I have also added a sub-list section where we can list currently known bugs in each category. It would be a good idea, for example, to either link to individual bugs or to a saved search in Bugzilla for the various categories (release blockers, data corruption issues, etc).

Thanks!
Dave.

--
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From dneary at redhat.com  Wed May 15 15:24:30 2013
From: dneary at redhat.com (Dave Neary)
Date: Wed, 15 May 2013 17:24:30 +0200
Subject: oVirt on IBM POWER (PPC64) - new feature contributors
In-Reply-To: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br>
References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br>
Message-ID: <5193A8AE.5000600@redhat.com>

Hi,

On 05/14/2013 08:05 PM, Leonardo Bianconi wrote:
> We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima.

Welcome!

> We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM.

> About libosinfo:
> =============
>
> In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), which occurred in November 2012, it was suggested that integrating libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future.
>
> Is this approach still valid? If so, when will it be available? It seems to be a dependency for the oVirt "Engine_support_for_PPC64" feature implementation.

This is great news. I don't know who, specifically, has been working on this issue in IBM - perhaps Adam Litke (CCed) can update you on the progress that has been made.

Cheers,
Dave.

--
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From kwade at redhat.com  Fri May 17 04:39:34 2013
From: kwade at redhat.com (Karsten 'quaid' Wade)
Date: Thu, 16 May 2013 21:39:34 -0700
Subject: Outage :: lists.ovirt.org/resources.ovirt.org :: 2013-05-17 04:40 UTC
Message-ID: <5195B486.5030002@redhat.com>

There is about to be an outage of resources.ovirt.org/lists.ovirt.org for a five-minute reboot.

The outage is occurring directly after I send this email (and hopefully after it's already delivered to you.)

== Details ==

We have older versions of some packages that have security fixes not yet applied.

I'm updating the kernel and other packages, then rebooting to remove the risk before this announcement gets out.
== Affected services ==

* lists.ovirt.org
* resources.ovirt.org
* ovirtbot

== Not-affected services ==

* gerrit.ovirt.org
* jenkins.ovirt.org
* www.ovirt.org

== Future plans ==

Get the infra at ovirt.org list or team on an advisory channel so we are aware of package updates. Then coordinate planned downtime to upgrade.

--
Karsten 'quaid' Wade            http://TheOpenSourceWay.org
.^\                             http://community.redhat.com
@quaid (identi.ca/twitter/IRC)  \v'   gpg: AD0E0C41

From kwade at redhat.com  Fri May 17 04:46:34 2013
From: kwade at redhat.com (Karsten 'quaid' Wade)
Date: Thu, 16 May 2013 21:46:34 -0700
Subject: Outage :: lists.ovirt.org/resources.ovirt.org :: 2013-05-17 04:40 UTC
In-Reply-To: <5195B486.5030002@redhat.com>
References: <5195B486.5030002@redhat.com>
Message-ID: <5195B62A.1060204@redhat.com>

This outage is complete, all seems to be well.

If you notice a problem, contact us via infra at ovirt.org.

On 05/16/2013 09:39 PM, Karsten 'quaid' Wade wrote:
> There is about to be an outage of resources.ovirt.org/lists.ovirt.org
> for a five-minute reboot.
>
> The outage is occurring directly after I send this email (and hopefully
> after it's already delivered to you.)
>
> == Details ==
>
> We have older versions of some packages that have security fixes not yet
> applied.
>
> I'm updating the kernel and other packages, then rebooting to remove the
> risk before this announcement gets out.
>
> == Affected services ==
>
> * lists.ovirt.org
> * resources.ovirt.org
> * ovirtbot
>
> == Not-affected services ==
>
> * gerrit.ovirt.org
> * jenkins.ovirt.org
> * www.ovirt.org
>
> == Future plans ==
>
> Get the infra at ovirt.org list or team on an advisory channel so we are
> aware of package updates. Then coordinate planned downtime to upgrade.
>
> _______________________________________________
> Infra mailing list
> Infra at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/infra
>

--
Karsten 'quaid' Wade            http://TheOpenSourceWay.org
.^\                             http://community.redhat.com
@quaid (identi.ca/twitter/IRC)  \v'   gpg: AD0E0C41

From mburns at redhat.com  Fri May 17 11:31:41 2013
From: mburns at redhat.com (Mike Burns)
Date: Fri, 17 May 2013 07:31:41 -0400
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
Message-ID: <5196151D.2050307@redhat.com>

As part of the move to a more universal oVirt Node during the 3.3 time frame[1], a separate repo is needed for a plugin to allow oVirt Node to communicate with oVirt Engine [2].

In order to provide this plugin, I need a gerrit repo created, and I also need consensus on a name.

Name proposals:
- ovirt-node-plugin-vdsm
- ovirt-node-plugin-engine
- ovirt-engine-node-plugin

I'm rather indifferent to what we call this, so I figured I'd include the arch@ list to get feedback.
Thanks

Mike

[1] http://www.ovirt.org/Features/Universal_Image
[2] http://www.ovirt.org/Features/Node_vdsm_plugin

From dougsland at redhat.com  Fri May 17 14:02:34 2013
From: dougsland at redhat.com (Douglas Schilling Landgraf)
Date: Fri, 17 May 2013 10:02:34 -0400
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <5196151D.2050307@redhat.com>
References: <5196151D.2050307@redhat.com>
Message-ID: <5196387A.9070604@redhat.com>

On 05/17/2013 07:31 AM, Mike Burns wrote:
> As part of the move to a more universal oVirt Node during the 3.3 time
> frame[1], a separate repo is needed for a plugin to allow oVirt Node to
> communicate with oVirt Engine [2].
>
> In order to provide this plugin, I need a gerrit repo created, and I also
> need consensus on a name.
>
> Name proposals:
> - ovirt-node-plugin-vdsm
I would go with that one.

> I'm rather indifferent to what we call this, so I figured I'd include
> the arch@ list to get feedback.
>
> Thanks
>
> Mike
>
> [1] http://www.ovirt.org/Features/Universal_Image
> [2] http://www.ovirt.org/Features/Node_vdsm_plugin
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch

--
Cheers
Douglas

From danken at redhat.com  Sun May 19 12:21:11 2013
From: danken at redhat.com (Dan Kenigsberg)
Date: Sun, 19 May 2013 15:21:11 +0300
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <5196387A.9070604@redhat.com>
References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com>
Message-ID: <20130519122111.GC2144@redhat.com>

On Fri, May 17, 2013 at 10:02:34AM -0400, Douglas Schilling Landgraf wrote:
> On 05/17/2013 07:31 AM, Mike Burns wrote:
> >As part of the move to a more universal oVirt Node during the 3.3 time
> >frame[1], a separate repo is needed for a plugin to allow oVirt Node to
> >communicate with oVirt Engine [2].
> >
> >In order to provide this plugin, I need a gerrit repo created, and I also
> >need consensus on a name.
> >
> >Name proposals:
> >- ovirt-node-plugin-vdsm
> I would go with that one.

me2, though I do not have a strong opinion or explanation.

> >I'm rather indifferent to what we call this, so I figured I'd include
> >the arch@ list to get feedback.
> >
> >Thanks
> >
> >Mike
> >
> >[1] http://www.ovirt.org/Features/Universal_Image
> >[2] http://www.ovirt.org/Features/Node_vdsm_plugin

From liumbj at linux.vnet.ibm.com  Mon May 20 03:07:53 2013
From: liumbj at linux.vnet.ibm.com (Mei Liu)
Date: Mon, 20 May 2013 11:07:53 +0800
Subject: add blkIoTune support for a specific device at vm creation
Message-ID: <51999389.6010705@linux.vnet.ibm.com>

Hi all,

I would like to add blkIoTune support for a specific device at vm creation. The code parses the 'blkIoTune' description for block devices at vm creation time and adds an iotune tag accordingly.

e.g. Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for a block device will add the following to the domain xml for that device:

<iotune>
  <read_bytes_sec>6120000</read_bytes_sec>
  <total_iops_sec>800</total_iops_sec>
</iotune>

The patch is under review in http://gerrit.ovirt.org/#/c/14636/ .

Does the patch meet the requirements of the engine and the overall architecture? Are the new parameters properly placed? Any suggestions are welcome. TIA.
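[Editorial note: for readers who want to try this, a minimal sketch of how a drive entry carrying 'blkIoTune' might look among the vm-creation parameters. Only the 'blkIoTune' part is taken from the patch description above; the surrounding drive fields are illustrative assumptions, not the exact vdsm schema:]

    drive = {
        'device': 'disk',              # illustrative assumption
        'format': 'raw',               # illustrative assumption
        'path': '/path/to/image',      # illustrative assumption
        'blkIoTune': {                 # per-device I/O limits, as described above
            'read_bytes_sec': 6120000,
            'total_iops_sec': 800,
        },
    }

Keys omitted from 'blkIoTune' would presumably be left unset in the generated <iotune> element, falling back to libvirt's unlimited default.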
Best regards,

Mei Liu (Rose)

From lpeer at redhat.com  Mon May 20 11:49:18 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Mon, 20 May 2013 14:49:18 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <20121225122722.GG7274@redhat.com>
References: <20121225122722.GG7274@redhat.com>
Message-ID: <519A0DBE.4050804@redhat.com>

This is a summary of the thread so far (and the action items) -

- There is an agreement that we do not need a machine boot in the installation sequence.

- The current default behavior is to reboot after host installation (in Virt)

** We are going to change the current behavior in 3.3 and remove the reboot from the host installation flow **

- Today we have a flag in the REST API to avoid host reboot; we'll deprecate this flag since this is going to be the default behavior after the change (and booting after installation won't be available).

- Since host reboot is not needed in the host install flow, we avoid adding a VDSM verb for reboot at this point. The discussion of whether to implement such a verb via ssh or VDSM can be done in the context where the verb is going to be used.

Thanks, Livnat

On 12/25/2012 02:27 PM, Dan Kenigsberg wrote:
> Current condition:
> ==================
> The management network, named ovirtmgmt, is created during host bootstrap. It consists of a bridge device, connected to the network device that was used to communicate with Engine (nic, bonding or vlan). It inherits its ip settings from the latter device.
>
> Why Is the Management Network Needed?
> =====================================
> Understandably, some may ask why we need to have a management network - why having a host with IPv4 configured on it is not enough. The answer is twofold:
> 1. In oVirt, a network is an abstraction of the resources required for connectivity of a host for a specific usage. This is true for the management network just as it is for a VM network or a display network. The network entity is the key for adding/changing nics and IP address.
> 2. On many occasions (such as small setups) the management network is used as a VM/display network as well.
>
> Problems in current connectivity:
> ================================
> According to alonbl of ovirt-host-deploy fame, and with no conflict to my own experience, creating the management network is the most fragile, error-prone step of bootstrap.
>
> Currently it always creates a bridged network (even if the DC requires a non-bridged ovirtmgmt), it knows nothing about the defined MTU for ovirtmgmt, it uses ping to guess on top of which device to build (and thus requires Vdsm-to-Engine reverse connectivity), and is the sole remaining user of the addNetwork/vdsm-store-net-conf scripts.
>
> Suggested feature:
> ==================
> Bootstrap would avoid creating a management network. Instead, after bootstrapping a host, Engine would send a getVdsCaps probe to the installed host, receiving a complete picture of the network configuration on the host. Among this picture is the device that holds the host's management IP address.
>
> Engine would send a setupNetworks command to generate ovirtmgmt with details devised from this picture, and according to the DC definition of ovirtmgmt. For example, if Vdsm reports:
>
> - vlan bond4.3000 has the host's IP, configured to use dhcp.
> - bond4 comprises eth2 and eth3
> - ovirtmgmt is defined as a VM network with MTU 9000
>
> then Engine sends the likes of:
> setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4, bonding=bond4: {eth2,eth3}, MTU=9000)
>
> A call to setSafeNetConfig would wrap the network configuration up.
>
> Currently, the host undergoes a reboot as the last step of bootstrap. This allows us to verify immediately if the host would be accessible post-boot using its new network configuration. If we want to maintain this, Engine would need to send a fenceNode request.
>
> Benefits:
> =========
> - Simplified bootstrapping
> - Simplified ovirt-node registration (similar ovirtmgmt-generation logic lies there).
> - Host installation ends with an ovirtmgmt network that matches the DC definition (bridged-ness, mtu, vlan).
> - vdsm-to-engine connectivity is not required.
>
> Drawbacks:
> ==========
> - We need to implement new Engine logic for devising the ovirtmgmt definition out of getVdsCaps output.
> - ... your input is welcome here
>
> Missing:
> ========
> A wiki feature page for this new behavior.
>

From alonbl at redhat.com  Mon May 20 11:52:26 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 20 May 2013 07:52:26 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <519A0DBE.4050804@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com>
Message-ID: <1058710849.2424969.1369050746339.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Livnat Peer"
> To: "Dan Kenigsberg"
> Cc: "arch" , alonbl at redhat.com, "Simon Grinberg" , "Andrew Cathrow" , "Moti Asayag" , "Barak Azulay"
> Sent: Monday, May 20, 2013 2:49:18 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> This is a summary of the thread so far (and the action items) -
>
> - There is an agreement that we do not need a machine boot in the installation sequence.
>
> - The current default behavior is to reboot after host installation (in Virt)
>
> ** We are going to change the current behavior in 3.3 and remove the reboot from the host installation flow **
>
> - Today we have a flag in the REST API to avoid host reboot; we'll deprecate this flag since this is going to be the default behavior after the change (and booting after installation won't be available).
>
> - Since host reboot is not needed in the host install flow, we avoid adding a VDSM verb for reboot at this point. The discussion of whether to implement such a verb via ssh or VDSM can be done in the context where the verb is going to be used.
>
> Thanks, Livnat

ACK.

From alonbl at redhat.com  Mon May 20 11:58:05 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 20 May 2013 07:58:05 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <519A0DBE.4050804@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com>
Message-ID: <1387474947.2425969.1369051085926.JavaMail.root@redhat.com>

Hi,

Now another issue... ovirt-node.

In ovirt-node, the node already defines a bridge which is called br@INTERFACE@; this is done automatically.
The IP address of ovirt-node is assigned to that bridge, so we always have a bridge at ovirt-node.

I have the following useless code, most of it legacy... the question:
Can this also be automated by the new code on the engine side?
It should be, or things will break...
Thanks,
Alon

---

if (
    self.environment[odeploycons.VdsmEnv.OVIRT_NODE] and
    self._interfaceIsBridge(name=interface)
):
    nic = interface.replace('br', '', 1)
    self._removeBridge(
        name=interface,
        interface=nic,
    )
    interface = nic


def _removeBridge(self, name, interface):
    interface, vlanid = self._getVlanMasterDevice(name=interface)
    self.execute(
        (
            os.path.join(
                odeploycons.FileLocations.VDSM_DATA_DIR,
                'delNetwork',
            ),
            name,
            vlanid if vlanid is not None else '',
            '',    # bonding is not supported
            interface if interface is not None else '',
        ),
    )

    #
    # vdsm interface does not handle
    # ovirt node properly.
    # we should manually delete the
    # ifcfg file to avoid having a duplicate
    # bridge.
    #
    if self.environment[odeploycons.VdsmEnv.OVIRT_NODE]:
        ifcfg = '/etc/sysconfig/network-scripts/ifcfg-%s' % (
            name
        )
        if os.path.exists(ifcfg):
            from ovirtnode import ovirtfunctions
            ovirtfunctions.ovirt_safe_delete_config(ifcfg)

----- Original Message -----
> From: "Livnat Peer"
> To: "Dan Kenigsberg"
> Cc: "arch" , alonbl at redhat.com, "Simon Grinberg" , "Andrew Cathrow" , "Moti Asayag" , "Barak Azulay"
> Sent: Monday, May 20, 2013 2:49:18 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> This is a summary of the thread so far (and the action items) -
>
> - There is an agreement that we do not need a machine boot in the installation sequence.
>
> - The current default behavior is to reboot after host installation (in Virt)
>
> ** We are going to change the current behavior in 3.3 and remove the reboot from the host installation flow **
>
> - Today we have a flag in the REST API to avoid host reboot; we'll deprecate this flag since this is going to be the default behavior after the change (and booting after installation won't be available).
>
> - Since host reboot is not needed in the host install flow, we avoid adding a VDSM verb for reboot at this point. The discussion of whether to implement such a verb via ssh or VDSM can be done in the context where the verb is going to be used.
>
> Thanks, Livnat
>
> On 12/25/2012 02:27 PM, Dan Kenigsberg wrote:
> > Current condition:
> > ==================
> > The management network, named ovirtmgmt, is created during host bootstrap. It consists of a bridge device, connected to the network device that was used to communicate with Engine (nic, bonding or vlan). It inherits its ip settings from the latter device.
> >
> > Why Is the Management Network Needed?
> > =====================================
> > Understandably, some may ask why we need to have a management network - why having a host with IPv4 configured on it is not enough. The answer is twofold:
> > 1. In oVirt, a network is an abstraction of the resources required for connectivity of a host for a specific usage. This is true for the management network just as it is for a VM network or a display network. The network entity is the key for adding/changing nics and IP address.
> > 2. On many occasions (such as small setups) the management network is used as a VM/display network as well.
> >
> > Problems in current connectivity:
> > ================================
> > According to alonbl of ovirt-host-deploy fame, and with no conflict to my own experience, creating the management network is the most fragile, error-prone step of bootstrap.
> > > > Currently it always creates a bridged network (even if the DC requires a > > non-bridged ovirtmgmt), it knows nothing about the defined MTU for > > ovirtmgmt, it uses ping to guess on top of which device to build (and > > thus requires Vdsm-to-Engine reverse connectivity), and is the sole > > remaining user of the addNetwork/vdsm-store-net-conf scripts. > > > > Suggested feature: > > ================== > > Bootstrap would avoid creating a management network. Instead, after > > bootstrapping a host, Engine would send a getVdsCaps probe to the > > installed host, receiving a complete picture of the network > > configuration on the host. Among this picture is the device that holds > > the host's management IP address. > > > > Engine would send setupNetwork command to generate ovirtmgmt with > > details devised from this picture, and according to the DC definition of > > ovirtmgmt. For example, if Vdsm reports: > > > > - vlan bond4.3000 has the host's IP, configured to use dhcp. > > - bond4 is comprises eth2 and eth3 > > - ovirtmgmt is defined as a VM network with MTU 9000 > > > > then Engine sends the likes of: > > setupNetworks(ovirtmgmt: {bridged=True, vlan=3000, iface=bond4, > > bonding=bond4: {eth2,eth3}, MTU=9000) > > > > A call to setSafeNetConfig would wrap the network configuration up. > > > > Currently, the host underegoes a reboot as the last step of bootstrap. > > This allows us to verify immediately if the host would be accessible > > post-boot using its new network configuration. If we want to maintain > > this, Engine would need to send a fenceNode request. > > > > Benefits: > > ========= > > - Simplified bootstrapping > > - Simplified ovirt-node registration (similar ovirtmgmt-generation logic > > lies there). > > - Host installation ends with an ovirtmgmt network that matches DC > > definition (bridged-ness, mtu, vlan). > > - vdsm-to-engine connectivity is not required. > > > > Drawbacks: > > ========== > > - We need to implement new Engine logic for devising ovirtmgmt definition > > out of > > getVdsCaps output. > > - ... your input is welcome here > > > > Missing: > > ======== > > A wiki feature page for this new behavior. > > > > > > From sgrinber at redhat.com Mon May 20 12:28:12 2013 From: sgrinber at redhat.com (Simon Grinberg) Date: Mon, 20 May 2013 08:28:12 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> Message-ID: <22126123.161.1369052799793.JavaMail.javamailuser@localhost> ----- Original Message ----- > From: "Alon Bar-Lev" > To: "Moti Asayag" > Cc: "arch" > Sent: Monday, May 20, 2013 7:58:05 AM > Subject: Re: feature suggestion: initial generation of management network > > Hi, > > Now another issue... ovirt-node. > > In ovirt-node, the node already defines a bridge which is called > br at INTERFACE@, this is done automatically. > The IP address of ovirt-node is assigned to that bridge, so we always > have a bridge at ovirt-node. > > I have the following useless in my code, most is legacy... the > question... > Can this also be automated by the new code at engine side? > It should or things will break... I don't see why the flows should be kept different - so we should find a way for doing the same with the nodes. But I think to avoid confusion it needs another thread now the the previous has converged. 
> [...]
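To make the derivation discussed above concrete, here is a minimal sketch, in Python, of the mapping from a getVdsCaps-like picture of the host to the attributes of a setupNetworks call. The real logic would live in Engine (which is Java); the dict layout and key names here ('vlans', 'bondings', 'nics', 'addr', 'slaves', 'bootproto') are illustrative assumptions modeled on the thread's example, not the actual API.

---

def devise_ovirtmgmt(caps, mgmt_ip, dc_def):
    """Return (attrs, bondings) for a setupNetworks(ovirtmgmt, ...) call."""
    attrs = {'bridged': dc_def['bridged'], 'mtu': dc_def['mtu']}
    bondings = {}

    # Find the device that currently holds the management IP address;
    # it may be a vlan (e.g. bond4.3000), a bonding or a plain nic.
    for kind in ('vlans', 'bondings', 'nics'):
        for name, dev in caps.get(kind, {}).items():
            if dev.get('addr') == mgmt_ip:
                if kind == 'vlans':
                    # 'bond4.3000' -> underlying iface 'bond4', vlan tag '3000'
                    iface, attrs['vlan'] = name.rsplit('.', 1)
                else:
                    iface = name
                if iface in caps.get('bondings', {}):
                    attrs['bonding'] = iface
                    bondings[iface] = {
                        'nics': caps['bondings'][iface]['slaves'],
                    }
                else:
                    attrs['nic'] = iface
                # Preserve the current addressing scheme (dhcp vs. static).
                if dev.get('bootproto') == 'dhcp':
                    attrs['bootproto'] = 'dhcp'
                return attrs, bondings
    raise LookupError('no device holds %s' % mgmt_ip)

---

With the example reported in the thread (bond4.3000 holding the host's IP over dhcp, bond4 comprising eth2 and eth3, ovirtmgmt defined with MTU 9000), this would yield roughly {bridged: True, vlan: '3000', bonding: 'bond4', mtu: 9000, bootproto: 'dhcp'} plus {'bond4': {'nics': ['eth2', 'eth3']}}, matching the setupNetworks invocation sketched earlier.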
From alonbl at redhat.com Mon May 20 12:31:25 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 20 May 2013 08:31:25 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <22126123.161.1369052799793.JavaMail.javamailuser@localhost>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <22126123.161.1369052799793.JavaMail.javamailuser@localhost>
Message-ID: <1177251918.2434368.1369053085602.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Simon Grinberg"
> To: "Alon Bar-Lev"
> Cc: "arch" , "Moti Asayag"
> Sent: Monday, May 20, 2013 3:28:12 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
>
> I don't see why the flows should be kept different - so we should find a way
> for doing the same with the nodes.

Because, as far as I know, in the current implementation we did not take the ovirt-node specific behavior into account, right Moti?

> But I think, to avoid confusion, it needs
> another thread now that the previous one has converged.

I removed all other people from the CC...

> [...]

From lpeer at redhat.com Mon May 20 12:55:42 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Mon, 20 May 2013 15:55:42 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1387474947.2425969.1369051085926.JavaMail.root@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com>
Message-ID: <519A1D4E.7070607@redhat.com>

On 05/20/2013 02:58 PM, Alon Bar-Lev wrote:
> Hi,
>
> Now another issue... ovirt-node.
>
> In ovirt-node, the node already defines a bridge which is called br@INTERFACE@; this is done automatically.
> The IP address of ovirt-node is assigned to that bridge, so we always have a bridge at ovirt-node.
>
> I have the following useless code; most of it is legacy... the question:
> Can this also be automated by the new code at engine side?
> It should, or things will break...
>

For ovirt node -

For images 3.3 and above the code below can be removed; we will make sure that the ovirt-node-plugin-vdsm does not create the brXXX bridges (or, if we have no choice, removes them).
For images 3.2 and below we still need this code, because oVirt node creates brXXX bridges and the engine does not configure the network automatically if a bridge exists on the interface.

The down side to the above is that for management networks that are configured in the engine as bridgeless, the management network on the host would still be bridged and thus will be marked as not-in-sync.

Livnat

> [...]
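A minimal sketch - again in Python, although the real engine logic is Java - of the install-time rule Livnat describes: the engine configures ovirtmgmt automatically only when no bridge already sits on the interface that holds the management IP, so a pre-existing ovirt-node brXXX bridge is left untouched (and may later show up as out-of-sync against a bridgeless DC definition). The caps layout and key names are assumptions mimicking getVdsCaps output.

---

def should_autoconfigure_mgmt(caps, mgmt_device):
    """True when the engine may create ovirtmgmt on mgmt_device itself."""
    # An existing bridge on the device (e.g. ovirt-node's brXXX) means the
    # engine leaves the configuration alone during host installation.
    return mgmt_device not in caps.get('bridges', {})


def mgmt_network_in_sync(caps, mgmt_device, dc_def):
    """True when the host-side state matches the DC definition of ovirtmgmt."""
    host_bridged = mgmt_device in caps.get('bridges', {})
    return host_bridged == dc_def['bridged']

---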
From masayag at redhat.com Mon May 20 13:01:30 2013
From: masayag at redhat.com (Moti Asayag)
Date: Mon, 20 May 2013 16:01:30 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1177251918.2434368.1369053085602.JavaMail.root@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <22126123.161.1369052799793.JavaMail.javamailuser@localhost> <1177251918.2434368.1369053085602.JavaMail.root@redhat.com>
Message-ID: <519A1EAA.7050606@redhat.com>

On 05/20/2013 03:31 PM, Alon Bar-Lev wrote:
> [...]
>
> Because, as far as I know, in the current implementation we did not take the ovirt-node specific behavior into account, right Moti?

Correct. See Livnat's reply.

> [...]

From alonbl at redhat.com Mon May 20 13:08:39 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 20 May 2013 09:08:39 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <519A1D4E.7070607@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com>
Message-ID: <233995750.2453181.1369055319701.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Livnat Peer"
> To: "Alon Bar-Lev"
> Cc: "Moti Asayag" , "arch"
> Sent: Monday, May 20, 2013 3:55:42 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
>
> For ovirt node -
>
> For images 3.3 and above the code below can be removed; we will make
> sure that the ovirt-node-plugin-vdsm does not create the brXXX bridges
> (or, if we have no choice, removes them).

I thought this is created in the node-core...
Just confirmed.
A node without the vdsm plugin also creates that bridge.
vdsm has nothing to do with this process as far as I know.

> For images 3.2 and below we still need this code, because oVirt node
> creates brXXX bridges and the engine does not configure the network
> automatically if a bridge exists on the interface.

It is not just 'need this code'; it is that we cannot use the bridgeless solution at the engine. Options:

1. We need to detect the node version and perform bridgeless deployment if the node is >= x, but we do not know this at the engine when we deploy a host... we do not even know that it is ovirt-node.

2. I release a new minor version of ovirt-host-deploy that deletes the bridge regardless of the mode.

3. Engine will be able to delete this bridge just as it is able to create the management bridge.

> The down side to the above is that for management networks that
> are configured in the engine as bridgeless, the management network on the
> host would still be bridged and thus will be marked as not-in-sync.

Right, and because of that I think that (3) is the best solution.

Thanks,
Alon

From mburns at redhat.com Mon May 20 13:11:52 2013
From: mburns at redhat.com (Mike Burns)
Date: Mon, 20 May 2013 09:11:52 -0400
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <20130519122111.GC2144@redhat.com>
References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com> <20130519122111.GC2144@redhat.com>
Message-ID: <519A2118.3030803@redhat.com>

On 05/19/2013 08:21 AM, Dan Kenigsberg wrote:
> On Fri, May 17, 2013 at 10:02:34AM -0400, Douglas Schilling Landgraf wrote:
>> On 05/17/2013 07:31 AM, Mike Burns wrote:
>>> As part of the move to a more universal oVirt Node during the 3.3 time
>>> frame[1], a separate repo is needed for a plugin to allow oVirt Node to
>>> communicate with oVirt Engine [2].
>>> In order to provide this plugin, I need a gerrit repo created and also
>>> need consensus on a name.
>>>
>>> Name proposal:
>>> ovirt-node-plugin-vdsm
>> I would go with that one.
>
> me2, though I do not have a strong opinion or explanation.

ok, let's go with that. I've already posted initial test packages on ovirt.org[1] with that name, so let's just go with that.

Mike

[1] http://resources.ovirt.org/releases/node-base/beta/rpm/Fedora/18/noarch/

> [...]

From iheim at redhat.com Mon May 20 19:05:51 2013
From: iheim at redhat.com (Itamar Heim)
Date: Mon, 20 May 2013 22:05:51 +0300
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <519A2118.3030803@redhat.com>
References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com> <20130519122111.GC2144@redhat.com> <519A2118.3030803@redhat.com>
Message-ID: <519A740F.2070405@redhat.com>

On 05/20/2013 04:11 PM, Mike Burns wrote:
> [...]
>
> ok, let's go with that. I've already posted initial test packages on
> ovirt.org[1] with that name, so let's just go with that.

repo created.
mike - you currently have +2/merge rights.

> [...]

From iheim at redhat.com Mon May 20 19:10:20 2013
From: iheim at redhat.com (Itamar Heim)
Date: Mon, 20 May 2013 22:10:20 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <233995750.2453181.1369055319701.JavaMail.root@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com>
Message-ID: <519A751C.1040505@redhat.com>

On 05/20/2013 04:08 PM, Alon Bar-Lev wrote:
> [...]
>
> 1. We need to detect the node version and perform bridgeless deployment if
>    the node is >= x, but we do not know this at the engine when we deploy a
>    host... we do not even know that it is ovirt-node.
>
> 2. I release a new minor version of ovirt-host-deploy that deletes the
>    bridge regardless of the mode.
>
> 3. Engine will be able to delete this bridge just as it is able to create
>    the management bridge.
>
>> The down side to the above is that for management networks that
>> are configured in the engine as bridgeless, the management network on the
>> host would still be bridged and thus will be marked as not-in-sync.
>
> Right, and because of that I think that (3) is the best solution.

adding mike - not sure if this is the solution we prefer, but if we don't want the bridge, it should not be there and be part of the responsibility of the plugins that do want it.

at some point we will move to 4.0, deprecating support for things older than, say, 3.4. so that's another way to clean up old code if we need it for backward compatibility in the meantime, flagging it for removal in 4.0.

From alonbl at redhat.com Mon May 20 19:18:52 2013
From: alonbl at redhat.com (Alon Bar-Lev)
Date: Mon, 20 May 2013 15:18:52 -0400 (EDT)
Subject: feature suggestion: initial generation of management network
In-Reply-To: <519A751C.1040505@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com>
Message-ID: <1254422638.2580437.1369077532335.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Itamar Heim"
> To: "Alon Bar-Lev"
> Cc: "Livnat Peer" , "arch" , "Mike Burns"
> Sent: Monday, May 20, 2013 10:10:20 PM
> Subject: Re: feature suggestion: initial generation of management network
>
> [...]
> adding mike - not sure if this is the solution we prefer, but if we
> don't want the bridge, it should not be there and be part of the
> responsibility of the plugins that do want it.
>
> at some point we will move to 4.0, deprecating support for things older
> than, say, 3.4. so that's another way to clean up old code if we need it
> for backward compatibility in the meantime, flagging it for removal in 4.0.

I do not understand why it is easy to add a bridge but not to remove a bridge if it exists.

This is regardless of the requirement to remove/leave the bridge in ovirt-node.

If the feature exists in both engine and vdsm, and the engine can enumerate bridges, why not simply use these features in order to provision the existing ovirt-node correctly?

Regards,
Alon

From mburns at redhat.com Mon May 20 19:41:39 2013
From: mburns at redhat.com (Mike Burns)
Date: Mon, 20 May 2013 15:41:39 -0400
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <519A740F.2070405@redhat.com>
References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com> <20130519122111.GC2144@redhat.com> <519A2118.3030803@redhat.com> <519A740F.2070405@redhat.com>
Message-ID: <519A7C73.8030901@redhat.com>

On 05/20/2013 03:05 PM, Itamar Heim wrote:
> On 05/20/2013 04:11 PM, Mike Burns wrote:
>> ok, let's go with that. I've already posted initial test packages on
>> ovirt.org[1] with that name, so let's just go with that.
>
> repo created.
> mike - you currently have +2/merge rights.

Can you give me push rights, too, so I can upload the initial code?

Mike

> [...]

From iheim at redhat.com Mon May 20 19:42:40 2013
From: iheim at redhat.com (Itamar Heim)
Date: Mon, 20 May 2013 22:42:40 +0300
Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm
In-Reply-To: <519A7C73.8030901@redhat.com>
References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com> <20130519122111.GC2144@redhat.com> <519A2118.3030803@redhat.com> <519A740F.2070405@redhat.com> <519A7C73.8030901@redhat.com>
Message-ID: <519A7CB0.1040609@redhat.com>

On 05/20/2013 10:41 PM, Mike Burns wrote:
> [...]
>
> Can you give me push rights, too, so I can upload the initial code?

done.

From lpeer at redhat.com Tue May 21 06:55:13 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Tue, 21 May 2013 09:55:13 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <233995750.2453181.1369055319701.JavaMail.root@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com>
Message-ID: <519B1A51.5090301@redhat.com>

On 05/20/2013 04:08 PM, Alon Bar-Lev wrote:
> [...]
>
> I thought this is created in the node-core...
> Just confirmed.
> A node without the vdsm plugin also creates that bridge.
> vdsm has nothing to do with this process as far as I know.

We want to discuss the bridge creation with Mike to see if it still makes sense. If it does, we thought to remove the bridges in the ovirt-node-plugin-vdsm (which is node code, not VDSM's).

>> For images 3.2 and below we still need this code, because oVirt node
>> creates brXXX bridges and the engine does not configure the network
>> automatically if a bridge exists on the interface.
> It is not just 'need this code'; it is that we cannot use the bridgeless solution at the engine.

We can use it in 3.2 and 3.1, where we have sync network; a user would need to sync the network (which will remove the bridge).

> Options:
>
> 1. We need to detect the node version and perform bridgeless deployment if the node is >= x, but we do not know this at the engine when we deploy a host... we do not even know that it is ovirt-node.

We don't need this info. If the bridge is there, we need to run the code we have today; if there is no bridge, there is nothing to do during host-deploy.

At the phase of the management bridge configuration in the engine we detect whether there is a bridge: if there is one, we don't touch the configuration; if there is none, we create one (or not, according to the network definition).

> 2. I release a new minor version of ovirt-host-deploy that deletes the bridge regardless of the mode.

That is not good, as we don't have setup-network in 3.0.

> 3. Engine will be able to delete this bridge just as it is able to create the management bridge.

- I don't like this, as it means we need to write code for a deprecated state; we already have code for handling this today, and it is in the host deploy.
- It is not a valid solution, as we can't handle v 3.0.

>> The down side to the above is that for management networks that
>> are configured in the engine as bridgeless, the management network on the
>> host would still be bridged and thus will be marked as not-in-sync.
>
> Right, and because of that I think that (3) is the best solution.

From lpeer at redhat.com Tue May 21 06:58:33 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Tue, 21 May 2013 09:58:33 +0300
Subject: feature suggestion: initial generation of management network
In-Reply-To: <1254422638.2580437.1369077532335.JavaMail.root@redhat.com>
References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com>
Message-ID: <519B1B19.3000102@redhat.com>

On 05/20/2013 10:18 PM, Alon Bar-Lev wrote:
> [...]
>
> I do not understand why it is easy to add a bridge but not to remove a bridge if it exists.

we don't have the setup network API in 3.0

> This is regardless of the requirement to remove/leave the bridge in ovirt-node.
>
> If the feature exists in both engine and vdsm, and the engine can enumerate bridges, why not simply use these features in order to provision the existing ovirt-node correctly?

I value the fact that you want to clean the host-deploy code so much, but I believe that adding code that handles a deprecated state to the engine or VDSM is not in the best interest of our project.
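To make the objection above concrete, a hedged illustration (hypothetical helper, illustrative version tuples) of why engine-side bridge removal cannot be a uniform answer: a 3.0 host exposes no setupNetworks verb, so any such cleanup would have to be version-gated, which is exactly the deprecated-state handling Livnat wants to keep out of the engine and VDSM.

---

def engine_can_reshape_networking(cluster_level):
    # Per the thread, setupNetworks is available only from 3.1 onward; for a
    # 3.0 host the engine has no verb with which to remove a pre-existing
    # bridge, so that cleanup must stay in host-deploy for those hosts.
    return cluster_level >= (3, 1)

---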
> > Regards, > Alon > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > > From alonbl at redhat.com Tue May 21 07:11:54 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Tue, 21 May 2013 03:11:54 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <519B1B19.3000102@redhat.com> References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> <519B1B19.3000102@redhat.com> Message-ID: <2116809742.2688577.1369120314496.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Livnat Peer" > To: "Alon Bar-Lev" > Cc: "Itamar Heim" , "arch" > Sent: Tuesday, May 21, 2013 9:58:33 AM > Subject: Re: feature suggestion: initial generation of management network > > On 05/20/2013 10:18 PM, Alon Bar-Lev wrote: > > > > > > ----- Original Message ----- > >> From: "Itamar Heim" > >> To: "Alon Bar-Lev" > >> Cc: "Livnat Peer" , "arch" , "Mike > >> Burns" > >> Sent: Monday, May 20, 2013 10:10:20 PM > >> Subject: Re: feature suggestion: initial generation of management network > >> > >> On 05/20/2013 04:08 PM, Alon Bar-Lev wrote: > >>> > >>> > >>> ----- Original Message ----- > >>>> From: "Livnat Peer" > >>>> To: "Alon Bar-Lev" > >>>> Cc: "Moti Asayag" , "arch" > >>>> Sent: Monday, May 20, 2013 3:55:42 PM > >>>> Subject: Re: feature suggestion: initial generation of management > >>>> network > >>>> > >>>> On 05/20/2013 02:58 PM, Alon Bar-Lev wrote: > >>>>> Hi, > >>>>> > >>>>> Now another issue... ovirt-node. > >>>>> > >>>>> In ovirt-node, the node already defines a bridge which is called > >>>>> br at INTERFACE@, this is done automatically. > >>>>> The IP address of ovirt-node is assigned to that bridge, so we always > >>>>> have > >>>>> a bridge at ovirt-node. > >>>>> > >>>>> I have the following useless in my code, most is legacy... the > >>>>> question... > >>>>> Can this also be automated by the new code at engine side? > >>>>> It should or things will break... > >>>>> > >>>> > >>>> For ovirt node - > >>>> > >>>> For images 3.3 and above the code below can be removed, we will make > >>>> shore that the ovirt-node-plugin-vdsm would not create the brXXX bridges > >>>> (or if we have no choice remove them). > >>> > >>> I thought this is created in the node-core... > >>> Just confirmed. > >>> A node without vdsm plugin also create that bridge. > >>> vdsm has nothing to do with this process as far as I know. > >>> > >>>> For images 3.2 and below we still need this code, because oVirt node > >>>> creates brXXX bridges and the engine do not configure the network > >>>> automatically if a bridge exists on the interface. > >>> > >>> It is not just 'need this code' it is that we cannot use the bridgeless > >>> solution at enigne. > >>> Options: > >>> > >>> 1. We need to detect node version and perform bridgeless deployment if > >>> node > >>> is >= x, but we do not know this at engine when we deploy a host... we > >>> even do not know that it is ovirt-node. > >>> > >>> 2. I release a new minor version of ovirt-host-deploy that delete the > >>> bridge regardless of the mode. > >>> > >>> 3. Engine will be able to delete this bridge just like he is able to > >>> create > >>> the management bridge. 
> >>> > >>>> The down side to the above is that for management networks that > >>>> configured in the engine as bridgeless the management network on the > >>>> host would still be bridged thus will be marked as not-in-sync. > >>> > >>> Right, and because of that I think that (3) is the best solution. > >> > >> adding mike - not sure if this is the solution we prefer, but if we > >> don't want the bridge, it should not be there and be part of the > >> responsibility of the plugins that do want it. > >> > >> at some point we will move to 4.0, deprecating support for things older > >> than say 3.4. so that's another way to cleanup old code if we need it > >> for backward compatibility in the meantime, flagging it for removal in > >> 4.0. > > > > I do not understand why it is easy to add a bridge but not remote a bridge > > if it is exists. > > We don't have the setup network API in 3.0. But we do have delNetwork of vdsm? > > > > > This regardless of the requirement to remove/leave the bridge in > > ovirt-node. > > > > If the feature exists in both engine and vdsm, and engine can enumerate > > bridges why not simply use these feature in order to provision the > > existing ovirt-node correctly? > > I value the fact that you want to clean up the host-deploy code so much, but > I believe that adding code that handles a deprecated state to the > engine or VDSM is not in the best interest of our project. This is not the intention; you are completely wrong. As I wrote, host-deploy *WILL NOT* remove the bridge if it does not need to create one, so currently we broke node support. Either we handle the bridge at host-deploy or we don't; what you are suggesting is a hybrid solution in which, once again, we duplicate logic between components. I will release a new minor version of ovirt-host-deploy to manage that, as I understand where this is heading. Thanks, Alon From danken at redhat.com Tue May 21 09:09:19 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Tue, 21 May 2013 12:09:19 +0300 Subject: feature suggestion: initial generation of management network In-Reply-To: <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> Message-ID: <20130521090919.GA11208@redhat.com> On Mon, May 20, 2013 at 03:18:52PM -0400, Alon Bar-Lev wrote: > > > ----- Original Message ----- > > From: "Itamar Heim" > > To: "Alon Bar-Lev" > > Cc: "Livnat Peer" , "arch" , "Mike Burns" > > Sent: Monday, May 20, 2013 10:10:20 PM > > Subject: Re: feature suggestion: initial generation of management network > > > > On 05/20/2013 04:08 PM, Alon Bar-Lev wrote: > > > > > > > > > ----- Original Message ----- > > >> From: "Livnat Peer" > > >> To: "Alon Bar-Lev" > > >> Cc: "Moti Asayag" , "arch" > > >> Sent: Monday, May 20, 2013 3:55:42 PM > > >> Subject: Re: feature suggestion: initial generation of management network > > >> > > >> On 05/20/2013 02:58 PM, Alon Bar-Lev wrote: > > >>> Hi, > > >>> > > >>> Now another issue... ovirt-node. > > >>> > > >>> In ovirt-node, the node already defines a bridge which is called > > >>> br at INTERFACE@, this is done automatically. 
> > >>> > > >>> I have the following useless in my code, most is legacy... the > > >>> question... > > >>> Can this also be automated by the new code at engine side? > > >>> It should or things will break... > > >>> > > >> > > >> For ovirt node - > > >> > > >> For images 3.3 and above the code below can be removed, we will make > > >> shore that the ovirt-node-plugin-vdsm would not create the brXXX bridges > > >> (or if we have no choice remove them). > > > > > > I thought this is created in the node-core... > > > Just confirmed. > > > A node without vdsm plugin also create that bridge. > > > vdsm has nothing to do with this process as far as I know. Indeed. And for this reason, ovirt-node should avoid creating these brXXX bridges when ovirt-node-plugin-vdsm is installed. Obviously, this can be done only from 3.3 and forward. > > > > > >> For images 3.2 and below we still need this code, because oVirt node > > >> creates brXXX bridges and the engine do not configure the network > > >> automatically if a bridge exists on the interface. > > > > > > It is not just 'need this code' it is that we cannot use the bridgeless > > > solution at enigne. > > > Options: > > > > > > 1. We need to detect node version and perform bridgeless deployment if node > > > is >= x, but we do not know this at engine when we deploy a host... we > > > even do not know that it is ovirt-node. > > > > > > 2. I release a new minor version of ovirt-host-deploy that delete the > > > bridge regardless of the mode. > > > > > > 3. Engine will be able to delete this bridge just like he is able to create > > > the management bridge. > > > > > >> The down side to the above is that for management networks that > > >> configured in the engine as bridgeless the management network on the > > >> host would still be bridged thus will be marked as not-in-sync. > > > > > > Right, and because of that I think that (3) is the best solution. > > > > adding mike - not sure if this is the solution we prefer, but if we > > don't want the bridge, it should not be there and be part of the > > responsibility of the plugins that do want it. > > > > at some point we will move to 4.0, deprecating support for things older > > than say 3.4. so that's another way to cleanup old code if we need it > > for backward compatibility in the meantime, flagging it for removal in 4.0. > > I do not understand why it is easy to add a bridge but not remote a bridge if it is exists. > > This regardless of the requirement to remove/leave the bridge in ovirt-node. > > If the feature exists in both engine and vdsm, and engine can > enumerate bridges why not simply use these feature in order to > provision the existing ovirt-node correctly? The bootstrap-time setupNetwork has to be automatic, of course. It should create the management network, but should not ruin pre-existing networks, that might have been defined by an admin on the host. But we cannot distinguish an ovirt-node automatic brXXX bridge from a same-named network, intentionally defined there. I'm not sure if anyone is using or expecting these breth* bridges. In the vdsm/engine context, they are no more than nuisance. Someone once told me that such nuisance should be removed at the nearest point possible, and luckily, we already have the code to do this in ovirt-host-deploy. So I'm voting for (3): if ovirt-host-deploy does not receive VDSM/managementBridgeName, it should still remove the brXXX bridges. Dan. 
From danken at redhat.com Tue May 21 09:17:13 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Tue, 21 May 2013 12:17:13 +0300 Subject: Need a new gerrit repo for ovirt-node-plugin-vdsm In-Reply-To: <519A7C73.8030901@redhat.com> References: <5196151D.2050307@redhat.com> <5196387A.9070604@redhat.com> <20130519122111.GC2144@redhat.com> <519A2118.3030803@redhat.com> <519A740F.2070405@redhat.com> <519A7C73.8030901@redhat.com> Message-ID: <20130521091713.GD11208@redhat.com> On Mon, May 20, 2013 at 03:41:39PM -0400, Mike Burns wrote: > On 05/20/2013 03:05 PM, Itamar Heim wrote: > >On 05/20/2013 04:11 PM, Mike Burns wrote: > >>On 05/19/2013 08:21 AM, Dan Kenigsberg wrote: > >>>On Fri, May 17, 2013 at 10:02:34AM -0400, Douglas Schilling Landgraf > >>>wrote: > >>>>On 05/17/2013 07:31 AM, Mike Burns wrote: > >>>>>As part of the move to a more universal oVirt Node during the 3.3 time > >>>>>frame[1], a separate repo is needed for a plugin to allow oVirt > >>>>>Node to > >>>>>communicate with oVirt Engine [2]. > >>>>> > >>>>>In order to provide this plugin, I need a gerrit repo created and also > >>>>>need consensus on a name. > >>>>> > >>>>>Name proposal: > >>>>>ovirt-node-plugin-vdsm > >>>>I would go with that one. > >>> > >>>me2, though I do not have a strong opinion or explanation. > >>> > >> > >>ok, let's go with that. I've already posted initial test packages on > >>ovirt.org[1] with that name, so let's just go with that. > > > >repo created. > >mike - you currently have +2/merge rights. > > Can you give me push rights, too, so I can upload the initial code? > > Mike It's slightly off topic, Mike, but how do you feel about not creating the brXXX bridges when this plugin is installed? And could you tell me if they are ever used, outside the vdsm/engine context? Regards, Dan. From danken at redhat.com Tue May 21 09:47:04 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Tue, 21 May 2013 12:47:04 +0300 Subject: feature suggestion: initial generation of management network In-Reply-To: <20130521090919.GA11208@redhat.com> References: <20121225122722.GG7274@redhat.com> <519A0DBE.4050804@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> <20130521090919.GA11208@redhat.com> Message-ID: <20130521094704.GA13208@redhat.com> On Tue, May 21, 2013 at 12:09:19PM +0300, Dan Kenigsberg wrote: > On Mon, May 20, 2013 at 03:18:52PM -0400, Alon Bar-Lev wrote: > > > > > > ----- Original Message ----- > > > From: "Itamar Heim" > > > To: "Alon Bar-Lev" > > > Cc: "Livnat Peer" , "arch" , "Mike Burns" > > > Sent: Monday, May 20, 2013 10:10:20 PM > > > Subject: Re: feature suggestion: initial generation of management network > > > > > > On 05/20/2013 04:08 PM, Alon Bar-Lev wrote: > > > > > > > > > > > > ----- Original Message ----- > > > >> From: "Livnat Peer" > > > >> To: "Alon Bar-Lev" > > > >> Cc: "Moti Asayag" , "arch" > > > >> Sent: Monday, May 20, 2013 3:55:42 PM > > > >> Subject: Re: feature suggestion: initial generation of management network > > > >> > > > >> On 05/20/2013 02:58 PM, Alon Bar-Lev wrote: > > > >>> Hi, > > > >>> > > > >>> Now another issue... ovirt-node. > > > >>> > > > >>> In ovirt-node, the node already defines a bridge which is called > > > >>> br at INTERFACE@, this is done automatically. 
> > > >>> The IP address of ovirt-node is assigned to that bridge, so we always > > > >>> have > > > >>> a bridge at ovirt-node. > > > >>> > > > >>> I have the following useless in my code, most is legacy... the > > > >>> question... > > > >>> Can this also be automated by the new code at engine side? > > > >>> It should or things will break... > > > >>> > > > >> > > > >> For ovirt node - > > > >> > > > >> For images 3.3 and above the code below can be removed, we will make > > > >> shore that the ovirt-node-plugin-vdsm would not create the brXXX bridges > > > >> (or if we have no choice remove them). > > > > > > > > I thought this is created in the node-core... > > > > Just confirmed. > > > > A node without vdsm plugin also create that bridge. > > > > vdsm has nothing to do with this process as far as I know. > > Indeed. And for this reason, ovirt-node should avoid creating these > brXXX bridges when ovirt-node-plugin-vdsm is installed. Obviously, this > can be done only from 3.3 and forward. > > > > > > > > >> For images 3.2 and below we still need this code, because oVirt node > > > >> creates brXXX bridges and the engine do not configure the network > > > >> automatically if a bridge exists on the interface. > > > > > > > > It is not just 'need this code' it is that we cannot use the bridgeless > > > > solution at enigne. > > > > Options: > > > > > > > > 1. We need to detect node version and perform bridgeless deployment if node > > > > is >= x, but we do not know this at engine when we deploy a host... we > > > > even do not know that it is ovirt-node. > > > > > > > > 2. I release a new minor version of ovirt-host-deploy that delete the > > > > bridge regardless of the mode. > > > > > > > > 3. Engine will be able to delete this bridge just like he is able to create > > > > the management bridge. > > > > > > > >> The down side to the above is that for management networks that > > > >> configured in the engine as bridgeless the management network on the > > > >> host would still be bridged thus will be marked as not-in-sync. > > > > > > > > Right, and because of that I think that (3) is the best solution. > > > > > > adding mike - not sure if this is the solution we prefer, but if we > > > don't want the bridge, it should not be there and be part of the > > > responsibility of the plugins that do want it. > > > > > > at some point we will move to 4.0, deprecating support for things older > > > than say 3.4. so that's another way to cleanup old code if we need it > > > for backward compatibility in the meantime, flagging it for removal in 4.0. > > > > I do not understand why it is easy to add a bridge but not remote a bridge if it is exists. > > > > This regardless of the requirement to remove/leave the bridge in ovirt-node. > > > > If the feature exists in both engine and vdsm, and engine can > > enumerate bridges why not simply use these feature in order to > > provision the existing ovirt-node correctly? > > The bootstrap-time setupNetwork has to be automatic, of course. > It should create the management network, but should not ruin > pre-existing networks, that might have been defined by an admin on the > host. But we cannot distinguish an ovirt-node automatic brXXX bridge > from a same-named network, intentionally defined there. > > I'm not sure if anyone is using or expecting these breth* bridges. > In the vdsm/engine context, they are no more than nuisance. 
Someone once > told me that such nuisance should be removed at the nearest point > possible, and luckily, we already have the code to do this in > ovirt-host-deploy. > > So I'm voting for (3): if ovirt-host-deploy does not receive > VDSM/managementBridgeName, it should still remove the brXXX bridges. arghh, obviously, I was voting for (1), and obviously, there were other messages in this thread while I was writing mine. From alonbl at redhat.com Tue May 21 10:35:05 2013 From: alonbl at redhat.com (Alon Bar-Lev) Date: Tue, 21 May 2013 06:35:05 -0400 (EDT) Subject: feature suggestion: initial generation of management network In-Reply-To: <20130521094704.GA13208@redhat.com> References: <20121225122722.GG7274@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> <20130521090919.GA11208@redhat.com> <20130521094704.GA13208@redhat.com> Message-ID: <1674451289.2737699.1369132505310.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Dan Kenigsberg" > To: "Alon Bar-Lev" , "Mike Burns" > Cc: "Itamar Heim" , "arch" > Sent: Tuesday, May 21, 2013 12:47:04 PM > Subject: Re: feature suggestion: initial generation of management network > > On Tue, May 21, 2013 at 12:09:19PM +0300, Dan Kenigsberg wrote: > > On Mon, May 20, 2013 at 03:18:52PM -0400, Alon Bar-Lev wrote: > > > > > > > > > ----- Original Message ----- > > > > From: "Itamar Heim" > > > > To: "Alon Bar-Lev" > > > > Cc: "Livnat Peer" , "arch" , "Mike > > > > Burns" > > > > Sent: Monday, May 20, 2013 10:10:20 PM > > > > Subject: Re: feature suggestion: initial generation of management > > > > network > > > > > > > > On 05/20/2013 04:08 PM, Alon Bar-Lev wrote: > > > > > > > > > > > > > > > ----- Original Message ----- > > > > >> From: "Livnat Peer" > > > > >> To: "Alon Bar-Lev" > > > > >> Cc: "Moti Asayag" , "arch" > > > > >> Sent: Monday, May 20, 2013 3:55:42 PM > > > > >> Subject: Re: feature suggestion: initial generation of management > > > > >> network > > > > >> > > > > >> On 05/20/2013 02:58 PM, Alon Bar-Lev wrote: > > > > >>> Hi, > > > > >>> > > > > >>> Now another issue... ovirt-node. > > > > >>> > > > > >>> In ovirt-node, the node already defines a bridge which is called > > > > >>> br at INTERFACE@, this is done automatically. > > > > >>> The IP address of ovirt-node is assigned to that bridge, so we > > > > >>> always > > > > >>> have > > > > >>> a bridge at ovirt-node. > > > > >>> > > > > >>> I have the following useless in my code, most is legacy... the > > > > >>> question... > > > > >>> Can this also be automated by the new code at engine side? > > > > >>> It should or things will break... > > > > >>> > > > > >> > > > > >> For ovirt node - > > > > >> > > > > >> For images 3.3 and above the code below can be removed, we will make > > > > >> shore that the ovirt-node-plugin-vdsm would not create the brXXX > > > > >> bridges > > > > >> (or if we have no choice remove them). > > > > > > > > > > I thought this is created in the node-core... > > > > > Just confirmed. > > > > > A node without vdsm plugin also create that bridge. > > > > > vdsm has nothing to do with this process as far as I know. > > > > Indeed. And for this reason, ovirt-node should avoid creating these > > brXXX bridges when ovirt-node-plugin-vdsm is installed. Obviously, this > > can be done only from 3.3 and forward. 
> > > > > > > > >> For images 3.2 and below we still need this code, because oVirt node > > > > >> creates brXXX bridges and the engine do not configure the network > > > > >> automatically if a bridge exists on the interface. > > > > > > > > > > It is not just 'need this code' it is that we cannot use the > > > > > bridgeless > > > > > solution at enigne. > > > > > Options: > > > > > > > > > > 1. We need to detect node version and perform bridgeless deployment > > > > > if node > > > > > is >= x, but we do not know this at engine when we deploy a host... > > > > > we > > > > > even do not know that it is ovirt-node. > > > > > > > > > > 2. I release a new minor version of ovirt-host-deploy that delete the > > > > > bridge regardless of the mode. > > > > > > > > > > 3. Engine will be able to delete this bridge just like he is able to > > > > > create > > > > > the management bridge. > > > > > > > > > >> The down side to the above is that for management networks that > > > > >> configured in the engine as bridgeless the management network on the > > > > >> host would still be bridged thus will be marked as not-in-sync. > > > > > > > > > > Right, and because of that I think that (3) is the best solution. > > > > > > > > adding mike - not sure if this is the solution we prefer, but if we > > > > don't want the bridge, it should not be there and be part of the > > > > responsibility of the plugins that do want it. > > > > > > > > at some point we will move to 4.0, deprecating support for things older > > > > than say 3.4. so that's another way to cleanup old code if we need it > > > > for backward compatibility in the meantime, flagging it for removal in > > > > 4.0. > > > > > > I do not understand why it is easy to add a bridge but not remote a > > > bridge if it is exists. > > > > > > This regardless of the requirement to remove/leave the bridge in > > > ovirt-node. > > > > > > If the feature exists in both engine and vdsm, and engine can > > > enumerate bridges why not simply use these feature in order to > > > provision the existing ovirt-node correctly? > > > > The bootstrap-time setupNetwork has to be automatic, of course. > > It should create the management network, but should not ruin > > pre-existing networks, that might have been defined by an admin on the > > host. But we cannot distinguish an ovirt-node automatic brXXX bridge > > from a same-named network, intentionally defined there. > > > > I'm not sure if anyone is using or expecting these breth* bridges. > > In the vdsm/engine context, they are no more than nuisance. Someone once > > told me that such nuisance should be removed at the nearest point > > possible, and luckily, we already have the code to do this in > > ovirt-host-deploy. > > > > So I'm voting for (3): if ovirt-host-deploy does not receive > > VDSM/managementBridgeName, it should still remove the brXXX bridges. > > arghh, obviously, I was voting for (2), and obviously, there were other > messages in this thread while I was writing mine. > Regardless of the implications made earlier, the mission did not originate in cleaning up the code, but in reducing the risk of breakage during host-deploy. Although we managed to reduce the risk of breakage when using a standard host, we fail to do so with ovirt-node if we need to delete a bridge: 1. We continue to guess the interface used to communicate with the engine by trying to find the best route from the host to vdsm, while the outgoing route may be different from the incoming route. 2. 
We continue to try to initiate outgoing communication from the host to the engine in order to find out when the interface is up, while there is no guarantee that we can initiate communication. 3. We keep starting libvirtd and messagebus on the host and use delNetwork and vdsm-store-net-config on nodes. So unfortunately, I don't see any real value in creating the bridge at the engine when we deal with ovirt-node. I think the simplest way is to have VdsDeploy (engine side) set the bridge name if a node is detected, so that the bridge will be created by host-deploy using the legacy logic. Regards, Alon From mburns at redhat.com Tue May 21 12:49:28 2013 From: mburns at redhat.com (Mike Burns) Date: Tue, 21 May 2013 08:49:28 -0400 Subject: feature suggestion: initial generation of management network In-Reply-To: <1674451289.2737699.1369132505310.JavaMail.root@redhat.com> References: <20121225122722.GG7274@redhat.com> <1387474947.2425969.1369051085926.JavaMail.root@redhat.com> <519A1D4E.7070607@redhat.com> <233995750.2453181.1369055319701.JavaMail.root@redhat.com> <519A751C.1040505@redhat.com> <1254422638.2580437.1369077532335.JavaMail.root@redhat.com> <20130521090919.GA11208@redhat.com> <20130521094704.GA13208@redhat.com> <1674451289.2737699.1369132505310.JavaMail.root@redhat.com> Message-ID: <519B6D58.7070801@redhat.com> On 05/21/2013 06:35 AM, Alon Bar-Lev wrote: > > > ----- Original Message ----- >> From: "Dan Kenigsberg" To: "Alon Bar-Lev" >> , "Mike Burns" Cc: "Itamar >> Heim" , "arch" Sent: Tuesday, >> May 21, 2013 12:47:04 PM Subject: Re: feature suggestion: initial >> generation of management network >> >> On Tue, May 21, 2013 at 12:09:19PM +0300, Dan Kenigsberg wrote: >>> On Mon, May 20, 2013 at 03:18:52PM -0400, Alon Bar-Lev wrote: >>>> >>>> >>>> ----- Original Message ----- >>>>> From: "Itamar Heim" To: "Alon Bar-Lev" >>>>> Cc: "Livnat Peer" , >>>>> "arch" , "Mike Burns" >>>>> Sent: Monday, May 20, 2013 10:10:20 PM Subject: Re: feature >>>>> suggestion: initial generation of management network >>>>> >>>>> On 05/20/2013 04:08 PM, Alon Bar-Lev wrote: >>>>>> >>>>>> >>>>>> ----- Original Message ----- >>>>>>> From: "Livnat Peer" To: "Alon Bar-Lev" >>>>>>> Cc: "Moti Asayag" >>>>>>> , "arch" Sent: >>>>>>> Monday, May 20, 2013 3:55:42 PM Subject: Re: feature >>>>>>> suggestion: initial generation of management network >>>>>>> >>>>>>> On 05/20/2013 02:58 PM, Alon Bar-Lev wrote: >>>>>>>> Hi, >>>>>>>> >>>>>>>> Now another issue... ovirt-node. >>>>>>>> >>>>>>>> In ovirt-node, the node already defines a bridge which >>>>>>>> is called br at INTERFACE@, this is done automatically. >>>>>>>> The IP address of ovirt-node is assigned to that >>>>>>>> bridge, so we always have a bridge at ovirt-node. >>>>>>>> >>>>>>>> I have the following useless in my code, most is >>>>>>>> legacy... the question... Can this also be automated by >>>>>>>> the new code at engine side? It should or things will >>>>>>>> break... >>>>>>>> >>>>>>> >>>>>>> For ovirt node - >>>>>>> >>>>>>> For images 3.3 and above the code below can be removed, >>>>>>> we will make shore that the ovirt-node-plugin-vdsm would >>>>>>> not create the brXXX bridges (or if we have no choice >>>>>>> remove them). >>>>>> >>>>>> I thought this is created in the node-core... Just >>>>>> confirmed. A node without vdsm plugin also create that >>>>>> bridge. vdsm has nothing to do with this process as far as >>>>>> I know. 
>>> Obviously, this can be done only from 3.3 and forward. >>> >>>>>> >>>>>>> For images 3.2 and below we still need this code, because >>>>>>> oVirt node creates brXXX bridges and the engine do not >>>>>>> configure the network automatically if a bridge exists on >>>>>>> the interface. >>>>>> >>>>>> It is not just 'need this code' it is that we cannot use >>>>>> the bridgeless solution at enigne. Options: >>>>>> >>>>>> 1. We need to detect node version and perform bridgeless >>>>>> deployment if node is >= x, but we do not know this at >>>>>> engine when we deploy a host... we even do not know that it >>>>>> is ovirt-node. >>>>>> >>>>>> 2. I release a new minor version of ovirt-host-deploy that >>>>>> delete the bridge regardless of the mode. >>>>>> >>>>>> 3. Engine will be able to delete this bridge just like he >>>>>> is able to create the management bridge. >>>>>> >>>>>>> The down side to the above is that for management >>>>>>> networks that configured in the engine as bridgeless the >>>>>>> management network on the host would still be bridged >>>>>>> thus will be marked as not-in-sync. >>>>>> >>>>>> Right, and because of that I think that (3) is the best >>>>>> solution. >>>>> >>>>> adding mike - not sure if this is the solution we prefer, but >>>>> if we don't want the bridge, it should not be there and be >>>>> part of the responsibility of the plugins that do want it. >>>>> >>>>> at some point we will move to 4.0, deprecating support for >>>>> things older than say 3.4. so that's another way to cleanup >>>>> old code if we need it for backward compatibility in the >>>>> meantime, flagging it for removal in 4.0. >>>> >>>> I do not understand why it is easy to add a bridge but not >>>> remote a bridge if it is exists. >>>> >>>> This regardless of the requirement to remove/leave the bridge >>>> in ovirt-node. >>>> >>>> If the feature exists in both engine and vdsm, and engine can >>>> enumerate bridges why not simply use these feature in order to >>>> provision the existing ovirt-node correctly? >>> >>> The bootstrap-time setupNetwork has to be automatic, of course. >>> It should create the management network, but should not ruin >>> pre-existing networks, that might have been defined by an admin >>> on the host. But we cannot distinguish an ovirt-node automatic >>> brXXX bridge from a same-named network, intentionally defined >>> there. >>> >>> I'm not sure if anyone is using or expecting these breth* >>> bridges. In the vdsm/engine context, they are no more than >>> nuisance. Someone once told me that such nuisance should be >>> removed at the nearest point possible, and luckily, we already >>> have the code to do this in ovirt-host-deploy. >>> >>> So I'm voting for (3): if ovirt-host-deploy does not receive >>> VDSM/managementBridgeName, it should still remove the brXXX >>> bridges. >> >> arghh, obviously, I was voting for (1), and obviously, there were >> other messages in this thread while I was writing mine. >> > > Regardless the implications made earlier, the mission is not > originated in cleaning up the code, but reduce the risk of breakage > during host-deploy. > > Although we managed to reduce the risk of breakage when using > standard host, we fail to do so using ovirt-node if we require to > delete a bridge: > > 1. We continue to guess the interface used to communicate with engine > by trying to find the best route from the host to vdsm, while > outgoing route may be different than incoming route. > > 2. 
We continue to try to initiate outgoing communication from the host to > the engine in order to find out when the interface is up, while there is no > guarantee that we can initiate communication. > > 3. We keep starting libvirtd and messagebus on the host and use delNetwork and > vdsm-store-net-config on nodes. > > So unfortunately, I don't see any real value in creating the bridge > at the engine when we deal with ovirt-node. > > I think the simplest way is to have VdsDeploy (engine side) set the > bridge name if a node is detected, so that the bridge will be created by > host-deploy using the legacy logic. > > Regards, Alon > Catching up on this thread with analysis of impact on ovirt-node. The request is to not create the bridge by default (or at least provide a method to disable bridge creation). Having the bridge is a pretty basic assumption that has always been in ovirt-node. It's not something we could reliably change in the timeframe of the oVirt 3.3 release. Beta/Feature Freeze for oVirt 3.3 is 31-May. oVirt Node is already in RC for our 3.0 release. At best, we could maybe deliver it sometime in June, but I don't feel it would be stable enough to be used in 3.3 GA. Note: there are other aspects of this as well. We lock out network configuration locally based on the existence of the ovirtmgmt bridge, so we need to add methods to handle that issue as well. Mike From leonardo.bianconi at eldorado.org.br Tue May 21 15:33:54 2013 From: leonardo.bianconi at eldorado.org.br (Leonardo Bianconi) Date: Tue, 21 May 2013 15:33:54 +0000 Subject: oVirt on IBM POWER (PPC64) - new feature contributors In-Reply-To: <5193A8AE.5000600@redhat.com> References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br> <5193A8AE.5000600@redhat.com> Message-ID: <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> Hi all, We are planning to deliver support for PPC64 in 4 phases. We will manage to deliver a set of patches at the end of each phase. Phase 1 Change oVirt to handle other architectures by encapsulating all the architecture specific code and queries about the capabilities of the hypervisor into a new class called ArchStrategy (based on the Strategy Design Pattern). Every operation involving clusters and hosts will be validated by this new class. Some hard-coded parameters are going to be replaced by queries in the backend in order to accommodate the support for new architectures into the engine. Phase 2 Currently, each host hypervisor capabilities are obtained using hard-coded data structures. These structures will be replaced either by some form of integration with libosinfo or by reading internal configuration files describing these capabilities. It will be handled after Roy's patch. Phase 3 The code for providing the support for IBM POWER systems will be added. The encapsulation done in the previous phase will reduce the effort to include this feature into the engine. 
The other changes that will be introduced in this phase include: - Modifications in the frontend to avoid running a VM created on a POWER host in a x86-64 host (and vice-versa), - All the dynamically provided capacities of the first phase will be implemented according to the capacities of the QEMU/KVM on POWER - The POWER processors will be available as an option in the list of processor names (this will imply in significant changes in the backend) Phase 4 Adapt secondary features to polish the support for POWER: - OVF import and export of VMs running in POWER hosts - Dynamic searches capable of finding hosts, pools, vms and clusters according to their architectures Is there any ongoing oVirt architecture refactoring ? (that may conflict with Phase 1, for instance) Any comments ? Regards, Vitor (vitor.lima at eldorado.org.br) / Leonardo (leonardo.bianconi at eldorado.org.br) -----Original Message----- From: Dave Neary [mailto:dneary at redhat.com] Sent: quarta-feira, 15 de maio de 2013 12:25 To: Leonardo Bianconi Cc: arch at ovirt.org; Adam Litke Subject: Re: oVirt on IBM POWER (PPC64) - new feature contributors Hi, On 05/14/2013 08:05 PM, Leonardo Bianconi wrote: > We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima. Welcome! > We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and the "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM. > About libosinfo: > ============= > > In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), occurred in November 2012, it was suggested that integrating the libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future. > > Is this approach still valid? If so, when will it be available? It seems to be a dependency for oVirt "Engine_support_for_PPC64" feature implementation. This is great news. I don't know who, specifically, has been working on this issue in IBM - perhaps Adam Litke (CCed) can update you on the progress that has been made. Cheers, Dave. -- Dave Neary - Community Action and Impact Open Source and Standards, Red Hat - http://community.redhat.com Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13 From rydekull at gmail.com Wed May 22 07:41:12 2013 From: rydekull at gmail.com (Alexander Rydekull) Date: Wed, 22 May 2013 09:41:12 +0200 Subject: oVirt on IBM POWER (PPC64) - new feature contributors In-Reply-To: <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br> <5193A8AE.5000600@redhat.com> <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> Message-ID: Ooh, this is like music to my ears. If the oVirt-project needs some POWER-hardware as Jenkinsslaves, buildhosts etc, I can surely arrange that. /Alexander Rydekull On Tue, May 21, 2013 at 5:33 PM, Leonardo Bianconi < leonardo.bianconi at eldorado.org.br> wrote: > Hi all, > > We are planning to deliver support for PPC64 in 4 phases. We will manage > to deliver a set of patches at the end of each phase. > > Phase 1 > > Change oVirt to handle other architectures by encapsulating all > the architecture specific code and queries about the capabilities of the > hypervisor into a new class called ArchStrategy (based on the Strategy > Design Pattern). 
Every operation involving clusters and hosts will be > validated by this new class. Some hard-coded parameters are going to be > replaced by queries in the backend in order to accommodate the support for > new architectures into the engine. > > > Phase 2 > > Currently, each host hypervisor capabilities are obtained using > hard-coded data structures. These structures will be replaced either by > some form of integration with libosinfo or by reading internal > configuration files describing these capabilities. It will be handled after > Roy's patch. > > > Phase 3 > > The code for providing the support for IBM POWER systems will be > added. The encapsulation done in the previous phase will reduce the effort > to include this feature into the engine. The other changes that will be > introduced in this phase include: > > - Modifications in the frontend to avoid running a VM created on a > POWER host in a x86-64 host (and vice-versa), > - All the dynamically provided capacities of the first phase will > be implemented according to the capacities of the QEMU/KVM on POWER > - The POWER processors will be available as an option in the list > of processor names (this will imply in significant changes in the backend) > > > Phase 4 > > Adapt secondary features to polish the support for POWER: > > - OVF import and export of VMs running in POWER hosts > - Dynamic searches capable of finding hosts, pools, vms and > clusters according to their architectures > > > Is there any ongoing oVirt architecture refactoring ? (that may conflict > with Phase 1, for instance) > > Any comments ? > > Regards, > Vitor (vitor.lima at eldorado.org.br) / Leonardo ( > leonardo.bianconi at eldorado.org.br) > > > -----Original Message----- > From: Dave Neary [mailto:dneary at redhat.com] > Sent: quarta-feira, 15 de maio de 2013 12:25 > To: Leonardo Bianconi > Cc: arch at ovirt.org; Adam Litke > Subject: Re: oVirt on IBM POWER (PPC64) - new feature contributors > > Hi, > > On 05/14/2013 08:05 PM, Leonardo Bianconi wrote: > > We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima. > > Welcome! > > > We would like to work on the features "Engine support for PPC64" ( > http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and the "Vdsm > for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has > already been started by some developers at IBM. > > > > > About libosinfo: > > ============= > > > > In the previous discussion about this topic ( > http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), > occurred in November 2012, it was suggested that integrating the libosinfo > into the engine would be a better way to handle the differences between the > architectures that would be supported in the future. > > > > Is this approach still valid? If so, when will it be available? It seems > to be a dependency for oVirt "Engine_support_for_PPC64" feature > implementation. > > > This is great news. I don't know who, specifically, has been working on > this issue in IBM - perhaps Adam Litke (CCed) can update you on the > progress that has been made. > > Cheers, > Dave. > > -- > Dave Neary - Community Action and Impact Open Source and Standards, Red > Hat - http://community.redhat.com > Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13 > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > -- /Alexander Rydekull 
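As an aside on the Phase 1 plan quoted above, a minimal sketch of the proposed Strategy encapsulation, written in Python purely for illustration (the engine itself is Java, and all names below are hypothetical, not the proposed classes):

    # Illustrative sketch of the ArchStrategy idea: architecture-specific
    # checks live behind one interface instead of being hard-coded all over
    # the engine. Hypothetical names only.
    class ArchStrategy(object):
        arch = None

        def can_run_vm(self, vm_arch):
            # a VM created for one architecture must not run on another
            return vm_arch == self.arch

        def default_nic_type(self):
            raise NotImplementedError

    class X86_64Strategy(ArchStrategy):
        arch = 'x86_64'

        def default_nic_type(self):
            return 'virtio'

    class PPC64Strategy(ArchStrategy):
        arch = 'ppc64'

        def default_nic_type(self):
            return 'spapr-vlan'   # assumed PPC64 default, for illustration

    STRATEGIES = dict((s.arch, s) for s in (X86_64Strategy(), PPC64Strategy()))
    print(STRATEGIES['ppc64'].can_run_vm('x86_64'))   # False

A cluster- or host-level validation would then dispatch through the strategy for its architecture instead of assuming x86-64.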
From dfediuck at redhat.com Wed May 22 15:55:26 2013 From: dfediuck at redhat.com (Doron Fediuck) Date: Wed, 22 May 2013 11:55:26 -0400 (EDT) Subject: add blkIoTune support for a specific device at vm creation In-Reply-To: <51999389.6010705@linux.vnet.ibm.com> References: <51999389.6010705@linux.vnet.ibm.com> Message-ID: <1600667990.9465235.1369238126197.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Mei Liu" > To: arch at ovirt.org > Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com > Sent: Monday, May 20, 2013 6:07:53 AM > Subject: add blkIoTune support for a specific device at vm creation > > Hi all, > I would like to add blkIoTune support for a specific device at vm creation. > > The code parses the 'blkIoTune' description for block devices at vm creation > time and adds iotune tag accordingly. > > e.g. > Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for a > block device will add the following in dom xml for that device. > > <iotune> > <read_bytes_sec>6120000</read_bytes_sec> > <total_iops_sec>800</total_iops_sec> > </iotune> > > The patch is under review in http://gerrit.ovirt.org/#/c/14636/ . > > Does the patch meet the requirement of the engine or the whole > architecture? Are the new parameters properly placed? Any suggestions > are welcome. > > TIA. > > Best regards, > Mei Liu (Rose) > > Hi Mei Liu, Apologies for my late response. Is there an engine implementation for it? This is important enough that we should consider how it works at the cluster level, not just at the host level. For example, what happens during and after live migration? From mgoldboi at redhat.com Thu May 23 10:50:31 2013 From: mgoldboi at redhat.com (Moran Goldboim) Date: Thu, 23 May 2013 13:50:31 +0300 Subject: [3.3 release process] feature freeze readiness Message-ID: <519DF477.9000300@redhat.com> we are about a week away from feature freeze of oVirt 3.3 (May 31st). In order to be able to converge to this date we would need your help (feature owners) understanding feature/overall status. I have updated OVirt 3.3 release-management page [1] with features status table [2]. Having this complete table filled will help us determine this version readiness. a test page addition is needed for each feature (link through the table) - this will give us good visibility and testing of those features in test-day. I have left the old 3.3 feature overview - just below the new table - to make sure I have taken all needed from that info. Please verify all you need is in the table, since this part will be deleted on feature freeze readiness review - next Wednesday meeting. Completing this table should be done no later than Tuesday May 28th. Thanks. [1]http://www.ovirt.org/OVirt_3.3_release-management [2]http://www.ovirt.org/OVirt_3.3_release-management#Features_Status_Table [3]http://www.ovirt.org/OVirt_3.3_release-management#Features From mburns at redhat.com Thu May 23 11:58:58 2013 From: mburns at redhat.com (Mike Burns) Date: Thu, 23 May 2013 07:58:58 -0400 Subject: [3.3 release process] feature freeze readiness In-Reply-To: <519DF477.9000300@redhat.com> References: <519DF477.9000300@redhat.com> Message-ID: <519E0482.9000803@redhat.com> On 05/23/2013 06:50 AM, Moran Goldboim wrote: > we are about a week away from feature freeze of oVirt 3.3 (May 31st). In > order to be able to converge to this date we would need your help > (feature owners) understanding feature/overall status. > > I have updated OVirt 3.3 release-management page [1] with features > status table [2]. Having this complete table filled will help us > determine this version readiness. 
> > a test page addition is needed for each feature (link through the table) > - this will give us good visibility and testing of those features in > test-day. > > I have left the old 3.3 feature overview - just below the new table - to > make sure I have taken all needed from that info. Please verify all you > need is in the table, since this part will be deleted on feature freeze > readiness review - next Wednesday meeting. > > Completing this table should be done no later than Tuesday May > 28th. An example of a testing chart is available on [4] Mike [4] http://www.ovirt.org/Features/Node_vdsm_plugin#Testing > > Thanks. > > [1]http://www.ovirt.org/OVirt_3.3_release-management > [2]http://www.ovirt.org/OVirt_3.3_release-management#Features_Status_Table > [3]http://www.ovirt.org/OVirt_3.3_release-management#Features > > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch From danken at redhat.com Thu May 23 15:36:42 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Thu, 23 May 2013 18:36:42 +0300 Subject: add blkIoTune support for a specific device at vm creation In-Reply-To: <1600667990.9465235.1369238126197.JavaMail.root@redhat.com> References: <51999389.6010705@linux.vnet.ibm.com> <1600667990.9465235.1369238126197.JavaMail.root@redhat.com> Message-ID: <20130523153642.GD20575@redhat.com> On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote: > ----- Original Message ----- > > From: "Mei Liu" > > To: arch at ovirt.org > > Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com > > Sent: Monday, May 20, 2013 6:07:53 AM > > Subject: add blkIoTune support for a specific device at vm creation > > > > Hi all, > > I would like to add blkIoTune support for a specific device at vm creation. > > > > The code parses the 'blkIoTune' description for block devices at vm creation > > time and adds iotune tag accordingly. > > > > e.g. > > Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for a > > block device will add the following in dom xml for that device. > > > > <iotune> > > <read_bytes_sec>6120000</read_bytes_sec> > > <total_iops_sec>800</total_iops_sec> > > </iotune> > > > > The patch is under review in http://gerrit.ovirt.org/#/c/14636/ . > > > > Does the patch meet the requirement of the engine or the whole > > architecture? Are the new parameters properly placed? Any suggestions > > are welcome. > > > > TIA. > > > > Best regards, > > Mei Liu (Rose) > > > > > > Hi Mei Liu, > Apologies for my late response. > Is there an engine implementation for it? > This is important enough that we should consider how it works at the cluster > level, not just at the host level. For example, what happens during and after > live migration? As far as I know, there's nothing written on the Engine side, yet. And in my opinion, we can aim low, and have a very minimalistic implementation that gives a thin GUI for each block device, where these two parameters can be edited, and passed to Vdsm on vmCreate, hotplug, and vmUpdateDevice. Obviously, such an implementation affects only the vDisk level; the io throttling would follow the VM to the destination node, regardless of other readers/writers on that host. This is suboptimal; it would be cooler to have a policy that provides relative io priority to different vDisks/VMs/users. But in my opinion, it can wait for a second phase. I'm fine with the suggested Engine/Vdsm API (though I'd try to be even more just-like-libvirt and call it "iotune"). But I'm no Engine expert to judge - they may want to hide it in their specParams map. 
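For concreteness, a sketch of how a drive's 'blkIoTune' dict could be rendered into the <iotune> element shown above; this is a hypothetical helper, not the code under review in the gerrit patch:

    # Sketch only, not the gerrit patch under review: render a drive's
    # 'blkIoTune' dict into libvirt's <iotune> element.
    from xml.dom.minidom import Document

    def iotune_element(doc, blk_io_tune):
        iotune = doc.createElement('iotune')
        for name, value in blk_io_tune.items():   # e.g. 'read_bytes_sec'
            elem = doc.createElement(name)
            elem.appendChild(doc.createTextNode(str(value)))
            iotune.appendChild(elem)
        return iotune

    doc = Document()
    print(iotune_element(doc, {'read_bytes_sec': 6120000,
                               'total_iops_sec': 800}).toxml())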
I'd love to see a feature page written for this, anyway! Dan. From alourie at redhat.com Tue May 14 13:28:05 2013 From: alourie at redhat.com (Alex Lourie) Date: Tue, 14 May 2013 13:31:05 +0300 Subject: [ANN] New development environment for ovirt-engine In-Reply-To: <248944163.1356174.1368528861600.JavaMail.root@redhat.com> References: <9423451.273926.1368359571790.JavaMail.root@redhat.com> <248944163.1356174.1368528861600.JavaMail.root@redhat.com> Message-ID: <201305141328.r4EDS6VD014561@int-mx11.intmail.prod.int.phx2.redhat.com> On Tue, May 14, 2013 at 1:54 PM, Moti Asayag wrote: > > > ----- Original Message ----- >> From: "Alon Bar-Lev" >> To: "engine-devel" >> Cc: "Yaniv Bronheim" , "Moti Asayag" >> , "Limor Gavish" , >> "Sharad Mishra" , "Alex Lourie" >> , "Sandro Bonazzola" , >> "arch" , "Ofer Schreiber" >> Sent: Sunday, May 12, 2013 2:52:51 PM >> Subject: [ANN] New development environment for ovirt-engine >> >> Hello all ovirt-engine developers, >> >> When I first joined the ovirt project, it took me about two weeks >> to set up a >> development environment, I needed to work on a bug related to >> host-deploy so >> I needed an environment that could use the ssh, PKI, vdsm-bootstrap >> and >> communicate with vdsm using SSL, this was virtually impossible to >> do >> without tweaking the product in a way that it is so different from >> production use, that I cannot guarantee that whatever is tested in >> development >> will actually work in production. >> >> I peeked at the installation script in the hope that I could create >> a partial >> environment similar to production, but I found that the packaging >> implementation makes too many assumptions and is very difficult to >> adopt. The >> fact that I do not use fedora/rhel for my development made it even >> worse. >> >> I had no other option than to create rpms after each of my changes >> and test >> each in a real production-like setup. >> >> It was obvious to me that the manual customization of developers to >> achieve a >> working product will eventually break as the product grows and moves away >> from >> being developer friendly to production friendly. For example, >> product >> defaults cannot be those which serve developers, but those which >> serve >> production best, or having a valid PKI setup cannot be optional >> any more >> as components do need to use it. Same for location of files and >> configuration, for example, if we write a pluggable infrastructure >> for >> branding, we cannot damage the interface just because developers >> run the >> product in their own manual customization. >> >> I took the opportunity handed to me to port the ovirt-engine to >> other >> distributions in order to provide a development environment that is >> similar >> to the production setup. Together with Sandro Bonazzola and Alex Lourie >> we >> re-wrote the whole installation of the product which can also be >> used to >> set up the desired development environment. >> >> Within this environment the product is set up using the same tools >> and >> configuration as in production, while the process does not require >> special >> privileges nor changes the state of the developer machine. >> >> A complete documentation is available[1], I preferred to use README >> within >> the source tree as wikis tend to quickly become obsolete, while >> documentation >> within the source tree can be modified by the commit that introduces a >> change. I >> will redirect to this file from the current wiki once the site is >> up. 
>> >> In a nutshell, after installing prerequisites, build and install >> the product >> using: >> >> $ make clean install-dev PREFIX=$HOME/ovirt-engine >> >> This will run maven and create product installation at >> $HOME/ovirt-engine >> Next, a setup phase is required just like in production, to >> initialize >> configuration and database: >> >> $ $HOME/ovirt-engine/bin/engine-setup-2 >> >> You now have a fully functional product, including PKI, SSL, >> host-deploy, >> tools. >> No manual database updates are required, no loss of functionality. >> >> All that is left is to start the engine service: >> >> $ $HOME/ovirt-engine/share/ovirt-engine/services/ovirt-engine.py >> start >> >> Access to application: >> http://localhost:8080 >> https://localhost:8443 >> Debugging port is opened at port 8787. >> >> Further information exists in the documentation[1]. >> >> There are several inherent benefits of the new environment, the >> major one is >> the ability to manage several environments in parallel on the same >> host. For >> example, if we develop two separate features on two branches we can >> install >> the product into $HOME/ovirt-engine-feature1 and >> $HOME/ovirt-engine-feature-2 and have a separate database for each, >> if we >> modify the ports jboss is listening to we can run two instances of >> engine at >> the same time! >> >> We will be happy to work with all developers to assist in porting >> into the >> new development environment, the simplest is to create a new >> database for >> this effort. Moti has a sequence of converting the existing >> database owned >> by postgres to be owned by the engine, Moti, can you please share >> that? >> >> > Reusing an existing DB schema requires a bit more work, since the > dev-env installation > advises accessing the database as a regular user and not a superuser like the > 'postgres' > user originally used to create the database. > We no longer (since 3.1) create the DB in production with user postgres. All the DB operations are done with the application user (engine by default), especially with remote DB installations. The application user doesn't have superuser privileges, only the ones required for DB creation and removal. > > Therefore, if one wishes to use user 'engine' as in the instructions, > there is a need > to change the current schema owner and also the ownership of all of > its objects. > > The easiest path I found for that purpose is: > 1. Create a dump of the database using the script in [1]. > 2. Rename the owner in the dump file to the new owner (s/OWNER TO > postgres/OWNER TO engine/g). > 3. Import the dump file to the new DB owned by the engine user using > [2] (provide -r flag to > drop the former db). > > [1] ovirt-engine/backend/manager/dbscripts/backup.sh > [2] ovirt-engine/backend/manager/dbscripts/restore.sh > 
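A minimal sketch of step 2 of the sequence above, assuming the dump was saved to a hypothetical file named engine_dump.sql:

    # Step 2 of the sequence above: rewrite object ownership in the SQL dump
    # produced by backup.sh, before restoring it as the 'engine' user.
    with open('engine_dump.sql') as f:
        dump = f.read()
    with open('engine_dump.sql', 'w') as f:
        f.write(dump.replace('OWNER TO postgres', 'OWNER TO engine'))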
>> >> A special thanks to developers who took the time to test and >> provide feedback >> before the merged: >> - Yaniv Bronheim >> - Moti Asayag >> - Limor Gavish >> - Sharad Mishra >> - Ofer Schreiber >> >> We are hoping that after migration you will be find this >> environment useful >> and friendly, >> >> Sandro Bonazzola, >> Alex Lourie, >> Alon Bar-Lev. >> >> [1] >> >> http://gerrit.ovirt.org/gitweb?p=ovirt-engine.git;a=blob;f=README.developer;hb=HEAD >> >> From michal.skrivanek at redhat.com Wed May 22 07:46:43 2013 From: michal.skrivanek at redhat.com (Michal Skrivanek) Date: Wed, 22 May 2013 09:46:43 +0200 Subject: oVirt on IBM POWER (PPC64) - new feature contributors In-Reply-To: <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br> <5193A8AE.5000600@redhat.com> <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> Message-ID: <61C18AFB-AF93-48F3-BC90-6AA1B341DF27@redhat.com> On May 21, 2013, at 17:33 , Leonardo Bianconi wrote: > Hi all, > > We are planning to deliver support for PPC64 in 4 phases. We will manage to deliver a set of patches at the end of each phase. > > Phase 1 > > Change oVirt to handle other architectures by encapsulating all the architecture specific code and queries about the capabilities of the hypervisor into a new class called ArchStrategy (based on the Strategy Design Pattern). Every operation involving clusters and hosts will be validated by this new class. Some hard-coded parameters are going to be replaced by queries in the backend in order to accommodate the support for new architectures into the engine. If you would like to discuss some internals or if anything is unclear please feel free to set up a quick mtg with myself, Omer and Roy. Sometimes (e.g. in the case of VmInterfaceType as mentioned on feature page) it would be actually beneficial for oVirt on x86 as well to have a greater flexibility directly in the code, without Strategy encapsulation. > > > Phase 2 > > Currently, each host hypervisor capabilities are obtained using hard-coded data structures. These structures will be replaced either by some form of integration with libosinfo or by reading internal configuration files describing these capabilities. It will be handled after Roy's patch. Roy's osinfo will soon be merged. > > > Phase 3 > > The code for providing the support for IBM POWER systems will be added. The encapsulation done in the previous phase will reduce the effort to include this feature into the engine. The other changes that will be introduced in this phase include: > > - Modifications in the frontend to avoid running a VM created on a POWER host in a x86-64 host (and vice-versa), > - All the dynamically provided capacities of the first phase will be implemented according to the capacities of the QEMU/KVM on POWER > - The POWER processors will be available as an option in the list of processor names (this will imply in significant changes in the backend) > > > Phase 4 > > Adapt secondary features to polish the support for POWER: > > - OVF import and export of VMs running in POWER hosts > - Dynamic searches capable of finding hosts, pools, vms and clusters according to their architectures > > > Is there any ongoing oVirt architecture refactoring ? (that may conflict with Phase 1, for instance) no, nothing major going on > > Any comments ? 
Looking forward to your patches!:-) Thanks, michal > > Regards, > Vitor (vitor.lima at eldorado.org.br) / Leonardo (leonardo.bianconi at eldorado.org.br) > > > -----Original Message----- > From: Dave Neary [mailto:dneary at redhat.com] > Sent: quarta-feira, 15 de maio de 2013 12:25 > To: Leonardo Bianconi > Cc: arch at ovirt.org; Adam Litke > Subject: Re: oVirt on IBM POWER (PPC64) - new feature contributors > > Hi, > > On 05/14/2013 08:05 PM, Leonardo Bianconi wrote: >> We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima. > > Welcome! > >> We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and the "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM. > > > >> About libosinfo: >> ============= >> >> In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), occurred in November 2012, it was suggested that integrating the libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future. >> >> Is this approach still valid? If so, when will it be available? It seems to be a dependency for oVirt "Engine_support_for_PPC64" feature implementation. > > > This is great news. I don't know who, specifically, has been working on this issue in IBM - perhaps Adam Litke (CCed) can update you on the progress that has been made. > > Cheers, > Dave. > > -- > Dave Neary - Community Action and Impact Open Source and Standards, Red Hat - http://community.redhat.com > Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13 > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch From iheim at redhat.com Sat May 25 13:11:53 2013 From: iheim at redhat.com (Itamar Heim) Date: Sat, 25 May 2013 16:11:53 +0300 Subject: oVirt on IBM POWER (PPC64) - new feature contributors In-Reply-To: <61C18AFB-AF93-48F3-BC90-6AA1B341DF27@redhat.com> References: <50EB20226B72D6419356FC320AB62B871896B600@SERV070.corp.eldorado.org.br> <5193A8AE.5000600@redhat.com> <50EB20226B72D6419356FC320AB62B871896BB19@SERV070.corp.eldorado.org.br> <61C18AFB-AF93-48F3-BC90-6AA1B341DF27@redhat.com> Message-ID: <51A0B899.9030600@redhat.com> On 05/22/2013 10:46 AM, Michal Skrivanek wrote: > > On May 21, 2013, at 17:33 , Leonardo Bianconi wrote: > >> Hi all, >> >> We are planning to deliver support for PPC64 in 4 phases. We will manage to deliver a set of patches at the end of each phase. >> >> Phase 1 >> >> Change oVirt to handle other architectures by encapsulating all the architecture specific code and queries about the capabilities of the hypervisor into a new class called ArchStrategy (based on the Strategy Design Pattern). Every operation involving clusters and hosts will be validated by this new class. Some hard-coded parameters are going to be replaced by queries in the backend in order to accommodate the support for new architectures into the engine. > > If you would like to discuss some internals or if anything is unclear please feel free to set up a quick mtg with myself, Omer and Roy. > Sometimes (e.g. in the case of VmInterfaceType as mentioned on feature page) it would be actually beneficial for oVirt on x86 as well to have a greater flexibility directly in the code, without Strategy encapsulation. 
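For readers unfamiliar with the pattern, the Phase 1 encapsulation being
discussed would look roughly like the Java sketch below. Only the names
ArchStrategy and VmInterfaceType come from the plan above; the method, the
concrete classes and the POWER restriction are invented for illustration.

    // Illustrative sketch of the Phase 1 Strategy encapsulation.
    public interface ArchStrategy {
        // e.g. which vNIC types the architecture's hypervisor can expose
        boolean supportsVmInterfaceType(VmInterfaceType type);
    }

    public class X86Strategy implements ArchStrategy {
        @Override
        public boolean supportsVmInterfaceType(VmInterfaceType type) {
            return true; // x86-64 keeps today's behaviour
        }
    }

    public class Ppc64Strategy implements ArchStrategy {
        @Override
        public boolean supportsVmInterfaceType(VmInterfaceType type) {
            return type == VmInterfaceType.pv; // hypothetical restriction
        }
    }

Engine code validating a cluster or host operation would then ask the
cluster's strategy instead of hard-coding per-architecture checks inline.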
> >> >> >> Phase 2 >> >> Currently, each host hypervisor capabilities are obtained using hard-coded data structures. These structures will be replaced either by some form of integration with libosinfo or by reading internal configuration files describing these capabilities. It will be handled after Roy's patch. > Roy's osinfo will soon be merged. link to patches? does it cover adding arch at cluster level? > >> >> >> Phase 3 >> >> The code for providing the support for IBM POWER systems will be added. The encapsulation done in the previous phase will reduce the effort to include this feature into the engine. The other changes that will be introduced in this phase include: >> >> - Modifications in the frontend to avoid running a VM created on a POWER host in a x86-64 host (and vice-versa), >> - All the dynamically provided capacities of the first phase will be implemented according to the capacities of the QEMU/KVM on POWER >> - The POWER processors will be available as an option in the list of processor names (this will imply in significant changes in the backend) >> >> >> Phase 4 >> >> Adapt secondary features to polish the support for POWER: >> >> - OVF import and export of VMs running in POWER hosts >> - Dynamic searches capable of finding hosts, pools, vms and clusters according to their architectures >> >> >> Is there any ongoing oVirt architecture refactoring ? (that may conflict with Phase 1, for instance) > no, nothing major going on > >> >> Any comments ? > Looking forward to your patches!:-) > > Thanks, > michal > >> >> Regards, >> Vitor (vitor.lima at eldorado.org.br) / Leonardo (leonardo.bianconi at eldorado.org.br) >> >> >> -----Original Message----- >> From: Dave Neary [mailto:dneary at redhat.com] >> Sent: quarta-feira, 15 de maio de 2013 12:25 >> To: Leonardo Bianconi >> Cc: arch at ovirt.org; Adam Litke >> Subject: Re: oVirt on IBM POWER (PPC64) - new feature contributors >> >> Hi, >> >> On 05/14/2013 08:05 PM, Leonardo Bianconi wrote: >>> We would like to introduce ourselves: Leonardo Bianconi and Vitor Lima. >> >> Welcome! >> >>> We would like to work on the features "Engine support for PPC64" (http://wiki.ovirt.org/Features/Engine_support_for_PPC64) and the "Vdsm for PPC64" (http://wiki.ovirt.org/Features/Vdsm_for_PPC64). This work has already been started by some developers at IBM. >> >> >> >>> About libosinfo: >>> ============= >>> >>> In the previous discussion about this topic (http://lists.ovirt.org/pipermail/arch/2012-November/000976.html), occurred in November 2012, it was suggested that integrating the libosinfo into the engine would be a better way to handle the differences between the architectures that would be supported in the future. >>> >>> Is this approach still valid? If so, when will it be available? It seems to be a dependency for oVirt "Engine_support_for_PPC64" feature implementation. >> >> >> This is great news. I don't know who, specifically, has been working on this issue in IBM - perhaps Adam Litke (CCed) can update you on the progress that has been made. > >> >> Cheers, >> Dave. 
>>
>> --
>> Dave Neary - Community Action and Impact
>> Open Source and Standards, Red Hat - http://community.redhat.com
>> Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13
>> _______________________________________________
>> Arch mailing list
>> Arch at ovirt.org
>> http://lists.ovirt.org/mailman/listinfo/arch
>
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch
>

From liumbj at linux.vnet.ibm.com Sun May 26 09:18:34 2013
From: liumbj at linux.vnet.ibm.com (Mei Liu)
Date: Sun, 26 May 2013 17:18:34 +0800
Subject: add blkIoTune support for a specific device at vm creation
In-Reply-To: <20130523153642.GD20575@redhat.com>
References: <51999389.6010705@linux.vnet.ibm.com> <1600667990.9465235.1369238126197.JavaMail.root@redhat.com> <20130523153642.GD20575@redhat.com>
Message-ID: <51A1D36A.9060705@linux.vnet.ibm.com>

On 05/23/2013 11:36 PM, Dan Kenigsberg wrote:
> On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote:
>> ----- Original Message -----
>>> From: "Mei Liu"
>>> To: arch at ovirt.org
>>> Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com
>>> Sent: Monday, May 20, 2013 6:07:53 AM
>>> Subject: add blkIoTune support for a specific device at vm creation
>>>
>>> Hi all,
>>> I would like to add blkIoTune support for a specific device at vm creation.
>>>
>>> The code parses the 'blkIoTune' description for block devices at vm
>>> creation time and adds an iotune tag accordingly.
>>>
>>> e.g.
>>> Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for a
>>> block device will add the following in the dom xml for that device:
>>>
>>> <iotune>
>>>   <read_bytes_sec>6120000</read_bytes_sec>
>>>   <total_iops_sec>800</total_iops_sec>
>>> </iotune>
>>>
>>> The patch is under review in http://gerrit.ovirt.org/#/c/14636/ .
>>>
>>> Does the patch meet the requirement of the engine or the whole
>>> architecture? Are the new parameters properly placed? Any suggestions
>>> are welcome.
>>>
>>> TIA.
>>>
>>> Best regards,
>>> Mei Liu (Rose)
>>>
>>>
>> Hi Mei Liu,
>> Apologies for my late response.
>> Is there an engine implementation for it?
>> This is important enough to be considered on how it works in a cluster,
>> not just on the host level. For example, what happens during and after
>> live migration?
> As far as I know, there's nothing written on the Engine side, yet.
>
> And in my opinion, we can aim low, and have a very minimalistic
> implementation that gives a thin GUI for each block device, where
> these two parameters can be edited, and passed to Vdsm on vmCreate,
> hotplug, and vmUpdateDevice.
>
> Obviously, such an implementation affects only the vDisk level; the
> io throttling would follow the VM to the destination node, regardless of
> other readers/writers on that host. This is suboptimal; it would be
> cooler to have a policy that provides relative io priority to different
> vDisks/VMs/users. But in my opinion, it can wait for a second phase.
>
> I'm fine with the suggested Engine/Vdsm API (though I'd try to be even
> more just-like-libvirt and call it "iotune"). But I'm no Engine expert
> to judge - they may want to hide it in their specParam map.
>
> I'd love to see a feature page written for this, anyway!
>
>
> Dan.
>
Thanks Dan and Doron for your comments. I agree that we can do this in two
phases. In the first phase, we provide the related VDSM API.

For better discussion, I created a draft wiki
http://www.ovirt.org/SLA_for_storage_resource .
We can further discuss the SLA based on this wiki.
Best regards,
Mei Liu(Rose)

From emesika at redhat.com Sun May 26 13:28:41 2013
From: emesika at redhat.com (Eli Mesika)
Date: Sun, 26 May 2013 09:28:41 -0400 (EDT)
Subject: add blkIoTune support for a specific device at vm creation
In-Reply-To: <20130523153642.GD20575@redhat.com>
References: <20130526130006.GJ27100@redhat.com>
Message-ID: <579234313.7519447.1369574921187.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Dan Kenigsberg"
> To: "Doron Fediuck" , "Eli Mesika"
> Cc: "Mei Liu" , arch at ovirt.org
> Sent: Thursday, May 23, 2013 6:36:42 PM
> Subject: Re: add blkIoTune support for a specific device at vm creation
>
> On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote:
> > ----- Original Message -----
> > > From: "Mei Liu"
> > > To: arch at ovirt.org
> > > Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com
> > > Sent: Monday, May 20, 2013 6:07:53 AM
> > > Subject: add blkIoTune support for a specific device at vm creation
> > >
> > > Hi all,
> > > I would like to add blkIoTune support for a specific device at vm
> > > creation.
> > >
> > > The code parses the 'blkIoTune' description for block devices at vm
> > > creation time and adds an iotune tag accordingly.
> > >
> > > e.g.
> > > Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800}
> > > for a block device will add the following in the dom xml for that
> > > device:
> > >
> > > <iotune>
> > >   <read_bytes_sec>6120000</read_bytes_sec>
> > >   <total_iops_sec>800</total_iops_sec>
> > > </iotune>
> > >
> > > The patch is under review in http://gerrit.ovirt.org/#/c/14636/ .
> > >
> > > Does the patch meet the requirement of the engine or the whole
> > > architecture? Are the new parameters properly placed? Any suggestions
> > > are welcome.
> > >
> > > TIA.
> > >
> > > Best regards,
> > > Mei Liu (Rose)
> > >
> > >
> > Hi Mei Liu,
> > Apologies for my late response.
> > Is there an engine implementation for it?
> > This is important enough to be considered on how it works in a cluster,
> > not just on the host level. For example, what happens during and after
> > live migration?
>
> As far as I know, there's nothing written on the Engine side, yet.
>
> And in my opinion, we can aim low, and have a very minimalistic
> implementation that gives a thin GUI for each block device, where
> these two parameters can be edited, and passed to Vdsm on vmCreate,
> hotplug, and vmUpdateDevice.
>
> Obviously, such an implementation affects only the vDisk level; the
> io throttling would follow the VM to the destination node, regardless of
> other readers/writers on that host. This is suboptimal; it would be
> cooler to have a policy that provides relative io priority to different
> vDisks/VMs/users. But in my opinion, it can wait for a second phase.
>
> I'm fine with the suggested Engine/Vdsm API (though I'd try to be even
> more just-like-libvirt and call it "iotune"). But I'm no Engine expert
> to judge - they may want to hide it in their specParam map.

Right, any reason why not to use the specParams here?

>
> I'd love to see a feature page written for this, anyway!
>
>
> Dan.
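For context, the <iotune> element under discussion sits inside a <disk>
element of the libvirt domain XML. A minimal sketch follows; the driver,
source and target lines are ordinary, illustrative disk markup, and only the
<iotune> part comes from this thread:

    <disk type='block' device='disk'>
      <driver name='qemu' type='raw'/>
      <source dev='/dev/mapper/vg-lv_vm_disk'/>
      <target dev='vda' bus='virtio'/>
      <iotune>
        <read_bytes_sec>6120000</read_bytes_sec>
        <total_iops_sec>800</total_iops_sec>
      </iotune>
    </disk>

Since the throttle hangs off a single <disk>, it travels with the VM
definition on migration - which is why Dan notes above that the limit follows
the VM to the destination node regardless of other readers/writers there.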
> From danken at redhat.com Sun May 26 15:42:51 2013 From: danken at redhat.com (Dan Kenigsberg) Date: Sun, 26 May 2013 18:42:51 +0300 Subject: add blkIoTune support for a specific device at vm creation In-Reply-To: <579234313.7519447.1369574921187.JavaMail.root@redhat.com> References: <20130526130006.GJ27100@redhat.com> <579234313.7519447.1369574921187.JavaMail.root@redhat.com> Message-ID: <20130526154251.GA2422@redhat.com> On Sun, May 26, 2013 at 09:28:41AM -0400, Eli Mesika wrote: > > > ----- Original Message ----- > > From: "Dan Kenigsberg" > > To: "Doron Fediuck" , "Eli Mesika" > > Cc: "Mei Liu" , arch at ovirt.org > > Sent: Thursday, May 23, 2013 6:36:42 PM > > Subject: Re: add blkIoTune support for a specific device at vm creation > > > > On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote: > > > ----- Original Message ----- > > > > From: "Mei Liu" > > > > To: arch at ovirt.org > > > > Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com > > > > Sent: Monday, May 20, 2013 6:07:53 AM > > > > Subject: add blkIoTune support for a specific device at vm creation > > > > > > > > Hi all, > > > > I would like to add blkIoTune support for a specific device at vm > > > > creation. > > > > > > > > The code parses the 'blkIoTune' descrption for block devices at vm > > > > creation > > > > time and adds iotune tag accordingly. > > > > > > > > e.g. > > > > Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for > > > > a > > > > block device will add the following in dom xml for that device. > > > > > > > > > > > > 6120000 > > > > 800 > > > > > > > > > > > > The patch is under review in http://gerrit.ovirt.org/#/c/14636/ . > > > > > > > > Does the patch meet the requirement of the engine or the whole > > > > architecture? Are the new parameters properly placed? Any suggestions > > > > are welcomed. > > > > > > > > TIA. > > > > > > > > Best reagrds, > > > > Mei Liu (Rose) > > > > > > > > > > > > > > Hi Mei Liu, > > > Apologies for my late response. > > > Is there an engine implementation for it? > > > This is important enough to be considered on how it works in a cluster > > > on not just on host level. For example, what happens during and after > > > live migration? > > > > As far as I know, there's nothing written on the Engine side, yet. > > > > And in my opinion, we can aim low, and have a very minimalistic > > implementation, that give a thin GUI for each block device, in where > > these two parameters can be edited, and passed to Vdsm on vmCreate, > > hotplug, and vmUpdateDevice. > > > > Obviously, such an implementation affects only in the vDisk level; the > > io throttling would follow the VM to the destination node, regardless of > > other readers/writers on that host. This is suboptimal; it would be > > cooler to have a policy that provides relative io priority to different > > vDisks/VMs/users. But in my opinion, it can wait for a second phase. > > > > I'm fine with the suggested Engine/Vdsm API (though I'd try to be even > > more just-like-libvirt and call it "iotune"). But I'm no Engine expert > > to judge - the may wnat to hide it in their specParam map. > > Right, ant reason why not to use the specParams here ??? Eli, would you remind me why we need the additional level of indirection introduced by specParams? I suspect that just like me, Mei Liu does not know the pros and cons. Dan. 
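Concretely, the indirection Dan is asking about is the difference between the
two placements below, shown in the same dict notation Mei used earlier in the
thread. The 'ioTune' spelling inside specParams is only one possibility - the
key naming is part of what is being discussed - and '...' stands for the rest
of the drive's properties:

    # as a first-class property of the drive itself:
    {'device': 'disk', ...,
     'blkIoTune': {'read_bytes_sec': 6120000, 'total_iops_sec': 800}}

    # hidden one level down, in the device's specParams map:
    {'device': 'disk', ...,
     'specParams': {'ioTune': {'read_bytes_sec': 6120000,
                               'total_iops_sec': 800}}}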
From emesika at redhat.com Sun May 26 18:33:10 2013 From: emesika at redhat.com (Eli Mesika) Date: Sun, 26 May 2013 14:33:10 -0400 (EDT) Subject: add blkIoTune support for a specific device at vm creation In-Reply-To: <20130526154251.GA2422@redhat.com> References: <20130526130006.GJ27100@redhat.com> <579234313.7519447.1369574921187.JavaMail.root@redhat.com> <20130526154251.GA2422@redhat.com> Message-ID: <90106088.7537692.1369593190810.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Dan Kenigsberg" > To: "Eli Mesika" > Cc: "Doron Fediuck" , "Mei Liu" , arch at ovirt.org > Sent: Sunday, May 26, 2013 6:42:51 PM > Subject: Re: add blkIoTune support for a specific device at vm creation > > On Sun, May 26, 2013 at 09:28:41AM -0400, Eli Mesika wrote: > > > > > > ----- Original Message ----- > > > From: "Dan Kenigsberg" > > > To: "Doron Fediuck" , "Eli Mesika" > > > > > > Cc: "Mei Liu" , arch at ovirt.org > > > Sent: Thursday, May 23, 2013 6:36:42 PM > > > Subject: Re: add blkIoTune support for a specific device at vm creation > > > > > > On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote: > > > > ----- Original Message ----- > > > > > From: "Mei Liu" > > > > > To: arch at ovirt.org > > > > > Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com > > > > > Sent: Monday, May 20, 2013 6:07:53 AM > > > > > Subject: add blkIoTune support for a specific device at vm creation > > > > > > > > > > Hi all, > > > > > I would like to add blkIoTune support for a specific device at vm > > > > > creation. > > > > > > > > > > The code parses the 'blkIoTune' descrption for block devices at vm > > > > > creation > > > > > time and adds iotune tag accordingly. > > > > > > > > > > e.g. > > > > > Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} > > > > > for > > > > > a > > > > > block device will add the following in dom xml for that device. > > > > > > > > > > > > > > > 6120000 > > > > > 800 > > > > > > > > > > > > > > > The patch is under review in http://gerrit.ovirt.org/#/c/14636/ . > > > > > > > > > > Does the patch meet the requirement of the engine or the whole > > > > > architecture? Are the new parameters properly placed? Any > > > > > suggestions > > > > > are welcomed. > > > > > > > > > > TIA. > > > > > > > > > > Best reagrds, > > > > > Mei Liu (Rose) > > > > > > > > > > > > > > > > > > Hi Mei Liu, > > > > Apologies for my late response. > > > > Is there an engine implementation for it? > > > > This is important enough to be considered on how it works in a cluster > > > > on not just on host level. For example, what happens during and after > > > > live migration? > > > > > > As far as I know, there's nothing written on the Engine side, yet. > > > > > > And in my opinion, we can aim low, and have a very minimalistic > > > implementation, that give a thin GUI for each block device, in where > > > these two parameters can be edited, and passed to Vdsm on vmCreate, > > > hotplug, and vmUpdateDevice. > > > > > > Obviously, such an implementation affects only in the vDisk level; the > > > io throttling would follow the VM to the destination node, regardless of > > > other readers/writers on that host. This is suboptimal; it would be > > > cooler to have a policy that provides relative io priority to different > > > vDisks/VMs/users. But in my opinion, it can wait for a second phase. > > > > > > I'm fine with the suggested Engine/Vdsm API (though I'd try to be even > > > more just-like-libvirt and call it "iotune"). 
But I'm no Engine expert > > > to judge - they may want to hide it in their specParam map. > > > > Right, any reason why not to use the specParams here? > > Eli, would you remind me why we need the additional level of indirection > introduced by specParams? I suspect that just like me, Mei Liu does not > know the pros and cons.

specParams - any device-specific parameters (for example, memory allocation
per monitor in a video device). So, in general, it was defined in order to
add any special device parameters (saved and managed as JSON).

> > Dan. >

From amuller at redhat.com Mon May 27 07:11:55 2013
From: amuller at redhat.com (Assaf Muller)
Date: Mon, 27 May 2013 03:11:55 -0400 (EDT)
Subject: Multiple Gateways 3.3 Feature
In-Reply-To: <1666592006.13063994.1369637963582.JavaMail.root@redhat.com>
Message-ID: <1038209955.13068617.1369638715648.JavaMail.root@redhat.com>

Hi all,

The multiple gateways feature is planned for 3.3.
http://www.ovirt.org/Features/Multiple_Gateways

The feature page explains the need for the feature, as well as the proposed
solution, in detail. In short: Multiple clients set up a host that is
connected to two networks: ovirtmgmt and a display network. Users then opened
a spice client from outside the host's network to a VM residing on the host.
The traffic reached the host, but because the host's default gateway is on
the ovirtmgmt network, return traffic came back from the wrong interface.
The clients opened tickets and eventually set up manual solutions using
source routing and multiple routing tables. The multiple gateways feature
will automate that process.

I'd love any feedback, thanks!

From mpastern at redhat.com Mon May 27 07:55:13 2013
From: mpastern at redhat.com (Michael Pasternak)
Date: Mon, 27 May 2013 10:55:13 +0300
Subject: Custom properties on vnic
In-Reply-To: <1863722697.12848012.1369583362021.JavaMail.root@redhat.com>
References: <1863722697.12848012.1369583362021.JavaMail.root@redhat.com>
Message-ID: <51A31161.9020505@redhat.com>

On 05/26/2013 06:49 PM, Alona Kaplan wrote:
> Hi Michael,
>
> In the Device custom properties feature we are adding custom properties to
> a vnic.
>
> It will work in the same way custom properties of a vm work.
> When getting the vnic, it will be displayed in the following way-
>
>
>
> When adding/updating a vnic, the custom properties will be parsed to a
> string and set on vmStatic.
>
> http://www.ovirt.org/Features/Device_Custom_Properties#REST
>
> The feature page was sent to arch at ovirt a long time ago, but I didn't
> see your response.
>
> Does the proposed rest design for the feature have an ack from your side?

yes it does.

>
> Thanks,
> Alona.
>

--

Michael Pasternak
RedHat, ENG-Virtualization R&D

From mpastern at redhat.com Mon May 27 11:18:48 2013
From: mpastern at redhat.com (Michael Pasternak)
Date: Mon, 27 May 2013 14:18:48 +0300
Subject: Qos feature
In-Reply-To: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com>
References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com>
Message-ID: <51A34118.8060204@redhat.com>

Hi Ofri,

On 05/23/2013 10:34 AM, Ofri Masad wrote:
> Hi Michael,
>
> I've drafted the Network QoS feature page.
> www.ovirt.org/Features/Design/Network_QoS
> I'll appreciate it if you could go over it and see if you have comments or
> anything else to add to the API section.
>
> Thanks,
> Ofri
>

I have a few questions/comments regarding the api/general design:

1) incoming and outgoing traffic can be shaped independently; will we support
granularity of enabling|disabling qos on inbound vs. outbound traffic? If so,
I'd suggest modelling QOS in the api like this (tags stripped in the archive;
see the reconstruction below):

   xxx yyy zzz qqq
   xxx yyy zzz

2) on the design page, I didn't see you mention a permission model for the
QOS - who should be able to see/manipulate it? (note that in the case of a
user who should not see it, the api shouldn't even show the state of the qos,
e.g. active or not; maybe it is not even worth mentioning at all)

3) what about exposing policies? i.e., making the admin able to apply the
same policy to different devices, rather than doing it manually per device.

> Change the Virtual Machine > Network Interfaces to support QoS properties
> Example of an XML representation of a network interface
>
>
>
> nic1
> virtio
>
>
>

- 'bandwidth' is not clear enough in my view; I think we should just use 'qos'

>

- we should have a true|false element here, per traffic direction?

>
>
>
>
>
> An API user modifies a network interface with a PUT request
>
>
> nic2
>
> e1000
>
>
>
>

--

Michael Pasternak
RedHat, ENG-Virtualization R&D

From lpeer at redhat.com Mon May 27 11:45:26 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Mon, 27 May 2013 14:45:26 +0300
Subject: Qos feature
In-Reply-To: <51A34118.8060204@redhat.com>
References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com>
Message-ID: <51A34756.9050902@redhat.com>

On 05/27/2013 02:18 PM, Michael Pasternak wrote:
> Hi Ofri,
>
> On 05/23/2013 10:34 AM, Ofri Masad wrote:
>> Hi Michael,
>>
>> I've drafted the Network QoS feature page.
>> www.ovirt.org/Features/Design/Network_QoS
>> I'll appreciate it if you could go over it and see if you have comments
>> or anything else to add to the API section.
>>
>> Thanks,
>> Ofri
>>
>
> I have a few questions/comments regarding the api/general design:
>
> 1) incoming and outgoing traffic can be shaped independently; will we
> support granularity of enabling|disabling qos on inbound vs. outbound
> traffic? If so, I'd suggest modelling QOS in the api like this:
>
>    xxx yyy zzz qqq
>    xxx yyy zzz
>

Note that the above requires supporting a mode where inbound (or outbound)
values are configured while not enabled; it also requires a change in the UI.

I see how the above suggestion can be useful to users.

>
> 2) on the design page, I didn't see you mention a permission model for the
> QOS - who should be able to see/manipulate it? (note that in the case of a
> user who should not see it, the api shouldn't even show the state of the
> qos, e.g. active or not; maybe it is not even worth mentioning at all)
>

That's a good point.
Also note that the above suggests different UI dialogs in the user portal
and the admin portal for adding a vnic.

>
> 3) what about exposing policies? i.e., making the admin able to apply the
> same policy to different devices, rather than doing it manually per device.

I suggest supporting a default configuration on the Network entity; all
Vnics connected to the network inherit the configuration from the Network.
If qos is configured both on the network and on the VNIC, the VNIC
configuration holds.
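Michael's point 1 sketch above lost its markup in the archive. Reconstructed,
and borrowing libvirt's traffic-shaping vocabulary for the element names
(average/peak/burst, plus floor, which libvirt accepts for inbound only -
conveniently matching the four inbound and three outbound placeholders,
though this mapping is an assumption, not something the archive preserves),
it would read something like:

    <qos>
      <inbound enabled="true|false">
        <average>xxx</average>
        <peak>yyy</peak>
        <burst>zzz</burst>
        <floor>qqq</floor>
      </inbound>
      <outbound enabled="true|false">
        <average>xxx</average>
        <peak>yyy</peak>
        <burst>zzz</burst>
      </outbound>
    </qos>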
> >> Change the Virtual Machine > Network Interfaces to support QoS properties Example of an XML representation of a network interface >> >> >> >> nic1 >> virtio >> >> >> > > - 'bandwidth' is not clear enough in my view, i think we should just use 'qos' > >> > > - we should be having true|false element here/peer traffic-dist? > >> >> >> >> >> >> >> An API user modifies a network interface with a PUT request >> >> >> nic2 >> >> e1000 >> >> >> >> >> > > From mpastern at redhat.com Mon May 27 12:32:37 2013 From: mpastern at redhat.com (Michael Pasternak) Date: Mon, 27 May 2013 15:32:37 +0300 Subject: Qos feature In-Reply-To: <51A34756.9050902@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> Message-ID: <51A35265.2060306@redhat.com> Hi Livnat, On 05/27/2013 02:45 PM, Livnat Peer wrote: > On 05/27/2013 02:18 PM, Michael Pasternak wrote: >> Hi Ofri, >> >> On 05/23/2013 10:34 AM, Ofri Masad wrote: >>> Hi Michael, >>> >>> I've drafted the Network QoS feature page. www.ovirt.org/Features/Design/Network_QoS >>> I'll appreciate it if you could go over it and see if you have comments or anything else to add to the API section. >>> >>> Thanks, >>> Ofri >>> >> >> i have few questions/comments regarding api/general design: >> >> 1) incoming and outgoing traffic can be shaped independently, will we support >> granularity of enabling|disabling qos on inbound vs.outbound traffic?, >> then i'd suggest modelling QOS in api like this: >> >> >> >> xxx >> yyy >> zzz >> qqq >> >> >> xxx >> yyy >> zzz >> >> > > > Note that the above requires to support a mode of inbound (or outbound) > values configured while not enabled, no it doesn't, all attributes can have NULLs when disabled. > it also requires a change in the UI. api & UI doesn't have to be the same, but i think suggested above makes sense for UI as well. > I see how the above suggestion can be useful to users. it can be useful for us on a first place, cause it's easily extendible, i.e it much easier to extend expended type with new elements rather than inline adding attributes and in addition we can easily add system element's (if not now then in future) such as > >> >> >> 2) at design page, i didn't saw you mentioning permission model for the QOS, >> who should be able seeing it/manipulating it (note, that in case of user >> that should not be seeing it, in api you shouldn't even show the state of the >> qos e.g active or not, maybe not even worth mentioning at all) >> > > That's a good point. > Also note that the above suggests different UI dialogs in the user > portal and the admin portal for adding vnic. > > >> >> 3) what about exposing policies?, like making admin being able to apply same policy >> on a different devices, rather than doing it manually per device. > > I suggest to support having default configuration on the Network entity, > all Vnics connected to the network inherit the configuration from the > Network. > If qos is configured both on the network and on the VNIC the VNIC > configuration holds. 
usually QoS providers avoid having defaults cause when QoS turned on by mistake it's simply applies these values silently, while if it has NULLs, you cannot turn it on by mistake (i.e forcing users to set rules preferred over default configurations) > >> >>> Change the Virtual Machine > Network Interfaces to support QoS properties Example of an XML representation of a network interface >>> >>> >>> >>> nic1 >>> virtio >>> >>> >>> >> >> - 'bandwidth' is not clear enough in my view, i think we should just use 'qos' >> >>> >> >> - we should be having true|false element here/peer traffic-dist? >> >>> >>> >>> >>> >>> >>> >>> An API user modifies a network interface with a PUT request >>> >>> >>> nic2 >>> >>> e1000 >>> >>> >>> >>> >>> >> >> > -- Michael Pasternak RedHat, ENG-Virtualization R&D From dfediuck at redhat.com Mon May 27 16:13:39 2013 From: dfediuck at redhat.com (Doron Fediuck) Date: Mon, 27 May 2013 12:13:39 -0400 (EDT) Subject: Qos feature In-Reply-To: <51A35265.2060306@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> <51A35265.2060306@redhat.com> Message-ID: <1611981565.12724418.1369671219186.JavaMail.root@redhat.com> ----- Original Message ----- > From: "Michael Pasternak" > To: "Livnat Peer" > Cc: "Ofri Masad" , arch at ovirt.org > Sent: Monday, May 27, 2013 3:32:37 PM > Subject: Re: Qos feature > > > Hi Livnat, > > On 05/27/2013 02:45 PM, Livnat Peer wrote: > > On 05/27/2013 02:18 PM, Michael Pasternak wrote: > >> Hi Ofri, > >> > >> On 05/23/2013 10:34 AM, Ofri Masad wrote: > >>> Hi Michael, > >>> > >>> I've drafted the Network QoS feature page. > >>> www.ovirt.org/Features/Design/Network_QoS > >>> I'll appreciate it if you could go over it and see if you have comments > >>> or anything else to add to the API section. > >>> > >>> Thanks, > >>> Ofri > >>> > >> > >> i have few questions/comments regarding api/general design: > >> > >> 1) incoming and outgoing traffic can be shaped independently, will we > >> support > >> granularity of enabling|disabling qos on inbound vs.outbound traffic?, > >> then i'd suggest modelling QOS in api like this: > >> > >> > >> > >> xxx > >> yyy > >> zzz > >> qqq > >> > >> > >> xxx > >> yyy > >> zzz > >> > >> > > > > > > Note that the above requires to support a mode of inbound (or outbound) > > values configured while not enabled, > > no it doesn't, all attributes can have NULLs when disabled. > I'm ok with wither. Just note that the hierarchy is deeper, and the design will be updated. Basically we have: Policy -> QoS -> Network -> In -> Out The reason for it is that going forward Policy will include CPU QoS, memory QoS, Storage QoS and Quota. Most humans will want to kill us if they need to fill in all of it every time. So policies is something which we can manage and people can actually use. If someone needs something specific he can clone a policy and make the relevant changes. In this context what we need is for a network to reference a policy. The vNIC will get the policy reference by default, and can override it by specifying a different policy. > > it also requires a change in the UI. > > api & UI doesn't have to be the same, but i think suggested above makes sense > for UI as well. > > > I see how the above suggestion can be useful to users. 
> > it can be useful for us on a first place, cause it's easily extendible, > i.e it much easier to extend expended type with new elements rather > than inline adding attributes > and in addition we can easily add system element's (if not now then in > future) > such as > > > > >> > >> > >> 2) at design page, i didn't saw you mentioning permission model for the > >> QOS, > >> who should be able seeing it/manipulating it (note, that in case of > >> user > >> that should not be seeing it, in api you shouldn't even show the state > >> of the > >> qos e.g active or not, maybe not even worth mentioning at all) > >> > > > > That's a good point. > > Also note that the above suggests different UI dialogs in the user > > portal and the admin portal for adding vnic. > > The assumption is that users with permissions to modify a vNIC can get/set the information in REST. > > > >> > >> 3) what about exposing policies?, like making admin being able to apply > >> same policy > >> on a different devices, rather than doing it manually per device. > > > > I suggest to support having default configuration on the Network entity, > > all Vnics connected to the network inherit the configuration from the > > Network. > > If qos is configured both on the network and on the VNIC the VNIC > > configuration holds. > > usually QoS providers avoid having defaults cause when QoS turned on by > mistake it's simply applies these values silently, while if it has NULLs, > you cannot turn it on by mistake (i.e forcing users to set rules preferred > over default configurations) > See my response above on policies. As for setting it, well, we prefer to start with a baseline which can be later changed. This is better than starting in 'unlimited' mode we currently have. In this way potential bursts are still in control and if more resources needed than a matching policy should be assigned. > > > >> > >>> Change the Virtual Machine > Network Interfaces to support QoS properties > >>> Example of an XML representation of a network interface > >>> > >>> >>> ref="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e"> > >>> >>> href="/api/vms/082c794b-771f-452f-83c9-b2b5a19c0399/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e/statistics"/> > >>> nic1 > >>> virtio > >>> > >>> >>> href="/api/networks/00000000-0000-0000-0000-000000000009"/> > >>> >>> href="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401"/> > >> > >> - 'bandwidth' is not clear enough in my view, i think we should just use > >> 'qos' > >> > >>> > >> > >> - we should be having true|false element here/peer > >> traffic-dist? 
> >> > >>> > >>> > >>> > >>> > >>> > >>> > >>> An API user modifies a network interface with a PUT request > >>> > >>> > >>> nic2 > >>> > >>> e1000 > >>> > >>> > >>> > >>> > >>> > >> > >> > > > > > -- > > Michael Pasternak > RedHat, ENG-Virtualization R&D > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > From lpeer at redhat.com Mon May 27 16:55:27 2013 From: lpeer at redhat.com (Livnat Peer) Date: Mon, 27 May 2013 19:55:27 +0300 Subject: Qos feature In-Reply-To: <51A35265.2060306@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> <51A35265.2060306@redhat.com> Message-ID: <51A38FFF.8060705@redhat.com> On 05/27/2013 03:32 PM, Michael Pasternak wrote: > > Hi Livnat, > > On 05/27/2013 02:45 PM, Livnat Peer wrote: >> On 05/27/2013 02:18 PM, Michael Pasternak wrote: >>> Hi Ofri, >>> >>> On 05/23/2013 10:34 AM, Ofri Masad wrote: >>>> Hi Michael, >>>> >>>> I've drafted the Network QoS feature page. www.ovirt.org/Features/Design/Network_QoS >>>> I'll appreciate it if you could go over it and see if you have comments or anything else to add to the API section. >>>> >>>> Thanks, >>>> Ofri >>>> >>> >>> i have few questions/comments regarding api/general design: >>> >>> 1) incoming and outgoing traffic can be shaped independently, will we support >>> granularity of enabling|disabling qos on inbound vs.outbound traffic?, >>> then i'd suggest modelling QOS in api like this: >>> >>> >>> >>> xxx >>> yyy >>> zzz >>> qqq >>> >>> >>> xxx >>> yyy >>> zzz >>> >>> >> >> >> Note that the above requires to support a mode of inbound (or outbound) >> values configured while not enabled, > > no it doesn't, all attributes can have NULLs when disabled. > Why disable+null values is better than just omitting the inbound/outbound elements? I liked the disable/enable idea... >> it also requires a change in the UI. > > api & UI doesn't have to be the same, but i think suggested above makes sense > for UI as well. > of course, I agree, they don't have to be the same. In this case I think it makes sense. >> I see how the above suggestion can be useful to users. > > it can be useful for us on a first place, cause it's easily extendible, > i.e it much easier to extend expended type with new elements rather > than inline adding attributes > and in addition we can easily add system element's (if not now then in future) > such as > >> >>> >>> >>> 2) at design page, i didn't saw you mentioning permission model for the QOS, >>> who should be able seeing it/manipulating it (note, that in case of user >>> that should not be seeing it, in api you shouldn't even show the state of the >>> qos e.g active or not, maybe not even worth mentioning at all) >>> >> >> That's a good point. >> Also note that the above suggests different UI dialogs in the user >> portal and the admin portal for adding vnic. >> >> >>> >>> 3) what about exposing policies?, like making admin being able to apply same policy >>> on a different devices, rather than doing it manually per device. >> >> I suggest to support having default configuration on the Network entity, >> all Vnics connected to the network inherit the configuration from the >> Network. >> If qos is configured both on the network and on the VNIC the VNIC >> configuration holds. 
> > usually QoS providers avoid having defaults cause when QoS turned on by > mistake it's simply applies these values silently, while if it has NULLs, > you cannot turn it on by mistake (i.e forcing users to set rules preferred > over default configurations) > I think the idea wasn't clear enough, a more detailed explanation - A user can configure the inbound/outbound values on the network entity. This configuration would apply to all VNICs using this network. That approach would help the administrator to configure, in a very simple way, a policy that prevents one VM from taking too much of the network bandwidth. >> >>> >>>> Change the Virtual Machine > Network Interfaces to support QoS properties Example of an XML representation of a network interface >>>> >>>> >>>> >>>> nic1 >>>> virtio >>>> >>>> >>>> >>> >>> - 'bandwidth' is not clear enough in my view, i think we should just use 'qos' >>> >>>> >>> >>> - we should be having true|false element here/peer traffic-dist? >>> >>>> >>>> >>>> >>>> >>>> >>>> >>>> An API user modifies a network interface with a PUT request >>>> >>>> >>>> nic2 >>>> >>>> e1000 >>>> >>>> >>>> >>>> >>>> >>> >>> >> > > From lpeer at redhat.com Mon May 27 18:06:39 2013 From: lpeer at redhat.com (Livnat Peer) Date: Mon, 27 May 2013 21:06:39 +0300 Subject: Qos feature In-Reply-To: <1611981565.12724418.1369671219186.JavaMail.root@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> <51A35265.2060306@redhat.com> <1611981565.12724418.1369671219186.JavaMail.root@redhat.com> Message-ID: <51A3A0AF.3080109@redhat.com> On 05/27/2013 07:13 PM, Doron Fediuck wrote: > > > ----- Original Message ----- >> From: "Michael Pasternak" >> To: "Livnat Peer" >> Cc: "Ofri Masad" , arch at ovirt.org >> Sent: Monday, May 27, 2013 3:32:37 PM >> Subject: Re: Qos feature >> >> >> Hi Livnat, >> >> On 05/27/2013 02:45 PM, Livnat Peer wrote: >>> On 05/27/2013 02:18 PM, Michael Pasternak wrote: >>>> Hi Ofri, >>>> >>>> On 05/23/2013 10:34 AM, Ofri Masad wrote: >>>>> Hi Michael, >>>>> >>>>> I've drafted the Network QoS feature page. >>>>> www.ovirt.org/Features/Design/Network_QoS >>>>> I'll appreciate it if you could go over it and see if you have comments >>>>> or anything else to add to the API section. >>>>> >>>>> Thanks, >>>>> Ofri >>>>> >>>> >>>> i have few questions/comments regarding api/general design: >>>> >>>> 1) incoming and outgoing traffic can be shaped independently, will we >>>> support >>>> granularity of enabling|disabling qos on inbound vs.outbound traffic?, >>>> then i'd suggest modelling QOS in api like this: >>>> >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> qqq >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> >>>> >>> >>> >>> Note that the above requires to support a mode of inbound (or outbound) >>> values configured while not enabled, >> >> no it doesn't, all attributes can have NULLs when disabled. >> > > I'm ok with wither. > Just note that the hierarchy is deeper, and the design will be updated. > Basically we have: > Policy > -> QoS > -> Network > -> In > -> Out > > The reason for it is that going forward Policy will include CPU QoS, memory > QoS, Storage QoS and Quota. > Most humans will want to kill us if they need to fill in all of it every > time. So policies is something which we can manage and people can actually > use. If someone needs something specific he can clone a policy and make the > relevant changes. > In this context what we need is for a network to reference a policy. 
The > vNIC will get the policy reference by default, and can override it by > specifying a different policy. I assume setting QoS is done per vnic, I see above you wrote network. Can you describe how setting a policy per vnic would look like in your mind? > >>> it also requires a change in the UI. >> >> api & UI doesn't have to be the same, but i think suggested above makes sense >> for UI as well. >> >>> I see how the above suggestion can be useful to users. >> >> it can be useful for us on a first place, cause it's easily extendible, >> i.e it much easier to extend expended type with new elements rather >> than inline adding attributes >> and in addition we can easily add system element's (if not now then in >> future) >> such as >> >>> >>>> >>>> >>>> 2) at design page, i didn't saw you mentioning permission model for the >>>> QOS, >>>> who should be able seeing it/manipulating it (note, that in case of >>>> user >>>> that should not be seeing it, in api you shouldn't even show the state >>>> of the >>>> qos e.g active or not, maybe not even worth mentioning at all) >>>> >>> >>> That's a good point. >>> Also note that the above suggests different UI dialogs in the user >>> portal and the admin portal for adding vnic. >>> > > The assumption is that users with permissions to modify a vNIC can > get/set the information in REST. A user can add a VNIC to his VM if he has permission to use the network, I'm not sure it make sense that he can also configure his own QoS. I agree with Michael that configuring the QoS might not be relevant to the user API/user Portal. I would say that we should add a new permission on the network entity for a user to configure QoS on that network. > >>> >>>> >>>> 3) what about exposing policies?, like making admin being able to apply >>>> same policy >>>> on a different devices, rather than doing it manually per device. >>> >>> I suggest to support having default configuration on the Network entity, >>> all Vnics connected to the network inherit the configuration from the >>> Network. >>> If qos is configured both on the network and on the VNIC the VNIC >>> configuration holds. >> >> usually QoS providers avoid having defaults cause when QoS turned on by >> mistake it's simply applies these values silently, while if it has NULLs, >> you cannot turn it on by mistake (i.e forcing users to set rules preferred >> over default configurations) >> > See my response above on policies. > As for setting it, well, we prefer to start with a baseline which can be later > changed. This is better than starting in 'unlimited' mode we currently have. > In this way potential bursts are still in control and if more resources needed > than a matching policy should be assigned. > >>> >>>> >>>>> Change the Virtual Machine > Network Interfaces to support QoS properties >>>>> Example of an XML representation of a network interface >>>>> >>>>> >>>> ref="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e"> >>>>> >>>> href="/api/vms/082c794b-771f-452f-83c9-b2b5a19c0399/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e/statistics"/> >>>>> nic1 >>>>> virtio >>>>> >>>>> >>>> href="/api/networks/00000000-0000-0000-0000-000000000009"/> >>>>> >>>> href="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401"/> >>>> >>>> - 'bandwidth' is not clear enough in my view, i think we should just use >>>> 'qos' >>>> >>>>> >>>> >>>> - we should be having true|false element here/peer >>>> traffic-dist? 
>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> An API user modifies a network interface with a PUT request >>>>> >>>>> >>>>> nic2 >>>>> >>>>> e1000 >>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> >>> >> >> >> -- >> >> Michael Pasternak >> RedHat, ENG-Virtualization R&D >> _______________________________________________ >> Arch mailing list >> Arch at ovirt.org >> http://lists.ovirt.org/mailman/listinfo/arch >> > _______________________________________________ > Arch mailing list > Arch at ovirt.org > http://lists.ovirt.org/mailman/listinfo/arch > > From mpastern at redhat.com Tue May 28 07:34:55 2013 From: mpastern at redhat.com (Michael Pasternak) Date: Tue, 28 May 2013 10:34:55 +0300 Subject: Qos feature In-Reply-To: <1611981565.12724418.1369671219186.JavaMail.root@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> <51A35265.2060306@redhat.com> <1611981565.12724418.1369671219186.JavaMail.root@redhat.com> Message-ID: <51A45E1F.4010300@redhat.com> On 05/27/2013 07:13 PM, Doron Fediuck wrote: > > > ----- Original Message ----- >> From: "Michael Pasternak" >> To: "Livnat Peer" >> Cc: "Ofri Masad" , arch at ovirt.org >> Sent: Monday, May 27, 2013 3:32:37 PM >> Subject: Re: Qos feature >> >> >> Hi Livnat, >> >> On 05/27/2013 02:45 PM, Livnat Peer wrote: >>> On 05/27/2013 02:18 PM, Michael Pasternak wrote: >>>> Hi Ofri, >>>> >>>> On 05/23/2013 10:34 AM, Ofri Masad wrote: >>>>> Hi Michael, >>>>> >>>>> I've drafted the Network QoS feature page. >>>>> www.ovirt.org/Features/Design/Network_QoS >>>>> I'll appreciate it if you could go over it and see if you have comments >>>>> or anything else to add to the API section. >>>>> >>>>> Thanks, >>>>> Ofri >>>>> >>>> >>>> i have few questions/comments regarding api/general design: >>>> >>>> 1) incoming and outgoing traffic can be shaped independently, will we >>>> support >>>> granularity of enabling|disabling qos on inbound vs.outbound traffic?, >>>> then i'd suggest modelling QOS in api like this: >>>> >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> qqq >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> >>>> >>> >>> >>> Note that the above requires to support a mode of inbound (or outbound) >>> values configured while not enabled, >> >> no it doesn't, all attributes can have NULLs when disabled. >> > > I'm ok with wither. > Just note that the hierarchy is deeper, and the design will be updated. > Basically we have: > Policy > -> QoS > -> Network > -> In > -> Out > > The reason for it is that going forward Policy will include CPU QoS, memory > QoS, Storage QoS and Quota. okay, but CPU is not relevant in the context of nic, but if you say we will add more similar features under same devices, we can have sla_policy or shaping_policy (sla_ sounds less aggressive in my view) > Most humans will want to kill us if they need to fill in all of it every > time. So policies is something which we can manage and people can actually > use. If someone needs something specific he can clone a policy and make the > relevant changes. this why i suggested using it :) > In this context what we need is for a network to reference a policy. The > vNIC will get the policy reference by default, and can override it by > specifying a different policy. in my opinion, no default policies should be used (i put my two cents in other email on this tread (to livnat)) > >>> it also requires a change in the UI. >> >> api & UI doesn't have to be the same, but i think suggested above makes sense >> for UI as well. 
>> >>> I see how the above suggestion can be useful to users. >> >> it can be useful for us on a first place, cause it's easily extendible, >> i.e it much easier to extend expended type with new elements rather >> than inline adding attributes >> and in addition we can easily add system element's (if not now then in >> future) >> such as >> >>> >>>> >>>> >>>> 2) at design page, i didn't saw you mentioning permission model for the >>>> QOS, >>>> who should be able seeing it/manipulating it (note, that in case of >>>> user >>>> that should not be seeing it, in api you shouldn't even show the state >>>> of the >>>> qos e.g active or not, maybe not even worth mentioning at all) >>>> >>> >>> That's a good point. >>> Also note that the above suggests different UI dialogs in the user >>> portal and the admin portal for adding vnic. >>> > > The assumption is that users with permissions to modify a vNIC can > get/set the information in REST. i think it's a bit deeper, it should be handled by the administration, while regular NIC changes by technical personnel. > >>> >>>> >>>> 3) what about exposing policies?, like making admin being able to apply >>>> same policy >>>> on a different devices, rather than doing it manually per device. >>> >>> I suggest to support having default configuration on the Network entity, >>> all Vnics connected to the network inherit the configuration from the >>> Network. >>> If qos is configured both on the network and on the VNIC the VNIC >>> configuration holds. >> >> usually QoS providers avoid having defaults cause when QoS turned on by >> mistake it's simply applies these values silently, while if it has NULLs, >> you cannot turn it on by mistake (i.e forcing users to set rules preferred >> over default configurations) >> > See my response above on policies. > As for setting it, well, we prefer to start with a baseline which can be later > changed. This is better than starting in 'unlimited' mode we currently have. > In this way potential bursts are still in control and if more resources needed > than a matching policy should be assigned. > >>> >>>> >>>>> Change the Virtual Machine > Network Interfaces to support QoS properties >>>>> Example of an XML representation of a network interface >>>>> >>>>> >>>> ref="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e"> >>>>> >>>> href="/api/vms/082c794b-771f-452f-83c9-b2b5a19c0399/nics/7a3cff5e-3cc4-47c2-8388-9adf16341f5e/statistics"/> >>>>> nic1 >>>>> virtio >>>>> >>>>> >>>> href="/api/networks/00000000-0000-0000-0000-000000000009"/> >>>>> >>>> href="/api/vms/cdc0b102-fbfe-444a-b9cb-57d2af94f401"/> >>>> >>>> - 'bandwidth' is not clear enough in my view, i think we should just use >>>> 'qos' >>>> >>>>> >>>> >>>> - we should be having true|false element here/peer >>>> traffic-dist? 
>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> An API user modifies a network interface with a PUT request >>>>> >>>>> >>>>> nic2 >>>>> >>>>> e1000 >>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> >>> >> >> >> -- >> >> Michael Pasternak >> RedHat, ENG-Virtualization R&D >> _______________________________________________ >> Arch mailing list >> Arch at ovirt.org >> http://lists.ovirt.org/mailman/listinfo/arch >> -- Michael Pasternak RedHat, ENG-Virtualization R&D From mpastern at redhat.com Tue May 28 07:49:40 2013 From: mpastern at redhat.com (Michael Pasternak) Date: Tue, 28 May 2013 10:49:40 +0300 Subject: Qos feature In-Reply-To: <51A38FFF.8060705@redhat.com> References: <2041139267.3543084.1369294470496.JavaMail.root@redhat.com> <51A34118.8060204@redhat.com> <51A34756.9050902@redhat.com> <51A35265.2060306@redhat.com> <51A38FFF.8060705@redhat.com> Message-ID: <51A46194.7080409@redhat.com> On 05/27/2013 07:55 PM, Livnat Peer wrote: > On 05/27/2013 03:32 PM, Michael Pasternak wrote: >> >> Hi Livnat, >> >> On 05/27/2013 02:45 PM, Livnat Peer wrote: >>> On 05/27/2013 02:18 PM, Michael Pasternak wrote: >>>> Hi Ofri, >>>> >>>> On 05/23/2013 10:34 AM, Ofri Masad wrote: >>>>> Hi Michael, >>>>> >>>>> I've drafted the Network QoS feature page. www.ovirt.org/Features/Design/Network_QoS >>>>> I'll appreciate it if you could go over it and see if you have comments or anything else to add to the API section. >>>>> >>>>> Thanks, >>>>> Ofri >>>>> >>>> >>>> i have few questions/comments regarding api/general design: >>>> >>>> 1) incoming and outgoing traffic can be shaped independently, will we support >>>> granularity of enabling|disabling qos on inbound vs.outbound traffic?, >>>> then i'd suggest modelling QOS in api like this: >>>> >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> qqq >>>> >>>> >>>> xxx >>>> yyy >>>> zzz >>>> >>>> >>> >>> >>> Note that the above requires to support a mode of inbound (or outbound) >>> values configured while not enabled, >> >> no it doesn't, all attributes can have NULLs when disabled. >> > > Why disable+null values is better than just omitting the > inbound/outbound elements? basically this design serves my proposal mentioned below, so administration can clearly see that no defaults exist, and they have to choose policy explicitly. but i agree "omitting" is a good way to go as well (we actively using it across the api) > > I liked the disable/enable idea... this is exactly the same idea, just a bit visualized. > >>> it also requires a change in the UI. >> >> api & UI doesn't have to be the same, but i think suggested above makes sense >> for UI as well. >> > of course, I agree, they don't have to be the same. In this case I think > it makes sense. > >>> I see how the above suggestion can be useful to users. >> >> it can be useful for us on a first place, cause it's easily extendible, >> i.e it much easier to extend expended type with new elements rather >> than inline adding attributes >> and in addition we can easily add system element's (if not now then in future) >> such as >> >>> >>>> >>>> >>>> 2) at design page, i didn't saw you mentioning permission model for the QOS, >>>> who should be able seeing it/manipulating it (note, that in case of user >>>> that should not be seeing it, in api you shouldn't even show the state of the >>>> qos e.g active or not, maybe not even worth mentioning at all) >>>> >>> >>> That's a good point. >>> Also note that the above suggests different UI dialogs in the user >>> portal and the admin portal for adding vnic. 
>>> >>> >>>> >>>> 3) what about exposing policies?, like making admin being able to apply same policy >>>> on a different devices, rather than doing it manually per device. >>> >>> I suggest to support having default configuration on the Network entity, >>> all Vnics connected to the network inherit the configuration from the >>> Network. >>> If qos is configured both on the network and on the VNIC the VNIC >>> configuration holds. >> >> usually QoS providers avoid having defaults cause when QoS turned on by >> mistake it's simply applies these values silently, while if it has NULLs, >> you cannot turn it on by mistake (i.e forcing users to set rules preferred >> over default configurations) >> > > I think the idea wasn't clear enough, a more detailed explanation - > > A user can configure the inbound/outbound values on the network entity. > This configuration would apply to all VNICs using this network. > That approach would help the administrator to configure, in a very > simple way, a policy that prevents one VM from taking too much of the > network bandwidth. i understand, but this is different use-case, i was thinking about private policies (i'm sure there are plenty use-cases for it), this way we can help by providing policy management, like: /api/slapolicies backup_qos_policy network xxx yyy zzz qqq xxx yyy zzz user_cpu_policy cpu ... user_memory_policy memory ... ... > > > >>> >>>> >>>>> Change the Virtual Machine > Network Interfaces to support QoS properties Example of an XML representation of a network interface >>>>> >>>>> >>>>> >>>>> nic1 >>>>> virtio >>>>> >>>>> >>>>> >>>> >>>> - 'bandwidth' is not clear enough in my view, i think we should just use 'qos' >>>> >>>>> >>>> >>>> - we should be having true|false element here/peer traffic-dist? >>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> >>>>> An API user modifies a network interface with a PUT request >>>>> >>>>> >>>>> nic2 >>>>> >>>>> e1000 >>>>> >>>>> >>>>> >>>>> >>>>> >>>> >>>> >>> >> >> > -- Michael Pasternak RedHat, ENG-Virtualization R&D From liumbj at linux.vnet.ibm.com Tue May 28 08:03:49 2013 From: liumbj at linux.vnet.ibm.com (Mei Liu) Date: Tue, 28 May 2013 16:03:49 +0800 Subject: add blkIoTune support for a specific device at vm creation In-Reply-To: <579234313.7519447.1369574921187.JavaMail.root@redhat.com> References: <20130526130006.GJ27100@redhat.com> <579234313.7519447.1369574921187.JavaMail.root@redhat.com> Message-ID: <51A464E5.4020908@linux.vnet.ibm.com> On 05/26/2013 09:28 PM, Eli Mesika wrote: > > ----- Original Message ----- >> From: "Dan Kenigsberg" >> To: "Doron Fediuck" , "Eli Mesika" >> Cc: "Mei Liu" , arch at ovirt.org >> Sent: Thursday, May 23, 2013 6:36:42 PM >> Subject: Re: add blkIoTune support for a specific device at vm creation >> >> On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote: >>> ----- Original Message ----- >>>> From: "Mei Liu" >>>> To: arch at ovirt.org >>>> Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com >>>> Sent: Monday, May 20, 2013 6:07:53 AM >>>> Subject: add blkIoTune support for a specific device at vm creation >>>> >>>> Hi all, >>>> I would like to add blkIoTune support for a specific device at vm >>>> creation. >>>> >>>> The code parses the 'blkIoTune' descrption for block devices at vm >>>> creation >>>> time and adds iotune tag accordingly. >>>> >>>> e.g. >>>> Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800} for >>>> a >>>> block device will add the following in dom xml for that device. 
>>>>
>>>> <iotune>
>>>>   <read_bytes_sec>6120000</read_bytes_sec>
>>>>   <total_iops_sec>800</total_iops_sec>
>>>> </iotune>
>>>>
>>>> The patch is under review in http://gerrit.ovirt.org/#/c/14636/ .
>>>>
>>>> Does the patch meet the requirement of the engine or the whole
>>>> architecture? Are the new parameters properly placed? Any suggestions
>>>> are welcomed.
>>>>
>>>> TIA.
>>>>
>>>> Best regards,
>>>> Mei Liu (Rose)
>>>>
>>> Hi Mei Liu,
>>> Apologies for my late response.
>>> Is there an engine implementation for it?
>>> This is important enough to be considered for how it works in a cluster,
>>> not just at the host level. For example, what happens during and after
>>> live migration?
>>
>> As far as I know, there's nothing written on the Engine side, yet.
>>
>> And in my opinion, we can aim low, and have a very minimalistic
>> implementation that gives a thin GUI for each block device, where
>> these two parameters can be edited, and passed to Vdsm on vmCreate,
>> hotplug, and vmUpdateDevice.
>>
>> Obviously, such an implementation affects only the vDisk level; the
>> io throttling would follow the VM to the destination node, regardless of
>> other readers/writers on that host. This is suboptimal; it would be
>> cooler to have a policy that provides relative io priority to different
>> vDisks/VMs/users. But in my opinion, it can wait for a second phase.
>>
>> I'm fine with the suggested Engine/Vdsm API (though I'd try to be even
>> more just-like-libvirt and call it "iotune"). But I'm no Engine expert
>> to judge - they may want to hide it in their specParams map.
>
> Right, any reason why not to use the specParams here ???

No specific reason for doing so. I moved them to specParams.

>
>> I'd love to see a feature page written for this, anyway!
>>
>> Dan.
>>

I have updated the patch in http://gerrit.ovirt.org/#/c/14636/ and put
these parameters in specParams.
Appreciate that you take a look and give suggestions.

From liumbj at linux.vnet.ibm.com Tue May 28 08:18:10 2013
From: liumbj at linux.vnet.ibm.com (Mei Liu)
Date: Tue, 28 May 2013 16:18:10 +0800
Subject: SLA feature for storage I/O bandwidth
Message-ID: <51A46842.1090901@linux.vnet.ibm.com>

Hi all,

I created a draft wiki page on the design of storage I/O bandwidth SLA at
the following link:

http://www.ovirt.org/SLA_for_storage_resource .

I will appreciate it if anyone who works on ovirt engine, vdsm or mom
could give some comments. TIA.
Best regards,
Mei Liu (Rose)

From emesika at redhat.com Tue May 28 08:32:27 2013
From: emesika at redhat.com (Eli Mesika)
Date: Tue, 28 May 2013 04:32:27 -0400 (EDT)
Subject: add blkIoTune support for a specific device at vm creation
In-Reply-To: <51A464E5.4020908@linux.vnet.ibm.com>
References: <20130526130006.GJ27100@redhat.com> <579234313.7519447.1369574921187.JavaMail.root@redhat.com> <51A464E5.4020908@linux.vnet.ibm.com>
Message-ID: <324845681.8227016.1369729947029.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Mei Liu"
> To: "Eli Mesika" , "Dan Kenigsberg"
> Cc: "Doron Fediuck" , arch at ovirt.org
> Sent: Tuesday, May 28, 2013 11:03:49 AM
> Subject: Re: add blkIoTune support for a specific device at vm creation
>
> On 05/26/2013 09:28 PM, Eli Mesika wrote:
> >
> > ----- Original Message -----
> >> From: "Dan Kenigsberg"
> >> To: "Doron Fediuck" , "Eli Mesika"
> >> Cc: "Mei Liu" , arch at ovirt.org
> >> Sent: Thursday, May 23, 2013 6:36:42 PM
> >> Subject: Re: add blkIoTune support for a specific device at vm creation
> >>
> >> On Wed, May 22, 2013 at 11:55:26AM -0400, Doron Fediuck wrote:
> >>> ----- Original Message -----
> >>>> From: "Mei Liu"
> >>>> To: arch at ovirt.org
> >>>> Cc: dfediuck at redhat.com, wudxw at linux.vnet.ibm.com
> >>>> Sent: Monday, May 20, 2013 6:07:53 AM
> >>>> Subject: add blkIoTune support for a specific device at vm creation
> >>>>
> >>>> Hi all,
> >>>> I would like to add blkIoTune support for a specific device at vm
> >>>> creation.
> >>>>
> >>>> The code parses the 'blkIoTune' description for block devices at vm
> >>>> creation time and adds the iotune tag accordingly.
> >>>>
> >>>> e.g.
> >>>> Adding 'blkIoTune':{'read_bytes_sec': 6120000, 'total_iops_sec': 800}
> >>>> for a block device will add the following in the dom xml for that device.
> >>>>
> >>>> <iotune>
> >>>>   <read_bytes_sec>6120000</read_bytes_sec>
> >>>>   <total_iops_sec>800</total_iops_sec>
> >>>> </iotune>
> >>>>
> >>>> The patch is under review in http://gerrit.ovirt.org/#/c/14636/ .
> >>>>
> >>>> Does the patch meet the requirement of the engine or the whole
> >>>> architecture? Are the new parameters properly placed? Any suggestions
> >>>> are welcomed.
> >>>>
> >>>> TIA.
> >>>>
> >>>> Best regards,
> >>>> Mei Liu (Rose)
> >>>>
> >>> Hi Mei Liu,
> >>> Apologies for my late response.
> >>> Is there an engine implementation for it?
> >>> This is important enough to be considered for how it works in a cluster,
> >>> not just at the host level. For example, what happens during and after
> >>> live migration?
> >> As far as I know, there's nothing written on the Engine side, yet.
> >>
> >> And in my opinion, we can aim low, and have a very minimalistic
> >> implementation that gives a thin GUI for each block device, where
> >> these two parameters can be edited, and passed to Vdsm on vmCreate,
> >> hotplug, and vmUpdateDevice.
> >>
> >> Obviously, such an implementation affects only the vDisk level; the
> >> io throttling would follow the VM to the destination node, regardless of
> >> other readers/writers on that host. This is suboptimal; it would be
> >> cooler to have a policy that provides relative io priority to different
> >> vDisks/VMs/users. But in my opinion, it can wait for a second phase.
> >>
> >> I'm fine with the suggested Engine/Vdsm API (though I'd try to be even
> >> more just-like-libvirt and call it "iotune"). But I'm no Engine expert
> >> to judge - they may want to hide it in their specParams map.
> > Right, any reason why not to use the specParams here ???
>
> No specific reason for doing so.
> I moved them to specParams.

Thanks

> >> I'd love to see a feature page written for this, anyway!
> >>
> >> Dan.
> >>
> I have updated the patch in http://gerrit.ovirt.org/#/c/14636/ and put
> these parameters in specParams.
> Appreciate that you take a look and give suggestions.

Looks good, gave +1

From mgoldboi at redhat.com Tue May 28 14:01:11 2013
From: mgoldboi at redhat.com (Moran Goldboim)
Date: Tue, 28 May 2013 17:01:11 +0300
Subject: [3.3 release process] feature freeze readiness
In-Reply-To: <519DF477.9000300@redhat.com>
References: <519DF477.9000300@redhat.com>
Message-ID: <51A4B8A7.4070809@redhat.com>

On 05/23/2013 01:50 PM, Moran Goldboim wrote:
> we are about a week away from the feature freeze of oVirt 3.3 (May 31st).
> In order to be able to converge on this date we need your help
> (feature owners) in understanding feature/overall status.
>
> I have updated the OVirt 3.3 release-management page [1] with a features
> status table [2]. Having this table filled in completely will help us
> determine this version's readiness.
>
> a test page addition is needed for each feature (linked through the
> table) - this will give us good visibility and testing of those
> features on test-day.
>
> I have left the old 3.3 feature overview just below the new table, to
> make sure I have taken all that is needed from that info. please verify
> that all you need is in the table, since this part will be deleted on
> the feature freeze readiness review - next Wednesday's meeting.
>
> completing this table should be done no later than Tuesday May 28th.
>
> Thanks.
>
> [1] http://www.ovirt.org/OVirt_3.3_release-management
> [2] http://www.ovirt.org/OVirt_3.3_release-management#Features_Status_Table
> [3] http://www.ovirt.org/OVirt_3.3_release-management#Features
>
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch

Thanks to everyone who updated their status in the table.
There are still missing features around virt, infra, storage and sla;
please complete the statuses of these missing features today so we can
assess the release status in tomorrow's meeting.

Thanks.

From dneary at redhat.com Wed May 29 07:42:40 2013
From: dneary at redhat.com (Dave Neary)
Date: Wed, 29 May 2013 09:42:40 +0200
Subject: SLA feature for storage I/O bandwidth
In-Reply-To: <51A46842.1090901@linux.vnet.ibm.com>
References: <51A46842.1090901@linux.vnet.ibm.com>
Message-ID: <51A5B170.50205@redhat.com>

Hi Mei Liu,

On 05/28/2013 10:18 AM, Mei Liu wrote:
> I created a draft wiki page on the design of storage I/O bandwidth SLA at
> the following link:
>
> http://www.ovirt.org/SLA_for_storage_resource .
>
> I will appreciate it if anyone who works on ovirt engine, vdsm or mom
> could give some comments. TIA.

Just out of interest - which version of oVirt are you targeting this
for? Can I assume that it's for post-3.3? Today is officially the 3.3
feature freeze (but we have a release meeting later to discuss that).

Thanks,
Dave.
--
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From dneary at redhat.com Wed May 29 07:55:30 2013
From: dneary at redhat.com (Dave Neary)
Date: Wed, 29 May 2013 09:55:30 +0200
Subject: [3.3 release process] feature freeze readiness
In-Reply-To: <51A4B8A7.4070809@redhat.com>
References: <519DF477.9000300@redhat.com> <51A4B8A7.4070809@redhat.com>
Message-ID: <51A5B472.3030509@redhat.com>

Hi,

On 05/28/2013 04:01 PM, Moran Goldboim wrote:
> Thanks to everyone who updated their status in the table.
> There are still missing features around virt, infra, storage and sla;
> please complete the statuses of these missing features today so we can
> assess the release status in tomorrow's meeting.

The features still missing a status and owner are:
* GlusterFS Storage Domain
* Allow resign/force re-election of SPM

Also, a number of features have not set a release priority - especially
for red and orange features, we need to know whether the feature is
considered "must"/"should"/"optional" for 3.3, because those are the ones
that will end up delaying a release.
* RAM snapshots
* noVNC console
* Non Plugin RDP Invocation
* Instance Types

After that, we can start looking at red flag features to help set the
release date. A few things jumped out at me:

== Red features marked as Must

For each of these, we need an estimate of how long the feature will take
to complete - these will almost certainly delay the release.
* Multiple Gateways
* Edit Connection Properties
* oVirt scheduler
* Scheduling API
* Networking QoS

Next priority is the orange musts - one in particular jumps out at me:
"Watchdog engine support" has a comment "REST API missing" - which sounds
ominous.

Thanks to everyone for filling this out! And thanks to Moran for taking
the lead on this, it's very useful.

Cheers,
Dave.

--
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From liumbj at linux.vnet.ibm.com Wed May 29 08:35:12 2013
From: liumbj at linux.vnet.ibm.com (Mei Liu)
Date: Wed, 29 May 2013 16:35:12 +0800
Subject: SLA feature for storage I/O bandwidth
In-Reply-To: <51A5B170.50205@redhat.com>
References: <51A46842.1090901@linux.vnet.ibm.com> <51A5B170.50205@redhat.com>
Message-ID: <51A5BDC0.2020602@linux.vnet.ibm.com>

On 05/29/2013 03:42 PM, Dave Neary wrote:
> Hi Mei Liu,
>
> On 05/28/2013 10:18 AM, Mei Liu wrote:
>> I created a draft wiki page on the design of storage I/O bandwidth SLA at
>> the following link:
>>
>> http://www.ovirt.org/SLA_for_storage_resource .
>>
>> I will appreciate it if anyone who works on ovirt engine, vdsm or mom
>> could give some comments. TIA.
> Just out of interest - which version of oVirt are you targeting this
> for? Can I assume that it's for post-3.3? Today is officially the 3.3
> feature freeze (but we have a release meeting later to discuss that).
>
> Thanks,
> Dave.
>
Hi Dave,

The basic i/o tune functionality for vdsm is almost ready. However,
there is nothing written on the Engine side and no policy for automatic
tuning is applied yet.
I am not sure if the basic functionality can target 3.3.
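[For reference, a minimal sketch of what the per-disk tuning discussed in
this thread looks like at the vdsm boundary, with the two values placed
inside specParams as agreed above. The 'ioTune' key name follows Dan's
"just-like-libvirt" naming suggestion, and the surrounding drive keys are
illustrative assumptions, not taken from the patch:

    # Hypothetical drive specification for vdsm's vmCreate/hotplug paths;
    # only the specParams placement reflects the thread, the rest is
    # illustrative.
    drive = {
        'device': 'disk',
        'type': 'disk',
        'format': 'raw',
        'specParams': {
            'ioTune': {
                'read_bytes_sec': 6120000,  # throttle reads to ~6 MB/s
                'total_iops_sec': 800,      # cap total IOPS for the device
            },
        },
    }

vdsm would render such a spec into the libvirt <iotune> element quoted
earlier in the thread.]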
Best regards,
Mei Liu

From dneary at redhat.com Wed May 29 14:52:00 2013
From: dneary at redhat.com (Dave Neary)
Date: Wed, 29 May 2013 16:52:00 +0200
Subject: Multiple Gateways 3.3 Feature
In-Reply-To: <1038209955.13068617.1369638715648.JavaMail.root@redhat.com>
References: <1038209955.13068617.1369638715648.JavaMail.root@redhat.com>
Message-ID: <51A61610.7070403@redhat.com>

Hi Assaf,

On 05/27/2013 09:11 AM, Assaf Muller wrote:
> The multiple gateways feature is planned for 3.3.
>
> http://www.ovirt.org/Features/Multiple_Gateways

It's really late in the cycle to start work on a feature targeting 3.3 -
can you explain why this feature is such a high priority, please?

Also, do you have some idea when it might be release ready?

Thanks!
Dave.

--
Dave Neary - Community Action and Impact
Open Source and Standards, Red Hat - http://community.redhat.com
Ph: +33 9 50 71 55 62 / Cell: +33 6 77 01 92 13

From danken at redhat.com Wed May 29 15:26:10 2013
From: danken at redhat.com (Dan Kenigsberg)
Date: Wed, 29 May 2013 18:26:10 +0300
Subject: Multiple Gateways 3.3 Feature
In-Reply-To: <51A61610.7070403@redhat.com>
References: <1038209955.13068617.1369638715648.JavaMail.root@redhat.com> <51A61610.7070403@redhat.com>
Message-ID: <20130529152610.GA19068@redhat.com>

On Wed, May 29, 2013 at 04:52:00PM +0200, Dave Neary wrote:
> Hi Assaf,
>
> On 05/27/2013 09:11 AM, Assaf Muller wrote:
> > The multiple gateways feature is planned for 3.3.
> >
> > http://www.ovirt.org/Features/Multiple_Gateways
>
> It's really late in the cycle to start work on a feature targeting 3.3 -
> can you explain why this feature is such a high priority, please?

We had multiple users requesting the ability to set the gateway of
display networks. Without it, spice/vnc clients cannot sit outside of
the ovirt network, which kinda hampers the whole point of spice, being a
really-remote desktop protocol.

People are hacking initscripts' route-* and rule-* files below Vdsm's
feet to enable this. That's not healthy. This feature is all about
automating this, and allowing it to integrate with dynamic addresses
properly.

> Also, do you have some idea when it might be release ready?

The stated date 2013-06-09 of beta readiness is a bit aggressive, but I
am confident that this feature would be ready for the now-postponed date
of ovirt-3.3 (check point at 2013-06-15).

Nonetheless, we do need proper review and discussion of the suggested
feature. If something nasty pops up during the review, I would have to
reconsider.

Dan.

From lpeer at redhat.com Wed May 29 15:30:24 2013
From: lpeer at redhat.com (Livnat Peer)
Date: Wed, 29 May 2013 18:30:24 +0300
Subject: Multiple Gateways 3.3 Feature
In-Reply-To: <20130529152610.GA19068@redhat.com>
References: <1038209955.13068617.1369638715648.JavaMail.root@redhat.com> <51A61610.7070403@redhat.com> <20130529152610.GA19068@redhat.com>
Message-ID: <51A61F10.3080000@redhat.com>

On 05/29/2013 06:26 PM, Dan Kenigsberg wrote:
> On Wed, May 29, 2013 at 04:52:00PM +0200, Dave Neary wrote:
>> Hi Assaf,
>>
>> On 05/27/2013 09:11 AM, Assaf Muller wrote:
>>> The multiple gateways feature is planned for 3.3.
>>>
>>> http://www.ovirt.org/Features/Multiple_Gateways
>>
>> It's really late in the cycle to start work on a feature targeting 3.3 -
>> can you explain why this feature is such a high priority, please?
>
> We had multiple users requesting the ability to set the gateway of
> display networks. Without it, spice/vnc clients cannot sit outside of
> the ovirt network, which kinda hampers the whole point of spice, being a
> really-remote desktop protocol.
>
> People are hacking initscripts' route-* and rule-* files below Vdsm's
> feet to enable this. That's not healthy. This feature is all about
> automating this, and allowing it to integrate with dynamic addresses
> properly.
>
>> Also, do you have some idea when it might be release ready?
>
> The stated date 2013-06-09 of beta readiness is a bit aggressive, but I
> am confident that this feature would be ready for the now-postponed date
> of ovirt-3.3 (check point at 2013-06-15).
>
> Nonetheless, we do need proper review and discussion of the suggested
> feature. If something nasty pops up during the review, I would have to
> reconsider.

In addition to the above I would like to add that we started working on
this proposal a few weeks ago; we encountered a lot of technical
difficulties and it took us a relatively long time to build the proposal
for implementing this feature.

> Dan.
> _______________________________________________
> Arch mailing list
> Arch at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/arch

From dfediuck at redhat.com Wed May 29 15:34:24 2013
From: dfediuck at redhat.com (Doron Fediuck)
Date: Wed, 29 May 2013 11:34:24 -0400 (EDT)
Subject: SLA feature for storage I/O bandwidth
In-Reply-To: <51A5BDC0.2020602@linux.vnet.ibm.com>
References: <51A46842.1090901@linux.vnet.ibm.com> <51A5B170.50205@redhat.com> <51A5BDC0.2020602@linux.vnet.ibm.com>
Message-ID: <1264434811.14301245.1369841664456.JavaMail.root@redhat.com>

----- Original Message -----
> From: "Mei Liu"
> To: "Dave Neary"
> Cc: arch at ovirt.org
> Sent: Wednesday, May 29, 2013 11:35:12 AM
> Subject: Re: SLA feature for storage I/O bandwidth
>
> On 05/29/2013 03:42 PM, Dave Neary wrote:
> > Hi Mei Liu,
> >
> > On 05/28/2013 10:18 AM, Mei Liu wrote:
> >> I created a draft wiki page on the design of storage I/O bandwidth SLA at
> >> the following link:
> >>
> >> http://www.ovirt.org/SLA_for_storage_resource .
> >>
> >> I will appreciate it if anyone who works on ovirt engine, vdsm
> >> or mom could give some comments. TIA.
> > Just out of interest - which version of oVirt are you targeting this
> > for? Can I assume that it's for post-3.3? Today is officially the 3.3
> > feature freeze (but we have a release meeting later to discuss that).
> >
> > Thanks,
> > Dave.
> >
> Hi Dave,
> The basic i/o tune functionality for vdsm is almost ready. However,
> there is nothing written on the Engine side and no policy for automatic
> tuning is applied yet.
> I am not sure if the basic functionality can target 3.3.
>
> Best regards,
> Mei Liu
>

Hi Mei Liu,

I'm still going over the wiki, but a few things we need to consider;

First of all, QoS for storage I/O bandwidth is a part of a larger SLA
policy which may include network QoS, CPU and memory QoS, and the quota we
implement today. So first of all we need to make sure your design does not
conflict with the other QoS parts, which is what I'm looking into now.

Additionally, using the quota term is confusing, as oVirt already has
quota today, and the cpu API has its own quota definition. So please try
to come up with a different terminology.

I like your idea of setting an initial value, but I need some more time to
come up with my insights on it.

Also, I completely agree with your concept of letting mom handle it at the
host level. We need to verify it does not break anything related to SPM.
This is something we need to synchronize with the storage guys.

Looking into the engine area, we should start thinking about how this will
be supported in the main storage entities and the VM / template / instance
entities. So you may want to add a section on this as well.

This leads me to think of importing and exporting a VM, which may want to
maintain the defined IO QoS. Any thoughts around it?

Doron

From liumbj at linux.vnet.ibm.com Thu May 30 10:05:54 2013
From: liumbj at linux.vnet.ibm.com (Mei Liu)
Date: Thu, 30 May 2013 18:05:54 +0800
Subject: move wiki page for SLA feature for storage I/O bandwidth
Message-ID: <51A72482.4030608@linux.vnet.ibm.com>

Hi all,

I moved the wiki page from http://ovirt.org/SLA_for_storage_resource to
http://www.ovirt.org/Features/Design/SLA_for_storage_io_bandwidth to
conform to http://www.ovirt.org/Feature_template

Sorry for the inconvenience.

Best regards,
Mei Liu (Rose)

From kwade at redhat.com Thu May 30 16:02:54 2013
From: kwade at redhat.com (Karsten 'quaid' Wade)
Date: Thu, 30 May 2013 09:02:54 -0700
Subject: move wiki page for SLA feature for storage I/O bandwidth
In-Reply-To: <51A72482.4030608@linux.vnet.ibm.com>
References: <51A72482.4030608@linux.vnet.ibm.com>
Message-ID: <51A7782E.7030605@redhat.com>

On 05/30/2013 03:05 AM, Mei Liu wrote:
> Hi all,
>
> I moved the wiki page from http://ovirt.org/SLA_for_storage_resource to
> http://www.ovirt.org/Features/Design/SLA_for_storage_io_bandwidth to
> conform to http://www.ovirt.org/Feature_template

Actually, you had the page name more correct the first time. We have a
lot of page names to clean up on the wiki because they are put in the
fake-nesting of [[/Features/.../Page_name]] instead of using the more
proper [[Category:Feature]] and [[Category:Features under design]].

Here is the page that explains the page naming policy:

http://www.ovirt.org/Help#Wiki_contribution_guidelines
http://www.ovirt.org/Help:Wiki_structure

To be fair, none of us have tried hard to tell other people about the
page naming guidelines. But it is something that needs to be cleaned up,
and perhaps we can all start with that by following the guidelines for
new pages.

Thanks,
- Karsten
--
Karsten 'quaid' Wade http://TheOpenSourceWay.org .^\
http://community.redhat.com @quaid (identi.ca/twitter/IRC) \v'
gpg: AD0E0C41