[Users] vdsm live migration errors in latest master

Federico Simoncelli fsimonce at redhat.com
Thu Sep 26 12:14:06 EDT 2013


----- Original Message -----
> From: "Dan Kenigsberg" <danken at redhat.com>
> To: "Federico Simoncelli" <fsimonce at redhat.com>
> Cc: "Dead Horse" <deadhorseconsulting at gmail.com>, "users" <users at ovirt.org>, vdsm-devel at fedorahosted.org,
> abaron at redhat.com
> Sent: Thursday, September 26, 2013 2:09:15 PM
> Subject: Re: [Users] vdsm live migration errors in latest master
> 
> On Thu, Sep 26, 2013 at 05:35:46AM -0400, Federico Simoncelli wrote:
> > ----- Original Message -----
> > > From: "Dan Kenigsberg" <danken at redhat.com>
> > > To: "Federico Simoncelli" <fsimonce at redhat.com>
> > > Cc: "Dead Horse" <deadhorseconsulting at gmail.com>, "users"
> > > <users at ovirt.org>, vdsm-devel at fedorahosted.org,
> > > abaron at redhat.com
> > > Sent: Thursday, September 26, 2013 1:38:15 AM
> > > Subject: Re: [Users] vdsm live migration errors in latest master
> > > 
> > > On Tue, Sep 24, 2013 at 12:04:14PM -0400, Federico Simoncelli wrote:
> > > > ----- Original Message -----
> > > > > From: "Dan Kenigsberg" <danken at redhat.com>
> > > > > To: "Dead Horse" <deadhorseconsulting at gmail.com>
> > > > > Cc: "<users at ovirt.org>" <users at ovirt.org>,
> > > > > vdsm-devel at fedorahosted.org,
> > > > > fsimonce at redhat.com, abaron at redhat.com
> > > > > Sent: Tuesday, September 24, 2013 11:44:48 AM
> > > > > Subject: Re: [Users] vdsm live migration errors in latest master
> > > > > 
> > > > > On Mon, Sep 23, 2013 at 04:05:34PM -0500, Dead Horse wrote:
> > > > > > Seeing failed live migrations and these errors in the vdsm logs
> > > > > > with
> > > > > > latest
> > > > > > VDSM/Engine master.
> > > > > > Hosts are EL6.4
> > > > > 
> > > > > Thanks for posting this report.
> > > > > 
> > > > > The log is from the source of migration, right?
> > > > > Could you trace the history of the hosts of this VM? Could it be that
> > > > > it
> > > > > was started on an older version of vdsm (say ovirt-3.3.0) and then
> > > > > (due
> > > > > to migration or vdsm upgrade) got into a host with a much newer vdsm?
> > > > > 
> > > > > Would you share the vmCreate (or vmMigrationCreate) line for this Vm
> > > > > in
> > > > > your log? I smells like an unintended regression of
> > > > >     http://gerrit.ovirt.org/17714
> > > > >     vm: extend shared property to support locking
> > > > > 
> > > > > solving it may not be trivial, as we should not call
> > > > > _normalizeDriveSharedAttribute() automatically on migration
> > > > > destination,
> > > > > as it may well still be apart of a 3.3 clusterLevel.
> > > > > 
> > > > > Also, migration from vdsm with extended shared property, to an ovirt
> > > > > 3.3
> > > > > vdsm is going to explode (in a different way), since the destination
> > > > > does not expect the extended values.
> > > > > 
> > > > > Federico, do we have a choice but to revert that patch, and use
> > > > > something like "shared3" property instead?
> > > > 
> > > > I filed a bug at:
> > > > 
> > > > https://bugzilla.redhat.com/show_bug.cgi?id=1011608
> > > > 
> > > > A possible fix could be:
> > > > 
> > > > http://gerrit.ovirt.org/#/c/19509
> > > 
> > > Beyond this, we must make sure that on Engine side, the extended shared
> > > values would be used only for clusterLevel 3.4 and above.
> > > 
> > > Are the extended shared values already used by Engine?
> > 
> > Yes. That's the idea. Actually to be fair, the second case you mentioned
> > (migrating from extended shared property to old vdsm) it wouldn't have been
> > possible I suppose (the issue here is that Dead Horse has one or more
> > hosts running on master instead of 3.3). The extended shared property would
> > have appeared only in 3.4 and to allow the migration you would have had to
> > upgrade all the nodes.
> > 
> > But anyway since we were also talking about a new 3.3.1 barnch I just went
> > ahead and covered all cases.
> 
> I do not see how the 3.3.1 branch is relevant to the discussion, as its
> Vdsm is NOT going to support clusterLevel 3.4.

That is what I was referring to.

If 3.3.1 was 3.3.0 + backported patches then we just wouldn't backport the
extended shared attributes patch and that's it. But from what I understood
3.3.1 will be rebased on master (where instead we have the extended shared
attributes) and that is why we have to cover both migration direction cases
(instead of just the simple getattr one).

> Pardon my slowliness, but would you confirm that this feature is to be
> used only on clusterLevel 3.4 and above? If so, I'm +2ing your patch.

Yes, the extended attributes will be used in the hosted engine and cluster
level 3.4.
But what the engine does is not relevant to +2ing correct vdsm patches.

-- 
Federico


More information about the Users mailing list