Thank you for all the reviews!
I'm working on some of the changes you suggested to improve the code. Plan
to push an updated series tomorrow.
On Tue, Nov 13, 2018 at 4:30 PM Germano Veit Michel <germano(a)redhat.com>
wrote:
On Mon, Nov 12, 2018 at 3:20 PM Germano Veit Michel <germano(a)redhat.com>
wrote:
> Hi,
>
> I've finished developing the tools we would like to have to fix snapshot
> related issues.
>
> Piotr did an initial review in one of the patches, but there is still a
> lot of reviewing that needs to be done. Could you please review this
> series?
https://gerrit.ovirt.org/#/q/topic:snapshot-tools
> I'm specially concerned about 94366, I'm not sure that is the right way
> do to that.
>
> Note: there are some random CI failures on F28 that we are investigating
> in a different mail thread* , so with every rebase 1 to 3 patches fail.
> * [VDSM] TestMount.testSymlinkMount failing
>
> Thanks
>
>
>
> On Mon, Oct 8, 2018 at 4:43 PM Germano Veit Michel <germano(a)redhat.com>
> wrote:
>
>> All passing CI and rebased to latest master.
>> Reviews are welcome.
>>
>> On Tue, Sep 25, 2018 at 8:01 AM Germano Veit Michel <germano(a)redhat.com>
>> wrote:
>>
>>>
>>>
>>> On Tue, Sep 25, 2018 at 7:59 AM Dan Kenigsberg <danken(a)redhat.com>
>>> wrote:
>>>
>>>>
>>>>
>>>> On Tue, Sep 25, 2018 at 12:49 AM Germano Veit Michel <
>>>> germano(a)redhat.com> wrote:
>>>>
>>>>>
>>>>>
>>>>> On Mon, Sep 24, 2018 at 11:56 PM Dan Kenigsberg
<danken(a)redhat.com>
>>>>> wrote:
>>>>>
>>>>>> On Mon, Sep 24, 2018 at 9:24 AM Germano Veit Michel <
>>>>>> germano(a)redhat.com> wrote:
>>>>>> >
>>>>>> > Hi,
>>>>>> >
>>>>>> > I'm working an a tool (vdsm-tool update-volume) to make
modifying
>>>>>> SD metadata easier and more importantly, safer. This is very
useful to
>>>>>> recover from failed LSMs or snapshot issues.
>>>>>> >
>>>>>> > The plan is to use the VDSM API (modified by some of these
>>>>>> patches) and add a tool (vdsm-tool) that talks to the API and
modifies the
>>>>>> volumes metadata as required by the user. Currently this is done
manually,
>>>>>> i.e.: looking at MD_XXX tags, doing dd, sed and then dd back to
the
>>>>>> storage. Any wrong argument (like a skip in place of a seek) can
ruin the
>>>>>> entire metadata, so this tool can be quite handy.
>>>>>> >
>>>>>> > The code is not necessarily 100% finished yet, but I've
been
>>>>>> testing this for some time and it seems ok from a functional
point of view.
>>>>>> I'm just not sure everything I did (especially inside VDSM,
example 94366)
>>>>>> is correct. Your comments on what can/should be improved are very
welcome
>>>>>> at this point. Please see this series and help reviewing it.
>>>>>> >
>>>>>> >
>>>>>>
https://gerrit.ovirt.org/#/q/topic:update-volume+(status:open+OR+status:m...
>>>>>> >
https://gerrit.ovirt.org/#/c/93258/
>>>>>>
>>>>>> I am not a maintainer of vdsm.storage, but I can say that I'm
missing
>>>>>> a high-level description of the change that you are suggesting,
and
>>>>>> its motivation. When do you see a need to manually change the
>>>>>> metadata
>>>>>> of volumes? Shouldn't we fix the bug that causes this need?
>>>>>>
>>>>>
>>>>> Ohh, sorry. I thought it was obvious. This is for snapshot related
>>>>> issues. These bugs have been present for a while, they are fixed but
new
>>>>> ones come up. In the end downstream support is constantly manually
>>>>> repairing chains. This is why this tool is needed. Both to make
those
>>>>> changes safer and to save time.
>>>>>
>>>>>
>>>>>> I personally have a deep resentment to a "force" flags
- not just
>>>>>> here, but everywhere. It is never clear what is being forced.
Some
>>>>>> things cannot or should not be forced.
>>>>>>
>>>>>
>>>>> Nir already suggested to remove the force flag. I don't mind,
the
>>>>> idea was just to keep the old behavior the same if the force flag is
not
>>>>> used.
>>>>>
>>>>>
>>>>>>
>>>>>> One last note: a CI+1 gives a positive psychological vibe to
your
>>>>>> reviewer.
>>>>>>
>>>>> Yes, I know. Most of them have CI+1. They keep randomly failing on
>>>>> FC28 (but EL7 succeeds) on every push.
>>>>>
>>>>
>>>> We used to have a regression in iproute on Fedora few weeks ago. a
>>>> rebase on master may help. It is your responsibility to find why fc28
>>>> fails, as we cannot merge a patch that does not pass there.
>>>>
>>>
>>> Thanks, I'll rebase them. Hopefully thats it.
>>>
>>>
>>>>
>>>> And there is just one that is constantly failing, I even sent an email
>>>>> to this list ("Jenkins help") to try to better understand
why, but
>>>>> no one replied yet. It seems to be complaining about an object not
having a
>>>>> member, but the member is created at runtime (based on api schema).
>>>>>
>>>>
Today was my lucky day and CI didn't fail with those random errors on
umount. So all patches passed, please review:
https://gerrit.ovirt.org/#/q/topic:snapshot-tools