Hi,
I've pushed another set with your suggestions. Please continue reviewing.
Note: 2 of the patches failed CI, it is some random error it seems, not the
ones Nir fixed. I'll take a better look tomorrow and deal with those on
separate email threads.
Thanks a lot.
On Wed, Nov 28, 2018 at 4:15 PM Germano Veit Michel <germano(a)redhat.com>
wrote:
Thank you for all the reviews!
I'm working on some of the changes you suggested to improve the code. Plan
to push an updated series tomorrow.
On Tue, Nov 13, 2018 at 4:30 PM Germano Veit Michel <germano(a)redhat.com>
wrote:
>
>
> On Mon, Nov 12, 2018 at 3:20 PM Germano Veit Michel <germano(a)redhat.com>
> wrote:
>
>> Hi,
>>
>> I've finished developing the tools we would like to have to fix snapshot
>> related issues.
>>
>> Piotr did an initial review in one of the patches, but there is still a
>> lot of reviewing that needs to be done. Could you please review this
>> series?
https://gerrit.ovirt.org/#/q/topic:snapshot-tools
>> I'm specially concerned about 94366, I'm not sure that is the right way
>> do to that.
>>
>> Note: there are some random CI failures on F28 that we are investigating
>> in a different mail thread* , so with every rebase 1 to 3 patches fail.
>> * [VDSM] TestMount.testSymlinkMount failing
>>
>> Thanks
>>
>>
>>
>> On Mon, Oct 8, 2018 at 4:43 PM Germano Veit Michel <germano(a)redhat.com>
>> wrote:
>>
>>> All passing CI and rebased to latest master.
>>> Reviews are welcome.
>>>
>>> On Tue, Sep 25, 2018 at 8:01 AM Germano Veit Michel
<germano(a)redhat.com>
>>> wrote:
>>>
>>>>
>>>>
>>>> On Tue, Sep 25, 2018 at 7:59 AM Dan Kenigsberg <danken(a)redhat.com>
>>>> wrote:
>>>>
>>>>>
>>>>>
>>>>> On Tue, Sep 25, 2018 at 12:49 AM Germano Veit Michel <
>>>>> germano(a)redhat.com> wrote:
>>>>>
>>>>>>
>>>>>>
>>>>>> On Mon, Sep 24, 2018 at 11:56 PM Dan Kenigsberg
<danken(a)redhat.com>
>>>>>> wrote:
>>>>>>
>>>>>>> On Mon, Sep 24, 2018 at 9:24 AM Germano Veit Michel <
>>>>>>> germano(a)redhat.com> wrote:
>>>>>>> >
>>>>>>> > Hi,
>>>>>>> >
>>>>>>> > I'm working an a tool (vdsm-tool update-volume) to
make modifying
>>>>>>> SD metadata easier and more importantly, safer. This is very
useful to
>>>>>>> recover from failed LSMs or snapshot issues.
>>>>>>> >
>>>>>>> > The plan is to use the VDSM API (modified by some of
these
>>>>>>> patches) and add a tool (vdsm-tool) that talks to the API and
modifies the
>>>>>>> volumes metadata as required by the user. Currently this is
done manually,
>>>>>>> i.e.: looking at MD_XXX tags, doing dd, sed and then dd back
to the
>>>>>>> storage. Any wrong argument (like a skip in place of a seek)
can ruin the
>>>>>>> entire metadata, so this tool can be quite handy.
>>>>>>> >
>>>>>>> > The code is not necessarily 100% finished yet, but
I've been
>>>>>>> testing this for some time and it seems ok from a functional
point of view.
>>>>>>> I'm just not sure everything I did (especially inside
VDSM, example 94366)
>>>>>>> is correct. Your comments on what can/should be improved are
very welcome
>>>>>>> at this point. Please see this series and help reviewing it.
>>>>>>> >
>>>>>>> >
>>>>>>>
https://gerrit.ovirt.org/#/q/topic:update-volume+(status:open+OR+status:m...
>>>>>>> >
https://gerrit.ovirt.org/#/c/93258/
>>>>>>>
>>>>>>> I am not a maintainer of vdsm.storage, but I can say that
I'm
>>>>>>> missing
>>>>>>> a high-level description of the change that you are
suggesting, and
>>>>>>> its motivation. When do you see a need to manually change
the
>>>>>>> metadata
>>>>>>> of volumes? Shouldn't we fix the bug that causes this
need?
>>>>>>>
>>>>>>
>>>>>> Ohh, sorry. I thought it was obvious. This is for snapshot
related
>>>>>> issues. These bugs have been present for a while, they are fixed
but new
>>>>>> ones come up. In the end downstream support is constantly
manually
>>>>>> repairing chains. This is why this tool is needed. Both to make
those
>>>>>> changes safer and to save time.
>>>>>>
>>>>>>
>>>>>>> I personally have a deep resentment to a "force"
flags - not just
>>>>>>> here, but everywhere. It is never clear what is being forced.
Some
>>>>>>> things cannot or should not be forced.
>>>>>>>
>>>>>>
>>>>>> Nir already suggested to remove the force flag. I don't mind,
the
>>>>>> idea was just to keep the old behavior the same if the force flag
is not
>>>>>> used.
>>>>>>
>>>>>>
>>>>>>>
>>>>>>> One last note: a CI+1 gives a positive psychological vibe to
your
>>>>>>> reviewer.
>>>>>>>
>>>>>> Yes, I know. Most of them have CI+1. They keep randomly failing
on
>>>>>> FC28 (but EL7 succeeds) on every push.
>>>>>>
>>>>>
>>>>> We used to have a regression in iproute on Fedora few weeks ago. a
>>>>> rebase on master may help. It is your responsibility to find why
fc28
>>>>> fails, as we cannot merge a patch that does not pass there.
>>>>>
>>>>
>>>> Thanks, I'll rebase them. Hopefully thats it.
>>>>
>>>>
>>>>>
>>>>> And there is just one that is constantly failing, I even sent an
>>>>>> email to this list ("Jenkins help") to try to better
understand
>>>>>> why, but no one replied yet. It seems to be complaining about an
object not
>>>>>> having a member, but the member is created at runtime (based on
api schema).
>>>>>>
>>>>>
> Today was my lucky day and CI didn't fail with those random errors on
> umount. So all patches passed, please review:
>
https://gerrit.ovirt.org/#/q/topic:snapshot-tools
>