On Mon, Sep 13, 2021 at 1:08 AM Gianluca Cecchi
<gianluca.cecchi(a)gmail.com> wrote:
On Sun, Sep 12, 2021 at 10:35 AM Yedidyah Bar David <didi(a)redhat.com> wrote:
>
>
> >>
> >> It was the step I suspect there was a regression for in 4.4.8 (comparing
with 4.4.7) when updating the first hosted-engine host during the upgrade flow and
retaining its hostname details.
>
> What's the regression?
I thought that in 4.4.7 there was not this problem if you use the same hostname but with
different (real or virtual) hw as the first host during your SHE upgrade from 4.3.10 to
4.4.7.
But probably it was not so and I didn't remember correctly....
>
> >> I'm going to test with latest async 2 4.4.8 and see if it solves the
problem. Otherwise I'm going to open a bugzilla sending the logs.
>
> Can you clarify what the bug is?
The automatic mgmt of host adding during the "hosted-engine --deploy
--restore-from-file=backup.bck" step if you have different hw and you want to recycle
your previous hostname.
In the past it often happened to me to combine upgrades of systems with hw refreshing
(with standalone hosts, rhcs clusters, also ovirt/rhv from 4.2 to 4.3 if I remember
correctly, ecc.) where you re-use an existing hostname on new hardware
More than a bug it would be an RFE perhaps....
OK, now filed it:
https://bugzilla.redhat.com/show_bug.cgi?id=2003515
>
> > As novirt2 and novirt1 (in 4.3) are VMS running on the same hypervisor I see
that in their hw details I have the same serial number and the usual random uuid
>
> Same serial number? Doesn't sound right. Any idea why it's the same?
My env is nested oVirt and my hypervisors are Vms.
I notice that in oVirt if you clone a VM it changes the uuid in the clone but it retains
the serial number...
OK, understood. Unrelated to current issue, but it might be worth to
optionally change this as well during a clone.
> > Unfortunately I cannot try at the moment the scenario where I deploy the new
novirt2 on the same virtual hw, because in the first 4.3 install I configured the OS disk
as 50Gb and with this size 4.4.8 complains about insufficient space. And having the
snapshot active in preview I cannot resize the disk
> > Eventually I can reinstall 4.3 on an 80Gb disk and try the same, maintaining the
same hw ... but this would imply that in general I cannot upgrade using different hw and
reusing the same hostnames.... correct?
>
> Yes. Either reuse a host and keep its name (what we recommend in the
> upgrade guide) or use a new host and a new name (backup/restore
> guide).
>
> The condition to remove the host prior to adding it is based on
> unique_id_out, which is set in (see also bz 1642440, 1654697):
>
> - name: Get host unique id
> shell: |
> if [ -e /etc/vdsm/vdsm.id ];
> then cat /etc/vdsm/vdsm.id;
> elif [ -e /proc/device-tree/system-id ];
> then cat /proc/device-tree/system-id; #ppc64le
> else dmidecode -s system-uuid;
> fi;
> environment: "{{ he_cmd_lang }}"
> changed_when: true
> register: unique_id_out
>
> So if you want to "make this work", you can set the uuid (either in
> your (virtual) BIOS, to affect the /proc value, or in
> /etc/vdsm/vdsm.id) to match the one of the old host (the one you want
> to reuse its name). I didn't test this myself, though.
>
I confirm that I reverted the snapshots of the 2 VMs used as hypervisors taking them
again at initial 4.3 status and remade all the steps, but right after the install of the
OS of 4.4.8 oVirt node I created /etc/vdsm/vdsm.id inside novirt2 with the old 4.3 value
(the file was not there at that moment) and then all the flow went as expected and I was
then able to reach the final 4.4.8 async 2 env with both hosts at 4.4.8, cluster and DC
updated to 4.6 compatibility level and no downtime for the VMs inside the env, because I
was able to execute live migration after upgrading the first host
Thanks for the report!
Best regards,
--
Didi