Hi Yedidyah,

 

Thank you for such honest answer.

 

Debugging gives me better insight into how platform operates, and the documentation rarely dives this deep. I did learn a lot but my intent was not to debug on my own.

 

I sure did re-install OS of Host2  and not only once (thanks to Foreman) it’s easier and faster than expected.

I also got help from my neighbor Strahil, who is always happy to help, but I could not get to the bottom of what is wrong, in order to see if its fixable or not.

 

2. It's very hard to help you by only guessing around. If you have a
concrete issue, such as "I add a host with 'Deploy Hosted Engine' and
this fails", then please provide all relevant logs.

>>  the list of problems I reported came due to power outage, so it was hard to know where to start.

 

>> I will open new issue with focus on problem with adding new HA Host and we take it from there.

 

 

3. If you still want to continue debugging by yourself, fine - the
code is open, and at least I personally try quite hard to make it
easy, in the relatively small parts of the code I touched, to search
around it even without having a complete picture of its structure,
mainly by making texts inside logs be "unique enough" so that you can
easily find them in the code.
>> I am proud of oVirt / I am running two production platforms, for almost three years, upgraded several times, in two regions with 1000+ tenants at any moment

>> but each time I have an issue, it takes me a while to get used to where to look and how to search

 

Thank you both Strahil and Yedidyah .

 

-----

kind regards/met vriendelijke groeten

 

Marko Vrgotic
Sr. System Engineer @ System Administration


ActiveVideo

o: +31 (35) 6774131

m: +31 (65) 5734174

e: m.vrgotic@activevideo.com
w: www.activevideo.com

 

ActiveVideo Networks BV. Mediacentrum 3745 Joop van den Endeplein 1.1217 WJ Hilversum, The Netherlands. The information contained in this message may be legally privileged and confidential. It is intended to be read only by the individual or entity to whom it is addressed or by their designee. If the reader of this message is not the intended recipient, you are on notice that any distribution of this message, in any form, is strictly prohibited.  If you have received this message in error, please immediately notify the sender and/or ActiveVideo Networks, LLC by telephone at +1 408.931.9200 and delete or destroy any copy of this message.

 

 

 

From: Yedidyah Bar David <didi@redhat.com>
Date: Wednesday, 5 May 2021 at 15:18
To: Marko Vrgotic <M.Vrgotic@activevideo.com>
Cc: Strahil Nikolov <hunter86_bg@yahoo.com>, users@ovirt.org <users@ovirt.org>
Subject: Re: [ovirt-users] Re: Unable to migrate Engine to another HE Host

***CAUTION: This email originated from outside of the organization. Do not click links or open attachments unless you recognize the sender!!!***

On Wed, May 5, 2021 at 3:18 PM Marko Vrgotic <M.Vrgotic@activevideo.com> wrote:
>
> Status Update:
>
>
>
> During migration between Host1 and Host3, links are being created and I see no Errors as before in the agent/broker logs.
>
>
>
> Having all links in place, and having waited 24hours, for potential Engine updates, I tried to deploy HE on Host2:
>
> Part1:
>
> From oVirt UI – HostedEngine Deploy
> Imeddiately on Host2 started seeing messages like “ Is the HostedEngine deployed ?”
> hosted-engine.conf got populated only with host_id and ca_path
> it failed
>
> Part2:
>
> I noticed that during deployment no links in /var/rund/vdsm/storage/<hosted_storage_id> were created
> Copied hosted-engine.conf from Host1 to Host2
> Replace the host_id with correct value
> Reran the deployment
> Noticed that /var/rund/vdsm/storage/<hosted_storage_id> two links got created, one of them being the link to metadata_image
> Host1 and Host3 hosted-engine –vm-status was showing Host2 but with status unknown/stale data
> Deployment failed
>
> Part3:
>
> Since link to conf_image was not created in first phase, I added it manually
> Populated the hosted-engine.conf
> Reran the deployment
> Same result
> Hosted-engine.conf on Host2 would end up with only host_id and ca_path values and deployment would fail
>
>
>
> At this point, I cleaned up all hosted-engie remains from Host2 using ovirt-hosted-engine-cleanup and removed metadata of Host2 from host1 and Host3
>
>
>
> I am out of ideas. It seems that Host1 and Host3 are happily operating, but I am unable to add any other hosts to HE pool.
>
>
>
> Please assist if you have any ideas.

Look, Marko - I admit it seems to me like you simply enjoy debugging
this yourself...

1. Did you try to reinstall the OS on host2? If not, is there any
reason not to? Other than the (legitimate!) wish to "understand what's
broken and fix just that"? Would reinstallation take a lot of
time/work? You can also do a full backup beforehand and then compare
later, to see what the differences were.

2. It's very hard to help you by only guessing around. If you have a
concrete issue, such as "I add a host with 'Deploy Hosted Engine' and
this fails", then please provide all relevant logs.

3. If you still want to continue debugging by yourself, fine - the
code is open, and at least I personally try quite hard to make it
easy, in the relatively small parts of the code I touched, to search
around it even without having a complete picture of its structure,
mainly by making texts inside logs be "unique enough" so that you can
easily find them in the code.

>
> I am open to executing restore – but I would much rather like to at least discover where or what is the problem, before moving to planning restore.

Good luck and best regards,

--
Didi