From: Richard Neuboeck <hawk(a)tbi.univie.ac.at>
To: users(a)ovirt.org
Message-ID: <570DF78B.9020004(a)tbi.univie.ac.at>
Subject: Re: [ovirt-users] HA agent fails to start
References: <570B402F.4000105(a)tbi.univie.ac.at>
<CAN8-ONqE+b=sschJc=fzd1ExWP0tcs6JhZTb6+N19zDHJ7pyUQ(a)mail.gmail.com>
<570CEB60.9030708(a)tbi.univie.ac.at>
<CAN8-ONoRWH2XivN1HH-UMe74UUzvdgEn92oq8ngdSJmkeyXOvA(a)mail.gmail.com>
In-Reply-To: <CAN8-ONoRWH2XivN1HH-UMe74UUzvdgEn92oq8ngdSJmkeyXOvA(a)mail.gmail.com>
The answers file shows the setup time of both machines.
On both machines hosted-engine.conf got rotated right before I wrote
this mail. Is it possible that the reboot interrupted the rotation, so
that the backup was accurate but the update had not yet been written to
hosted-engine.conf?
[root@cube-two ~]# ls -l /etc/ovirt-hosted-engine
total 16
-rw-r--r--. 1 root root 3252 Apr 8 10:35 answers.conf
-rw-r--r--. 1 root root 1021 Apr 13 09:31 hosted-engine.conf
-rw-r--r--. 1 root root 1021 Apr 13 09:30 hosted-engine.conf~
[root@cube-three ~]# ls -l /etc/ovirt-hosted-engine
total 16
-rw-r--r--. 1 root root 3233 Apr 11 08:02 answers.conf
-rw-r--r--. 1 root root 1002 Apr 13 09:31 hosted-engine.conf
-rw-r--r--. 1 root root 1002 Apr 13 09:31 hosted-engine.conf~
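
For anyone who wants to repeat this comparison on their own hosts, here is a minimal sketch in Python. It only reads the size and modification time of the three files listed above and notes when hosted-engine.conf is missing or empty while a non-empty ~ backup exists. The helper names describe() and check_config() are made up for this example and are not part of oVirt; the directory is the one shown in the listings.

```python
import os
from datetime import datetime


def describe(path):
    """Return (size_in_bytes, mtime) for path, or None if it does not exist."""
    try:
        st = os.stat(path)
    except FileNotFoundError:
        return None
    return st.st_size, datetime.fromtimestamp(st.st_mtime)


def check_config(conf_dir="/etc/ovirt-hosted-engine"):
    """Collect size/mtime for the config files and list anything suspicious.

    Hypothetical helper for illustration only; it never modifies any file.
    """
    conf = os.path.join(conf_dir, "hosted-engine.conf")
    backup = conf + "~"
    answers = os.path.join(conf_dir, "answers.conf")
    info = {path: describe(path) for path in (answers, conf, backup)}

    findings = []
    conf_unusable = info[conf] is None or info[conf][0] == 0
    if info[conf] is None:
        findings.append("hosted-engine.conf is missing")
    elif info[conf][0] == 0:
        findings.append("hosted-engine.conf is empty")
    if conf_unusable and info[backup] is not None and info[backup][0] > 0:
        findings.append("non-empty backup hosted-engine.conf~ is available")
    return info, findings


if __name__ == "__main__":
    info, findings = check_config()
    for path, details in info.items():
        if details is None:
            print(f"{path}: missing")
        else:
            size, mtime = details
            print(f"{path}: {size} bytes, modified {mtime:%Y-%m-%d %H:%M:%S}")
    for finding in findings:
        print("NOTE:", finding)
```

Running it right after a reboot, before restoring anything by hand, should show whether the empty file and the backup carry the timestamps one would expect from an interrupted rotation.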
On 12.04.16 16:01, Simone Tiraboschi wrote:
Everything seems fine here,
/etc/ovirt-hosted-engine/hosted-engine.conf seems to be correctly
created with the right name.
Can you please check the latest modification time of your
/etc/ovirt-hosted-engine/hosted-engine.conf~ and compare it with the
setup time?
On Tue, Apr 12, 2016 at 2:34 PM, Richard Neuboeck <hawk(a)tbi.univie.ac.at> wrote:
> On 04/12/2016 11:32 AM, Simone Tiraboschi wrote:
>> On Mon, Apr 11, 2016 at 8:11 AM, Richard Neuboeck <hawk(a)tbi.univie.ac.at> wrote:
>>> Hi oVirt Group,
>>>
>>> in my attempts to get all aspects of oVirt 3.6 up and running I
>>> stumbled upon something I'm not sure how to fix:
>>>
>>> Initially I installed a hosted engine setup. After that I added
>>> another HA host (with hosted-engine --deploy). The host was
>>> registered in the Engine correctly and HA agent came up as expected.
>>>
>>> However if I reboot the second host (through the Engine UI or
>>> manually) HA agent fails to start. The reason seems to be that
>>> /etc/ovirt-hosted-engine/hosted-engine.conf is empty. The backup
>>> file ending with ~ exists though.
>>
>> Can you please attach hosted-engine-setup logs from your additional hosts?
>> AFAIK our code will never take a ~ ending backup of that file.
>
> ovirt-hosted-engine-setup logs from both additional hosts are
> attached to this mail.
>
>>
>>> Here are the log messages from the journal:
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at systemd[1]: Starting oVirt
>>> Hosted Engine High Availability Monitoring Agent...
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at ovirt-ha-agent[3747]:
>>> INFO:ovirt_hosted_engine_ha.agent.agent.Agent:ovirt-hosted-engine-ha
>>> agent 1.3.5.3-0.0.master started
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at ovirt-ha-agent[3747]:
>>> INFO:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Found
>>> certificate common name: cube-two.tbi.univie.ac.at
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at ovirt-ha-agent[3747]:
>>> ovirt-ha-agent
>>> ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine ERROR Hosted
>>> Engine is not configured. Shutting down.
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at ovirt-ha-agent[3747]:
>>> ERROR:ovirt_hosted_engine_ha.agent.hosted_engine.HostedEngine:Hosted
>>> Engine is not configured. Shutting down.
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at ovirt-ha-agent[3747]:
>>> INFO:ovirt_hosted_engine_ha.agent.agent.Agent:Agent shutting down
>>> Apr 11 07:29:39 cube-two.tbi.univie.ac.at systemd[1]:
>>> ovirt-ha-agent.service: main process exited, code=exited, status=255/n/a
>>>
>>> If I restore the configuration from the backup file and manually
>>> restart the HA agent it's working properly.
>>>
>>> For testing purposes I added a third HA host, which turned out to
>>> behave exactly the same.
>>>
>>> Any help would be appreciated!
>>> Thanks
>>> Cheers
>>> Richard