On Fri, Dec 18, 2015 at 5:31 PM, Stefano Danzi <s.danzi(a)hawai.it> wrote:
I found this in vdsm.log and I think that could be the problem:
Thread-3771::ERROR::2015-12-18
16:18:58,597::brokerlink::279::ovirt_hosted_engine_ha.lib.brokerlink.BrokerLink::(_communicate)
Connection closed: Connection closed
Thread-3771::ERROR::2015-12-18 16:18:58,597::API::1847::vds::(_getHaInfo)
failed to retrieve Hosted Engine HA info
Traceback (most recent call last):
File "/usr/share/vdsm/API.py", line 1827, in _getHaInfo
stats = instance.get_all_stats()
File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py",
line 103, in get_all_stats
self._configure_broker_conn(broker)
File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/client/client.py",
line 180, in _configure_broker_conn
dom_type=dom_type)
File
"/usr/lib/python2.7/site-packages/ovirt_hosted_engine_ha/lib/brokerlink.py",
line 176, in set_storage_domain
.format(sd_type, options, e))
RequestError: Failed to set storage domain FilesystemBackend, options
{'dom_type': 'nfs3', 'sd_uuid':
'46f55a31-f35f-465c-b3e2-df45c05e06a7'}:
Connection closed
My guess is that this is a consequence of your networking problems.
Adding Dan.
Il 17/12/2015 18.51, Stefano Danzi ha scritto:
>
> I partially solve the problem.
>
> My host machine has 2 network interfaces with a bond. The bond was
> configured with mode=4 (802.3ad) and switch was configured in the same way.
> If I remove one network cable the network become stable. With both cables
> attached the network is instable.
>
> I removed the link aggregation configuration from switch and change the
> bond in mode=2 (balance-xor). Now the network are stable.
> The strange thing is that previous configuration worked fine for one
> year... since the last upgrade.
>
> Now ha-agent don't reboot the hosted-engine anymore, but I receive two
> emails from brocker evere 2/5 minutes.
> First a mail with "ovirt-hosted-engine state transition
> StartState-ReinitializeFSM" and after "ovirt-hosted-engine state
transition
> ReinitializeFSM-EngineStarting"
>
>
> Il 17/12/2015 10.51, Stefano Danzi ha scritto:
>>
>> Hello,
>> I have one testing host (only one host) with self hosted engine and 2 VM
>> (one linux and one windows).
>>
>> After upgrade ovirt from 3.6.0 to 3.6.1 the network connection works
>> discontinuously.
>> Every 10 minutes HA agent restart hosted engine VM because result down.
>> But the machine is UP,
>> only the network stop to work for some minutes.
>> I activate global maintenace mode to prevent engine reboot. If I ssh to
>> the hosted engine sometimes
>> the connection work and sometimes no. Using VNC connection to engine I
>> see that sometime VM reach external network
>> and sometimes no.
>> If I do a tcpdump on phisical ethernet interface I don't see any packet
>> when network on vm don't work.
>>
>> Same thing happens fo others two VM.
>>
>> Before the upgrade I never had network problems.
>> _______________________________________________
>> Users mailing list
>> Users(a)ovirt.org
>>
http://lists.ovirt.org/mailman/listinfo/users
>>
>
> _______________________________________________
> Users mailing list
> Users(a)ovirt.org
>
http://lists.ovirt.org/mailman/listinfo/users
_______________________________________________
Users mailing list
Users(a)ovirt.org
http://lists.ovirt.org/mailman/listinfo/users
--
Didi