Today i started troubleshooting more in depth on dns requests and exactly while i was looking at tcpdump an event of EngineUp -> EngineBadHealth happened.
The last dns requests has success and gets the MX address and immediately after i get the email reporting the status change.
This is clearly an issue with name resolution, but that's not clear to me from the broker.log file. The only message about it that i get is:
Thread-16::DEBUG::2017-12-13 14:31:23,657::monitor::126::ovirt_hosted_engine_ha.broker.monitor.Monitor::(get_value) Submonitor engine-health id 139653
412040592 current value: {"reason": "failed liveliness check", "health": "bad", "vm": "up", "detail": "up"}
Thread-16::DEBUG::2017-12-13 14:31:23,657::listener::170::ovirt_hosted_engine_ha.broker.listener.ConnectionHandler::(handle) Response: success {"reaso
n": "failed liveliness check", "health": "bad", "vm": "up", "detail": "up"}
But around that messages i get no signals of error on dns queries or similar. Do i need to check on other log files?