After changing the engine's MTU and running ifdown eth0 && ifup eth0, the engine can now get the host's capabilities and resync the host's networks, and it allows the host to be activated.

However, I have a new problem. While fighting the above, I "undeployed" hosted-engine, and now it seems to freeze when I try to reinstall with engine deploy. It eventually times out with a failure.


On Sun, Jan 27, 2019 at 8:59 PM Edward Berger <edwberger@gmail.com> wrote:
I have a problem host, which is also the one from which I deployed a hyperconverged oVirt node-ng cluster using the cockpit's hyperconverged installation wizard.

After deploying, I realized that I hadn't set the MTUs correctly for engine-mgmt, the associated VLAN and eno.2 device, and my InfiniBand interface ib0. When I went in and tried to set them to the new values of 9000 and 65520, the host got into some kind of hung state.

The engine task window shows a task in the "executing" state with a never-ending spinning widget:
"Handling non responsive Host track00..."

I tried updating the host's /etc/sysconfig/network-scripts files by hand.
I've tried every combination of the engine's "set host in maintenance mode", "sync networks", "refresh host capabilities", activating, and rebooting, but I'm still stuck with an unresponsive host.
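
For anyone following along, the hand edit amounts to setting the MTU= line in each interface's ifcfg file. Here is a minimal sketch in Python of that one change, assuming the usual KEY=value ifcfg layout. The function name set_mtu and the sample file contents (including the starting MTU of 2044) are made up for illustration and are not copied from my host.

```python
import re


def set_mtu(ifcfg_text: str, mtu: int) -> str:
    """Return ifcfg-style text with exactly one MTU= line set to `mtu`.

    Hypothetical helper for illustration: an existing MTU= line is
    replaced in place, duplicates are dropped, and a missing one is
    appended at the end.
    """
    new_line = f"MTU={mtu}"
    out = []
    replaced = False
    for line in ifcfg_text.splitlines():
        if re.match(r"^\s*MTU=", line):
            if not replaced:
                out.append(new_line)
                replaced = True
            # Any further MTU= lines are dropped.
        else:
            out.append(line)
    if not replaced:
        out.append(new_line)
    return "\n".join(out) + "\n"


# Illustrative contents only; a real ifcfg-ib0 will have more keys.
sample = "DEVICE=ib0\nBOOTPROTO=none\nONBOOT=yes\nMTU=2044\n"
print(set_mtu(sample, 65520), end="")
```

The same call with 9000 would cover the Ethernet-side files. The interface still has to be brought down and up again (or the host rebooted) before the new value takes effect.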

I had another host that also failed, but it allowed me to put it into maintenance mode, remove it from the cluster, and "add new" it back, and then it was happy.

This one won't let me remove it because it's serving the gluster volume mount point, even though I did give the mount options for the 2nd and 3rd backup volume servers.

I'd appreciate any help restoring it to proper working order.

I'm attaching the gzip'd engine log.