Hi !
Today my hosts (engine + all nodes) certificates expired and I re-run
engine-setup to renew certificates.
Then I did for each node host:
Edit host -> Advanced parameters -> Fetch SSH public key (PEM)
in order to update certificates on nodes, everything was finished just fine.
Unfortunately, one of the most crucial nodes (node14) still shows this
error:
VDSM node14 command Get Host Capabilities failed: PKIX path validation
failed: java.security.cert.CertPathValidatorException: validity check failed
Restarted vdsms and vdsm-network, still same, node is marked as
non-responsive, and all VM with "?" sign (unknown status).
However, node14 pings without any problem, its storage domain shown in
green (OK), and all VMs are running fine.
Service vdsm-network status is OK, vdsmd is NOT:
Aug 08 22:07:27 node14.***.lv vdsm[1264164]: ERROR ssl handshake: socket
error, address: ::ffff:192.168.0.4
This node is running our accounting and stock control system, its
storage domain holds VM disk of that software. If its nonoperational
after restart, its a BIG trouble, I will not be able to migrate VM disk
anywhere. Restoring accounting DB from daily backup is a lengthy process
for 2 - 3 hours.
Please advice what to do next.
Thanks in advance.