(sorry: resending as I wasn’t part of the list, yet)

 

hi,

this is my first post so hallo all and thank you for reading.

I have an issue with my production Ovirt environment (3.5.1.1-1.el6).

 

My system consists of several datancers.

2 of them are connected to an iSCSI SAN and they were working fine.

Until the moment I had the bad idea of deleting a SAN volume from the SAN manager before deleting the associated storage on Ovirt. From that moment, the DC where this storage was mounted became not responsive: it cannot attach the master storage (or any other).

I tried to

1) manually destroy the offending storage (select -> destroy) but still cannot recover the situation.

2) right click on master storage and activate it

3) re-initialize the datacenter using a NFS storage from the working sister DC.

 

All Hosts are still running even though their status is "unknown".

All VM are still running even though their status is "not responding".

 

I half resolved the issue by manually restarting the host where the datastore was originally mounted. This cleared the orphaned multipath.

However, the SPM does not come up still.

This is an extract of the log

2015-04-16 03:51:48,069 WARN  [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (DefaultQuartzScheduler_Worker-14) [61a44b19] could not stop spm of pool 00000002-0002-0002-0002-00000000009c on vds 89254f23-8748-402a-afc9-08438dca0975 - reason: org.ovirt.engine.core.vdsbroker.vdsbroker.VDSNetworkException: VDSGenericException: VDSNetworkException: Message timeout which can be caused by communication issues

2015-04-16 03:51:48,072 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (DefaultQuartzScheduler_Worker-14) [61a44b19] FINISH, SpmStopVDSCommand, log id: 4354cf46

2015-04-16 03:51:48,072 WARN  [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler_Worker-14) [61a44b19] spm stop on spm failed, stopping spm selection!

2015-04-16 03:51:58,223 INFO  [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler_Worker-4) [4ca2d938] hostFromVds::selectedVds - Brachetto, spmStatus Free, storage pool IRDC-INTEL

2015-04-16 03:51:58,225 ERROR [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler_Worker-4) [4ca2d938] SPM Init: could not find reported vds or not up - pool:IRDC-INTEL vds_spm_id: 3

2015-04-16 03:51:58,239 INFO  [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] (DefaultQuartzScheduler_Worker-4) [4ca2d938] SPM selection - vds seems as spm sovana

2015-04-16 03:51:58,252 INFO  [org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStopVDSCommand] (DefaultQuartzScheduler_Worker-4) [4ca2d938] START, SpmStopVDSCommand(HostName = sovana, HostId = 89254f23-8748-402a-afc9-08438dca0975, storagePoolId = 00000002-0002-0002-0002-00000000009c), log id: 63a17687

 

storagePoolId = 00000002-0002-0002-0002-00000000009c is (was) hertz-dstore2 which does not exists anymore on SAN adn ovirt

hostid  89254f23-8748-402a-afc9-08438dca0975 is sovana server (current SPM)

 

 

 

 

I’m thinking about

Put the hosted engine host into Maintenance

Shutdown Ovirt Manager

Rebooted SPM server

Restarted Ovirt Manager

Took hosted engine host out of Maintenance

 

 

any help or clue is highly welcomed with cheers and beers

thank you!

Andrea