SPM in case of Failure

Hi

Our setup looks like:

- 2 clusters at 2 different sites, connected by a 10 Gbit LAN
- Storage based on an FC SAN, replicated across both sites and available to both sites (the LUNs are available over 4 paths, 2 from each site)

My observation:

If one site goes down and that site owned the SPM, it is not possible to move or force the SPM to the second site.
On the site which is down, it's possible to reset all VMs that crashed using the "Confirm Host rebooted" menu on the oVirt Host, but this does not reset the SPM.
The only solution I found was to bring the host which owned the SPM up again, so that I could move the SPM to the other site and then reactivate the storage domains.

Is this normal behavior?
Is there any way to force SPM re-election?

Thanks for your help or ideas...

Regards,
Arsène

--
Arsène Gschwind
Sapify AG, on behalf of Universität Basel
IT Services
Klingelbergstr. 70 | CH-4056 Basel | Switzerland
Tel. +41 79 449 25 63 | http://its.unibas.ch
ITS-ServiceDesk: support-its@unibas.ch | +41 61 267 14 11

Hi Arsène,

See my comments inline.

On Mon, Jun 12, 2017 at 1:02 PM, Arsène Gschwind <arsene.gschwind@unibas.ch> wrote:
> Hi
>
> Our setup looks like:
>
> - 2 clusters at 2 different sites, connected by a 10 Gbit LAN
> - Storage based on an FC SAN, replicated across both sites and available to both sites (the LUNs are available over 4 paths, 2 from each site)
>
> My observation:
>
> If one site goes down and that site owned the SPM, it is not possible to move or force the SPM to the second site.

It could be a sanlock issue. The SPM holds a sanlock lease on the storage domain, so once the SPM host has been rebooted and its lease on the storage domain has been released (if I'm not mistaken, after 80 seconds), another host can acquire the lock on that storage domain and become the new SPM. What message do you see in the logs when you try to do that?

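As a rough illustration of the lease-expiry timing described above, here is a toy model in Python. It is only a sketch: the `Lease` class, the host names, and the `LEASE_EXPIRY_SECONDS` constant are hypothetical, it does not use sanlock's real API, and the 80-second window is the recollection mentioned above rather than a verified value.

```python
from dataclasses import dataclass

# Assumed expiry window in seconds; the "80 seconds" figure is the
# recollection quoted above, not a value read from sanlock.
LEASE_EXPIRY_SECONDS = 80.0


@dataclass
class Lease:
    """Toy stand-in for a storage-domain lease (hypothetical, not sanlock's API)."""
    owner: str
    last_renewal: float  # time of the owner's last successful renewal, in seconds

    def is_expired(self, now: float) -> bool:
        # The lease counts as expired once the owner has not renewed it
        # for the whole expiry window.
        return now - self.last_renewal >= LEASE_EXPIRY_SECONDS

    def try_acquire(self, host: str, now: float) -> bool:
        # The current owner can always renew; any other host must wait
        # until the lease has expired.
        if host == self.owner or self.is_expired(now):
            self.owner = host
            self.last_renewal = now
            return True
        return False


# The SPM host at site A renewed at t=0 and then went down.
lease = Lease(owner="host-site-a", last_renewal=0.0)
early = lease.try_acquire("host-site-b", now=30.0)  # lease still considered held
late = lease.try_acquire("host-site-b", now=80.0)   # expiry window has passed
```

In this model the host at the second site is refused while the dead host's lease is still inside the window and succeeds once the window has elapsed, which is the behavior I would expect a new SPM candidate to see.
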
> On the site which is down, it's possible to reset all VMs that crashed using the "Confirm Host rebooted" menu on the oVirt Host, but this does not reset the SPM.
> The only solution I found was to bring the host which owned the SPM up again, so that I could move the SPM to the other site and then reactivate the storage domains.

I would try to attach the storage domain (detach it first if it is already attached) so that you can register any VMs, templates, and disks that were added in the original environment.

> Is this normal behavior?
> Is there any way to force SPM re-election?
>
> Thanks for your help or ideas...
>
> Regards,
> Arsène

Participants (2):
- Arsène Gschwind
- Maor Lipchuk