------=_NextPart_000_0353_01D01A39.5FE62240
Content-Type: text/plain; charset="us-ascii"
Content-Transfer-Encoding: 7bit
Have the following:
6 hosts - virt + Gluster shared
Gluster volume is distributed-replicate - replica 2
Shutting down servers one at a time all work except for 1 brick. If we shut
down one specific brick (1 brick per host) - we're unable to activate the
storage domain. VM's that were actively running from other bricks continue
to run. Whatever was running form that specific brick fails to run, gets
paused etc.
Error log shows the entry below. I'm not certain what it's saying is read
only.nothing is read only that I can find.
2014-12-17 19:57:13,362 ERROR
[org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStatusVDSCommand]
(DefaultQuartzScheduler_Worker-47) [4e9290a2] Command
SpmStatusVDSCommand(HostName =
U23.domainame.net, HostId =
0db58e46-68a3-4ba0-a8aa-094893c045a1, storagePoolId =
7ccd6ea9-7d80-4170-afa1-64c10c185aa6) execution failed. Exception:
VDSErrorException: VDSGenericException: VDSErrorException: Failed to
SpmStatusVDS, error = [Errno 30] Read-only file system, code = 100
2014-12-17 19:57:13,363 INFO
[org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData]
(DefaultQuartzScheduler_Worker-47) [4e9290a2] hostFromVds::selectedVds -
U23.domainname.net, spmStatus returned null!
According to Ovirt/Gluster, if a brick goes down, the VM should be able to
be restarted from another brick without issue. This does not appear to be
the case. If we take other bricks offline, it appears to work as expected.
Something with this specific brick cases everything to break which then
makes any VM's that were running from the brick unable to start.
------=_NextPart_000_0353_01D01A39.5FE62240
Content-Type: text/html; charset="us-ascii"
Content-Transfer-Encoding: quoted-printable
<html xmlns:v=3D"urn:schemas-microsoft-com:vml" =
xmlns:o=3D"urn:schemas-microsoft-com:office:office" =
xmlns:w=3D"urn:schemas-microsoft-com:office:word" =
xmlns:m=3D"http://schemas.microsoft.com/office/2004/12/omml" =
xmlns=3D"http://www.w3.org/TR/REC-html40"><head><meta =
http-equiv=3DContent-Type content=3D"text/html; =
charset=3Dus-ascii"><meta name=3DGenerator content=3D"Microsoft Word 15 =
(filtered medium)"><style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0in;
margin-bottom:.0001pt;
font-size:11.0pt;
font-family:"Calibri",serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:#0563C1;
text-decoration:underline;}
a:visited, span.MsoHyperlinkFollowed
{mso-style-priority:99;
color:#954F72;
text-decoration:underline;}
span.EmailStyle17
{mso-style-type:personal-compose;
font-family:"Calibri",serif;
color:windowtext;}
.MsoChpDefault
{mso-style-type:export-only;
font-family:"Calibri",serif;}
@page WordSection1
{size:8.5in 11.0in;
margin:1.0in 1.0in 1.0in 1.0in;}
div.WordSection1
{page:WordSection1;}
--></style><!--[if gte mso 9]><xml>
<o:shapedefaults v:ext=3D"edit" spidmax=3D"1026" />
</xml><![endif]--><!--[if gte mso 9]><xml>
<o:shapelayout v:ext=3D"edit">
<o:idmap v:ext=3D"edit" data=3D"1" />
</o:shapelayout></xml><![endif]--></head><body lang=3DEN-US =
link=3D"#0563C1" vlink=3D"#954F72"><div
class=3DWordSection1><p =
class=3DMsoNormal>Have the following:<o:p></o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>6 hosts =
– virt + Gluster shared<o:p></o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>Gluster =
volume is distributed-replicate – replica
2<o:p></o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>Shutting =
down servers one at a time all work except for 1 brick. If we shut down =
one specific brick (1 brick per host) – we’re unable to =
activate the storage domain. VM’s that were actively running from =
other bricks continue to run. Whatever was running form that specific =
brick fails to run, gets paused etc. <o:p></o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>Error log =
shows the entry below. I’m not certain what it’s saying is =
read only…nothing is read only that I can find. =
<o:p></o:p></p><p
class=3DMsoNormal><o:p> </o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>2014-12-17 =
19:57:13,362 ERROR =
[org.ovirt.engine.core.vdsbroker.vdsbroker.SpmStatusVDSCommand] =
(DefaultQuartzScheduler_Worker-47) [4e9290a2] Command =
SpmStatusVDSCommand(HostName =3D
U23.domainame.net, HostId =3D =
0db58e46-68a3-4ba0-a8aa-094893c045a1, storagePoolId =3D =
7ccd6ea9-7d80-4170-afa1-64c10c185aa6) execution failed. Exception: =
VDSErrorException: VDSGenericException: VDSErrorException: Failed to =
SpmStatusVDS, error =3D [Errno 30] Read-only file system, code =3D =
100<o:p></o:p></p><p class=3DMsoNormal>2014-12-17 19:57:13,363 =
INFO [org.ovirt.engine.core.vdsbroker.irsbroker.IrsProxyData] =
(DefaultQuartzScheduler_Worker-47) [4e9290a2] hostFromVds::selectedVds - =
U23.domainname.net, spmStatus returned null!<o:p></o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p =
class=3DMsoNormal><o:p> </o:p></p><p
class=3DMsoNormal>According to =
Ovirt/Gluster, if a brick goes down, the VM should be able to be =
restarted from another brick without issue. This does not appear to be =
the case… If we take other bricks offline, it appears to work as =
expected. Something with this specific brick cases everything to break =
which then makes any VM’s that were running from the brick unable =
to start.<o:p></o:p></p></div></body></html>
------=_NextPart_000_0353_01D01A39.5FE62240--