This is a multi-part message in MIME format.
--------------050104050809050706040602
Content-Type: text/plain; charset=utf-8; format=flowed
Content-Transfer-Encoding: 8bit
On 09/24/2015 02:38 AM, Darrell Budic wrote:
> This is a known issue in oVirt 3.5.x and below. It’s been solved in
> the upcoming oVirt 3.6.
>
> Related to https://bugzilla.redhat.com/show_bug.cgi?id=1172905, the
> fix involved setting up a special cgroup for the mount, but I can’t
> find the exact details atm.
I already have vdsm 4.17.6-0.el7.centos installed on the hosts, so I am
not sure that the fix for bug 1172905
<https://bugzilla.redhat.com/show_bug.cgi?id=1172905> resolves this issue.
Regards,
Ramesh
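
For reference on the quorum discussion quoted below, here is a minimal Python sketch of how I understand client quorum with cluster.quorum-type set to auto. It is only an illustration of the documented rule (more than half of the bricks in a replica set must be up; with an even replica count, exactly half is enough if the first brick is among them). The function name and parameters are hypothetical and are not part of gluster.

```python
def client_quorum_met(bricks_up: int, replica_count: int,
                      first_brick_up: bool = True) -> bool:
    """Approximate cluster.quorum-type=auto for a single replica set.

    Assumption: this mirrors the documented rule only, not gluster's code.
    """
    if not 0 <= bricks_up <= replica_count:
        raise ValueError("bricks_up must be between 0 and replica_count")
    # More than half of the bricks are reachable: quorum is met.
    if 2 * bricks_up > replica_count:
        return True
    # Even replica count with exactly half up: the first brick must be up.
    return 2 * bricks_up == replica_count and first_brick_up


if __name__ == "__main__":
    # Replica 3 with one brick killed on purpose (the setup in this thread).
    print(client_quorum_met(2, 3))  # True
    # Replica 3 after the client also loses its connection to a second brick.
    print(client_quorum_met(1, 3))  # False
```

By this rule, 2 of 3 bricks should still satisfy quorum, so the "Failing WRITE as quorum is not met" message suggests the client had also lost a second brick at that moment, which would be consistent with the 42 second timeout in the mount log.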
> On Sep 23, 2015, at 7:38 AM, Ramesh Nachimuthu <rnachimu(a)redhat.com
> <mailto:rnachimu@redhat.com>> wrote:
>
>
>
> On 09/22/2015 05:57 PM, Alastair Neil wrote:
>> You need to set cluster.server-quorum-ratio to 51%
>>
>
> I did that, but I am still facing the same issue. The VMs get paused
> when I run I/O with fio on disks backed by gluster, and I am not able
> to resume a VM after this. The only way out now is to shut the VM down
> and run it again. It then runs successfully on the same host without
> any issue.
>
> Regards,
> Ramesh
>
>> On 22 September 2015 at 08:25, Ramesh Nachimuthu
>> <rnachimu(a)redhat.com <mailto:rnachimu@redhat.com>> wrote:
>>
>>
>>
>> On 09/22/2015 05:43 PM, Alastair Neil wrote:
>>> What are the cluster.quorum-type
>>> and cluster.server-quorum-ratio settings on the volume?
>>>
>>
>> *cluster.server-quorum-type*: server
>> *cluster.quorum-type*: auto
>> *cluster.server-quorum-ratio* is not set.
>>
>> One brick process was purposefully killed, but the remaining two
>> bricks are up and running.
>>
>> Regards,
>> Ramesh
>>
>>> On 22 September 2015 at 06:24, Ramesh Nachimuthu
>>> <rnachimu(a)redhat.com> wrote:
>>>
>>> Hi,
>>>
>>> I am not able to resume a VM which was paused because of a
>>> gluster client quorum issue. Here is what happened in my
>>> setup.
>>>
>>> 1. Created a gluster storage domain backed by a gluster
>>> volume with replica 3.
>>> 2. Killed one brick process, so only two bricks are running
>>> in the replica 3 setup.
>>> 3. Created two VMs.
>>> 4. Started some I/O using fio on both of the VMs.
>>> 5. After some time, got the following errors in the gluster
>>> mount log, and the VMs moved to the paused state:
>>> "server 10.70.45.17:49217 has not responded in the last
>>> 42 seconds, disconnecting."
>>> "vmstore-replicate-0:
>>> e16d1e40-2b6e-4f19-977d-e099f465dfc6: Failing WRITE as
>>> quorum is not met"
>>> More gluster mount logs at http://pastebin.com/UmiUQq0F
>>> 6. After some time, gluster quorum is active again and I am
>>> able to write to the gluster file system.
>>> 7. When I try to resume the VM, it doesn't work, and I get
>>> the following error in the vdsm log:
>>> http://pastebin.com/aXiamY15
>>>
>>>
>>> Regards,
>>> Ramesh
>>>
>>>
>>> _______________________________________________
>>> Users mailing list
>>> Users(a)ovirt.org <mailto:Users@ovirt.org>
>>> http://lists.ovirt.org/mailman/listinfo/users
>>>
>>>
>>
>>
>
--------------050104050809050706040602--