On Mon, Jul 13, 2015 at 5:57 AM, Shubhendu Tripathi
<shtripat(a)redhat.com> wrote:
> On 07/12/2015 09:53 PM, Omer Frenkel wrote:
>>
>> ----- Original Message -----
>>> From: "Liron Aravot" <laravot(a)redhat.com>
>>> To: "Ryan Groten" <Ryan.Groten(a)stantec.com>
>>> Cc: users(a)ovirt.org
>>> Sent: Sunday, July 12, 2015 5:44:28 PM
>>> Subject: Re: [ovirt-users] Concerns with increasing vdsTimeout value on
>>> engine?
>>>
>>>
>>>
>>> ----- Original Message -----
>>>> From: "Ryan Groten" <Ryan.Groten(a)stantec.com>
>>>> To: users(a)ovirt.org
>>>> Sent: Friday, July 10, 2015 10:45:11 PM
>>>> Subject: [ovirt-users] Concerns with increasing vdsTimeout value on
>>>> engine?
>>>>
>>>>
>>>>
>>>> When I try to attach new direct lun disks, the scan takes a very long
>>>> time
>>>> to
>>>> complete because of the number of pvs presented to my hosts (there is
>>>> already a bug on this, related to the pvcreate command taking a very
>>>> long
>>>> time -
https://bugzilla.redhat.com/show_bug.cgi?id=1217401 )
>>>>
>>>>
>>>>
>>>> I discovered a workaround by setting the vdsTimeout value higher (it is
>>>> 180
>>>> seconds by default). I changed it to 300 seconds and now the direct lun
>>>> scan
>>>> returns properly, but I’m hoping someone can warn me if this workaround
>>>> is
>>>> safe or if it’ll cause other potential issues? I made this change
>>>> yesterday
>>>> and so far so good.
>>>>
>>> Hi, no serious issue can be caused by that.
>>> Keep in mind though that any other operation will have that amount of
>>> time to
>>> complete before failing on timeout - which will
>>> cause delays before failing (as the timeout was increased for all
>>> executions)
>>> when not everything is operational and up as expected (as in most of the
>>> time).
>>> I'd guess that a RFE could be opened to allow increasing the timeout of
>>> specific operations if a user want to do that.
>>>
>>> thanks,
>>> Liron.
>> if you have HA vms and use power management (fencing),
>> this might cause longer downtime for HA vms if host has network timeouts:
>> the engine will wait for 3 network failures before trying to fence the
>> host,
>> so in case of timeouts, and increasing it to 5mins,
>> you should expect 15mins before engine will decide host is non-responsive
>> and fence,
>> so if you have HA vm on this host, this will be the vm downtime as well,
>> as the engine will restart HA vms only after fencing.
>>
>> you can read more on
>>
http://www.ovirt.org/Automatic_Fencing
>
> Even I am in a need where, I try to delete all the 256 gluster volume
> snapshots using a single gluster CLI command, and engine gets timed out.
> So, as Liron suggested it would be better if at VDSM verb level we are able
> to set timeout. That would be better option and caller needs to use the
> feature judicially :)
>
Please open a RFE for being able to set operation timeout for single
command call with description of use cases for which
you would like to set the timeout.
>>>> Thanks,
>>>>
>>>> Ryan
>>>>
>>>> _______________________________________________
>>>> Users mailing list
>>>> Users(a)ovirt.org
>>>>
http://lists.ovirt.org/mailman/listinfo/users
>>>>
>>> _______________________________________________
>>> Users mailing list
>>> Users(a)ovirt.org
>>>
http://lists.ovirt.org/mailman/listinfo/users
>>>
>> _______________________________________________
>> Users mailing list
>> Users(a)ovirt.org
>>
http://lists.ovirt.org/mailman/listinfo/users
>
> _______________________________________________
> Users mailing list
> Users(a)ovirt.org
>
http://lists.ovirt.org/mailman/listinfo/users