[ovirt-users] VDSM memory consumption

Dan Kenigsberg danken at redhat.com
Mon Mar 9 22:29:42 UTC 2015


On Mon, Mar 09, 2015 at 10:40:51AM -0500, Darrell Budic wrote:
> > On Mar 9, 2015, at 4:51 AM, Dan Kenigsberg <danken at redhat.com> wrote:
> > 
> > On Fri, Mar 06, 2015 at 10:58:53AM -0600, Darrell Budic wrote:
> >> I believe the supervdsm leak was fixed, but 3.5.1 versions of vdsmd still leaks slowly, ~300k/hr, yes.
> >> 
> >> https://bugzilla.redhat.com/show_bug.cgi?id=1158108
> >> 
> >> 
> >>> On Mar 6, 2015, at 10:23 AM, Chris Adams <cma at cmadams.net> wrote:
> >>> 
> >>> Once upon a time, Federico Alberto Sayd <fsayd at uncu.edu.ar> said:
> >>>> I am experiencing troubles with VDSM memory consuption.
> >>>> 
> >>>> I am running
> >>>> 
> >>>> Engine: ovirt 3.5.1
> >>>> 
> >>>> Nodes:
> >>>> 
> >>>> Centos 6.6
> >>>> VDSM 4.16.10-8
> >>>> Libvirt: libvirt-0.10.2-46
> >>>> Kernel: 2.6.32
> >>>> 
> >>>> When the host boots, memory consuption is normal, but after 2 or 3
> >>>> days running, VDSM memory consuption grows and it consumes more
> >>>> memory that all vm's running in the host. If I restart the vdsm
> >>>> service, memory consuption normalizes, but then it start growing
> >>>> again.
> >>>> 
> >>>> I have seen some BZ about vdsm and supervdsm about memory leaks, but
> >>>> I don't know if VDSM 4.6.10.8 is still affected by a related bug.
> >>> 
> >>> Can't help, but I see the same thing with CentOS 7 nodes and the same
> >>> version of vdsm.
> >>> -- 
> >>> Chris Adams <cma at cmadams.net>
> >>> _______________________________________________
> >>> Users mailing list
> >>> Users at ovirt.org
> >>> http://lists.ovirt.org/mailman/listinfo/users
> > 
> > I'm afraid that we are yet to find a solution for this issue, which is
> > completly different from the horrible leak of supervdsm < 4.16.7.
> > 
> > Could you corroborate the claim of
> >    Bug 1147148 - M2Crypto usage in vdsm leaks memory
> > ? Does the leak disappear once you start using plaintext transport?
> > 
> > Regards,
> > Dan.
> 
> I don’t think this is crypto related, but I could try that if you still need some confirmation (and point me at a quick doc on switching to plaintext?).
> 
> This is from #ovirt around November 18th I think, Saggi thought he’d found something related:
> 
> 9:58:43 AM saggi: YamakasY: Found the leak
> 9:58:48 AM saggi: YamakasY: Or at least the flow
> 9:58:57 AM saggi: YamakasY: The good news is that I can reproduce
> 9:59:20 AM YamakasY: saggi: that's kewl!
> 9:59:25 AM YamakasY: saggi: what happens ?
> 9:59:41 AM YamakasY: I know from Telsin (ping ping!) that he sees it going faster on gluster usage
> tdosek left the room (quit: Ping timeout: 480 seconds). (10:00:02 AM)
> djasa left the room (quit: Quit: Leaving). (10:00:24 AM)
> mlipchuk left the room (quit: Quit: Leaving.). (10:00:29 AM)
> laravot left the room (quit: Quit: Leaving.). (10:01:19 AM)
> 10:01:54 AM saggi: YamakasY: it's in getCapabilities(). Here is the RSS graph. The flatlines are when I stopped calling it and called other verbs. http://i.imgur.com/CLm0Q75.png

I do recall what is the issue Saggi and YamakasY were dicussing (CCing
the pair), or if it reached fruition as a patch. It is certainly
something other than Bug 1158108, as the latter speak about a leak in a
normal working state, with no getCapabilities calls.





More information about the Users mailing list