Hi Nir -
Thanks for getting back to me. Will the patch to 3.6 be backported to 3.5? As you can tell from the images, it takes days and days for it to increase over time. I also wasnt sure if that was the right bug because VDSM memory shows normal from top ...
PID USER PR NI VIRT RES SHR S %CPU %MEM TIME+ COMMAND
5134 vdsm 0 -20 4368m 111m 10m S 2.0 0.3 3709:28 vdsm
Res is only 111M. This is from node1 which is showing currently 20GB of 32GB used with only 2 VMs running on it - 1 with 4G and another with ~1 GB of RAM configured
The images are from nagios and the value here is a direct correlation to what you would see in the free command output. See below from an example of node 1 and node 2
[compute[root@node1 ~]$ free
total used free shared buffers cached
Mem: 32765316 20318156 12447160 252 30884 628948
-/+ buffers/cache: 19658324 13106992
Swap: 19247100 0 19247100
[compute[root@node1 ~]$ free -m
total used free shared buffers cached
Mem: 31997 19843 12153 0 30 614
-/+ buffers/cache: 19199 12798
Swap: 18795 0 18795
And as a control, node 2 that I just restarted today
[compute[root@node2 ~]$ free
total used free shared buffers cached
Mem: 32765316 1815324 30949992 212 35784 717320
-/+ buffers/cache: 1062220 31703096
Swap: 19247100 0 19247100
[compute[root@node2 ~]$ free -m
total used free shared buffers cached
Mem: 31997 1772 30225 0 34 700
-/+ buffers/cache: 1036 30960
Swap: 18795 0 18795
And its correlated image
http://i.imgur.com/8ldPVqY.png (~2GB used). Note how 1772 in the image is exactly what is registered under 'used' in free command