[Users] Snapshot taken while a live VM is running causes unresponsive hypervisor behavior

Hi,

Using 3.1.0-3.19 from dreyou's repo on CentOS 6.3, running multiple VMs on the hypervisor node, attempting to take a snapshot of one of the running VMs generates errors in the VDSM daemon, as shown below:

    [root@hyper02 ~]# vdsClient -s 0 list
    Traceback (most recent call last):
      File "/usr/share/vdsm/vdsClient.py", line 2275, in <module>
        code, message = commands[command][0](commandArgs)
      File "/usr/share/vdsm/vdsClient.py", line 280, in do_list
        response = self.s.list(True, vms)
      File "/usr/lib64/python2.6/xmlrpclib.py", line 1199, in __call__
        return self.__send(self.__name, args)
      File "/usr/lib64/python2.6/xmlrpclib.py", line 1489, in __request
        verbose=self.__verbose
      File "/usr/lib64/python2.6/xmlrpclib.py", line 1253, in request
        return self._parse_response(h.getfile(), sock)
      File "/usr/lib64/python2.6/xmlrpclib.py", line 1392, in _parse_response
        return u.close()
      File "/usr/lib64/python2.6/xmlrpclib.py", line 838, in close
        raise Fault(**self._stack[0])
    Fault: <Fault 1: "<type 'exceptions.TypeError'>:cannot marshal None unless allow_none is enabled">

Paste from the oVirt admin GUI log:

    2013-Feb-07, 15:29:59 VM ipa01 is down. Exit message: User shut down
    2013-Feb-07, 15:29:34 Migration failed due to Error: Fatal error during migration (VM: ipa01, Source Host: Hyper02).
    2013-Feb-07, 15:29:32 Starting migration of VM ipa01 from Host Hyper02 to Host Hyper01 (User: admin@internal.).
    2013-Feb-07, 15:29:10 Migration failed due to Error: Fatal error during migration (VM: ipa01, Source Host: Hyper02).
    2013-Feb-07, 15:29:10 Migration failed due to Error: Fatal error during migration (VM: ipa01, Source Host: Hyper02). Trying to migrate to another Host.
    2013-Feb-07, 15:29:08 Starting migration of VM ipa01 from Host Hyper02 to Host Hyper01 (User: admin@internal.).
    2013-Feb-07, 15:24:58 Detected new Host Hyper02. Host state was set to Up.
    2013-Feb-07, 15:24:53 Host Hyper02 is initializing. Message: Recovering from crash or Initializing
    2013-Feb-07, 15:22:18 VM ipa01 was resumed by admin@internal (Host: Hyper02).
    2013-Feb-07, 15:22:18 VM ipa01 was resumed by admin@internal (Host: Hyper02).
    2013-Feb-07, 15:20:43 Snapshot [initial installation][after ipa server installation] creation for VM ipa01 has been completed.
    2013-Feb-07, 15:20:43 VM ipa01 has paused due to unknown storage error.

Looking through htop I could see that the VMs were still running and responsive, but every vdsClient command failed with the traceback pasted above. The workaround was to connect to each VM, shut it down, restart vdsmd, and power it up again. There is no problem with storage capacity: the storage, served over NFS, has approximately 800 GB free, and all NFS mount points were working at the time.

After the VM was snapshotted it went into pause mode and had to be un-paused from the GUI, but the VM remained responsive over the network throughout.

Has anybody seen something related to this, or is it specific to the 3.1 version? I did search similar mailing list topics and could not find anything related to this error. I would happily provide logs if needed.

Regards,
Alex
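A note on the Fault above: "cannot marshal None unless allow_none is enabled" is generic xmlrpclib behavior, not anything oVirt-specific — it means the responding side tried to serialize a None value without enabling the <nil/> extension. A minimal reproduction, using Python 3's xmlrpc.client (the renamed xmlrpclib; this is not VDSM code):

```python
import xmlrpc.client  # "xmlrpclib" in the Python 2.6 traceback above

# Marshalling None without allow_none raises the same TypeError that
# VDSM's XML-RPC layer surfaced to vdsClient as Fault 1:
try:
    xmlrpc.client.dumps((None,))
except TypeError as exc:
    print(exc)  # cannot marshal None unless allow_none is enabled

# With allow_none=True, None is encoded using the <nil/> extension:
payload = xmlrpc.client.dumps((None,), allow_none=True)
print("<nil/>" in payload)  # True
```

So the Fault is a symptom rather than the root cause: VDSM produced a response containing a None value somewhere (presumably while the daemon was in its wedged post-snapshot state), and the XML-RPC layer refused to marshal it.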

The libvirt and qemu versions on the VDSM host may help the debugging.

Alexandru Vladulescu wrote:
> Using 3.1.0-3.19 from dreyou's repo on CentOS 6.3, running multiple VMs on the hypervisor node and attempting to take a snapshot from one of the running VMs generates errors for the VDSM daemon [snip]
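To collect the versions asked about here, a standard query on the hypervisor would look something like the following (the grep pattern is illustrative; exact package names differ between the CentOS/dreyou and Fedora builds):

```shell
# List installed libvirt/qemu/vdsm packages with their versions on the VDSM host.
rpm -qa | grep -E '^(libvirt|qemu|vdsm)' | sort
```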
--
---
舒明 Shu Ming
Open Virtualization Engineering; CSTL, IBM Corp.
Tel: 86-10-82451626  Tieline: 9051626
E-mail: shuming@cn.ibm.com or shuming@linux.vnet.ibm.com
Address: 3/F Ring Building, ZhongGuanCun Software Park, Haidian District, Beijing 100193, PRC

also full engine, vdsm and libvirtd logs

On 02/07/2013 05:05 PM, Shu Ming wrote:
> The libvirt and qemu version in the VDSM host may help the debugging.
> Alexandru Vladulescu:
>> Using 3.1.0-3.19 from dreyou's repo on CentOS 6.3, running multiple VMs on the hypervisor node and attempting to take a snapshot from one of the running VMs generates errors for the VDSM daemon [snip]
--
Dafna Ron

Hi

I had the same problem on oVirt 3.1 on F17, and it is resolved on F18 + oVirt 3.2.

Kevin

2013/2/7 Dafna Ron <dron@redhat.com>
> also full engine, vdsm and libvirtd logs
>
> On 02/07/2013 05:05 PM, Shu Ming wrote:
>> The libvirt and qemu version in the VDSM host may help the debugging.
>> Alexandru Vladulescu:
>>> Using 3.1.0-3.19 from dreyou's repo on CentOS 6.3, running multiple VMs on the hypervisor node and attempting to take a snapshot from one of the running VMs generates errors for the VDSM daemon [snip]
>
> --
> Dafna Ron
--
Kevin Mazière
Responsable Infrastructure
Alter Way – Hosting
1 rue Royal - 227 Bureaux de la Colline
92213 Saint-Cloud Cedex
Tél : +33 (0)1 41 16 38 41
Mob : +33 (0)7 62 55 57 05
http://www.alterway.fr

Hi,

Thanks Kevin for your feedback on this. I will try to attach the relevant logs in case anyone wants to dig further into this for any new probes or cases.

So here are the rpm versions installed:

    [root@hyper02 ~]# rpm -qa | grep vdsm
    vdsm-cli-4.10.0-0.44.14.el6.noarch
    vdsm-4.10.0-0.44.14.el6.x86_64
    vdsm-gluster-4.10.0-0.44.14.el6.noarch
    vdsm-xmlrpc-4.10.0-0.44.14.el6.noarch
    vdsm-bootstrap-4.10.0-0.44.14.el6.noarch
    vdsm-python-4.10.0-0.44.14.el6.x86_64
    [root@hyper02 ~]# rpm -qa | grep qemu
    qemu-img-rhev-0.12.1.2-2.295.el6.8.x86_64
    qemu-kvm-rhev-tools-0.12.1.2-2.295.el6.8.x86_64
    gpxe-roms-qemu-0.9.7-6.9.el6.noarch
    qemu-kvm-rhev-0.12.1.2-2.295.el6.8.x86_64

Regards,
Alex.

On 02/08/2013 10:44 AM, Kevin Maziere Aubry wrote:
> Hi
>
> I had the same problem on oVirt 3.1 on F17, and it is resolved on F18 + oVirt 3.2.
>
> Kevin
>
> 2013/2/7 Dafna Ron <dron@redhat.com>
>> also full engine, vdsm and libvirtd logs
>>
>> On 02/07/2013 05:05 PM, Shu Ming wrote:
>>> The libvirt and qemu version in the VDSM host may help the debugging. [snip]
>>
>> --
>> Dafna Ron
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
--
Alexandru Vladulescu
Platform System Engineer
---------------------------------------------------------------------------------
Bright Future Project Romania
Web url : www.bfproject.ro
Skype : avladulescu
Mobile : +4(0)726.373.098
---------------------------------------------------------------------------------
participants (4)
- Alexandru Vladulescu
- Dafna Ron
- Kevin Maziere Aubry
- Shu Ming