[ovirt-users] Diagnosing Migration Error
Nir Soffer
nsoffer at redhat.com
Fri Mar 18 19:27:21 UTC 2016
On Fri, Mar 18, 2016 at 7:59 PM, Charles Tassell <charles at islandadmin.ca> wrote:
> Hi Martin,
>
> Thanks, I checked that log and found the error " AttributeError:
> 'SourceThread' object has no attribute '_destServer'" which from what I see
> from my googling was supposedly fixed... I'm running
> vdsm-4.17.23-0.el7.centos.noarch which I believe is the latest. The full
> error is:
>
> Thread-2789::DEBUG::2016-03-18
> 14:47:04,200::blockSD::422::Storage.Misc.excCmd::(getReadDelay)
> /usr/bin/taskset --cpu-list 0-23 /usr/bin/dd
> if=/dev/383a1e7d-8b86-4a50-8bc3-e4374c57a3e5/metadata iflag=direct
> of=/dev/null bs=4096 count=1 (cwd None)
> Thread-2789::DEBUG::2016-03-18
> 14:47:04,211::blockSD::422::Storage.Misc.excCmd::(getReadDelay) SUCCESS:
> <err> = '1+0 records in\n1+0 records out\n4096 bytes (4.1 kB) copied,
> 0.000377289 s, 10.9 MB/s\n'; <rc> = 0
> mailbox.SPMMonitor::DEBUG::2016-03-18
> 14:47:04,999::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail)
> /usr/bin/taskset --cpu-list 0-23 dd
> if=/rhev/data-center/00000001-0001-0001-0001-0000000002e9/mastersd/dom_md/inbox
> iflag=direct,fullblock count=1 bs=1024000 (cwd None)
> mailbox.SPMMonitor::DEBUG::2016-03-18
> 14:47:05,015::storage_mailbox::735::Storage.Misc.excCmd::(_checkForMail)
> SUCCESS: <err> = '1+0 records in\n1+0 records out\n1024000 bytes (1.0 MB)
> copied, 0.0057206 s, 179 MB/s\n'; <rc> = 0
> jsonrpc.Executor/0::DEBUG::2016-03-18
> 14:47:06,551::__init__::503::jsonrpc.JsonRpcServer::(_serveRequest) Calling
> 'VM.migrate' in bridge with {u'params': {u'tunneled': u'false',
> u'autoConverge': u'false', u'src': u'ovirt-01.virt.roblib.upei.ca', u'dst':
> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId':
> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true',
> u'compressed': u'false', u'method': u'online'}, u'vmID':
> u'2be4938e-f4a3-4322-bae3-8a9628b81835'}
> jsonrpc.Executor/0::DEBUG::2016-03-18 14:47:06,552::API::546::vds::(migrate)
> {u'tunneled': u'false', u'autoConverge': u'false', u'src':
> u'ovirt-01.virt.roblib.upei.ca', u'dst':
> u'ovirt-02.virt.roblib.upei.ca:54321', u'vmId':
> u'2be4938e-f4a3-4322-bae3-8a9628b81835', u'abortOnError': u'true',
> u'compressed': u'false', u'method': u'online'}
> jsonrpc.Executor/0::DEBUG::2016-03-18
> 14:47:06,555::__init__::206::jsonrpc.Notification::(emit) Sending event
> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration
> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method":
> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"}
> jsonrpc.Executor/0::DEBUG::2016-03-18
> 14:47:06,555::__init__::533::jsonrpc.JsonRpcServer::(_serveRequest) Return
> 'VM.migrate' in bridge with True
> Thread-41096::ERROR::2016-03-18
> 14:47:06,559::migration::208::virt.vm::(_recover)
> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::[Errno -2] Name or service not
> known
> Thread-41096::ERROR::2016-03-18
> 14:47:06,559::migration::213::virt.vm::(_recover)
> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to destroy remote VM
> Traceback (most recent call last):
> File "/usr/share/vdsm/virt/migration.py", line 211, in _recover
> self._destServer.destroy(self._vm.id)
> AttributeError: 'SourceThread' object has no attribute '_destServer'
> Thread-41096::DEBUG::2016-03-18
> 14:47:06,560::__init__::206::jsonrpc.Notification::(emit) Sending event
> {"params": {"2be4938e-f4a3-4322-bae3-8a9628b81835": {"status": "Migration
> Source"}, "notify_time": 4311153220}, "jsonrpc": "2.0", "method":
> "|virt|VM_status|2be4938e-f4a3-4322-bae3-8a9628b81835"}
> Thread-41096::ERROR::2016-03-18 14:47:06,560::migration::309::virt.vm::(run)
> vmId=`2be4938e-f4a3-4322-bae3-8a9628b81835`::Failed to migrate
> Traceback (most recent call last):
> File "/usr/share/vdsm/virt/migration.py", line 278, in run
> self._setupVdsConnection()
> File "/usr/share/vdsm/virt/migration.py", line 143, in _setupVdsConnection
> client = self._createClient(port)
> File "/usr/share/vdsm/virt/migration.py", line 130, in _createClient
> self.remoteHost, int(port), sslctx)
> File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 1269, in
> create_connected_socket
> sock.connect((host, port))
> File "/usr/lib64/python2.7/site-packages/M2Crypto/SSL/Connection.py", line
> 181, in connect
> self.socket.connect(addr)
> File "/usr/lib64/python2.7/socket.py", line 224, in meth
> return getattr(self._sock,name)(*args)
> gaierror: [Errno -2] Name or service not known
This looks like a bug in the current code (looking in 4.0 development code).
1. We try to connect to the remote host:
2. Connectioin fails with DNS resolving error
3. self._destServer is not initialized becuase of 2
4. Code in _recover fail when access self._destServer
We should fix the initialization to avoid this error, but it will fix
your issue, which is
DNS error.
Maybe the address of the destination vm is wrong, or you have DNS issues in your
environment?
Adding Francesco.
Nir
>
>
> On 16-03-18 01:22 PM, Martin Sivak wrote:
>>
>> Hi Charles,
>>
>> you can probably get much more detailed explanation from the vdsm and
>> libvirt logs from the two hosts. Take a look at
>> /var/log/vdsm/vdsm.log.
>>
>> Best regards
>>
>> Martin Sivak
>> SLA / oVirt
>>
>>
>> On Fri, Mar 18, 2016 at 3:37 PM, Charles Tassell <charles at islandadmin.ca>
>> wrote:
>>>
>>> Hello,
>>>
>>> I'm trying to test migrating a VM from one host to another and it's
>>> failing, but with no apparent error message. I took a look at the server
>>> logs (which I've copied below) but there doesn't seem to be anything in
>>> here
>>> that would help track down the problem. Should I be looking somewhere
>>> besides engine.log on the hosted engine? This is the latest version of
>>> oVirt
>>> (3.6.3.4-1.el7.centos) that I just upgraded this morning. The VM has an
>>> ISO
>>> attached over NFS storage and the disk image is on fiber channel.
>>>
>>> 2016-03-18 14:31:18,913 INFO
>>> [org.ovirt.engine.core.bll.MigrateVmToServerCommand] (default task-25)
>>> [549545f2] Lock Acquired to object
>>> 'EngineLock:{exclusiveLocks='[2be4938e-f4a3-4322-bae3-8a9628b81835=<VM,
>>> ACTION_TYPE_FAILED_VM_IS_BEING_MIGRATED$VmName cdTest02>]',
>>> sharedLocks='null'}'
>>> 2016-03-18 14:31:19,131 INFO
>>> [org.ovirt.engine.core.bll.MigrateVmToServerCommand]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] Running command:
>>> MigrateVmToServerCommand internal: false. Entities affected : ID:
>>> 2be4938e-f4a3-4322-bae3-8a9628b81835 Type: VMAction group MIGRATE_VM with
>>> role type USER
>>> 2016-03-18 14:31:19,186 INFO
>>> [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] START, MigrateVDSCommand(
>>> MigrateVDSCommandParameters:{runAsync='true',
>>> hostId='cbfd733b-8ced-487d-8754-a2217ce1210f',
>>> vmId='2be4938e-f4a3-4322-bae3-8a9628b81835',
>>> srcHost='ovirt-01.virt.roblib.upei.ca',
>>> dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310',
>>> dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE',
>>> tunnelMigration='false', migrationDowntime='0', autoConverge='false',
>>> migrateCompressed='false', consoleAddress='null'}), log id: 8f303ba
>>> 2016-03-18 14:31:19,188 INFO
>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] START,
>>> MigrateBrokerVDSCommand(HostName = oVirt-01,
>>> MigrateVDSCommandParameters:{runAsync='true',
>>> hostId='cbfd733b-8ced-487d-8754-a2217ce1210f',
>>> vmId='2be4938e-f4a3-4322-bae3-8a9628b81835',
>>> srcHost='ovirt-01.virt.roblib.upei.ca',
>>> dstVdsId='1200a78f-6d05-4e5e-9ef7-6798cf741310',
>>> dstHost='ovirt-02.virt.roblib.upei.ca:54321', migrationMethod='ONLINE',
>>> tunnelMigration='false', migrationDowntime='0', autoConverge='false',
>>> migrateCompressed='false', consoleAddress='null'}), log id: 792d7e9e
>>> 2016-03-18 14:31:19,455 INFO
>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateBrokerVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH,
>>> MigrateBrokerVDSCommand, log id: 792d7e9e
>>> 2016-03-18 14:31:19,470 INFO
>>> [org.ovirt.engine.core.vdsbroker.MigrateVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] FINISH, MigrateVDSCommand,
>>> return: MigratingFrom, log id: 8f303ba
>>> 2016-03-18 14:31:19,498 INFO
>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>> (org.ovirt.thread.pool-8-thread-45) [549545f2] Correlation ID: 549545f2,
>>> Job
>>> ID: 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event
>>> ID:
>>> -1, Message: Migration started (VM: cdTest02, Source: oVirt-01,
>>> Destination:
>>> oVirt-02, User: admin at internal).
>>> 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer]
>>> (DefaultQuartzScheduler_Worker-39) [] VM
>>> '2be4938e-f4a3-4322-bae3-8a9628b81835'(cdTest02) moved from
>>> 'MigratingFrom'
>>> --> 'Up'
>>> 2016-03-18 14:31:28,055 INFO [org.ovirt.engine.core.vdsbroker.VmAnalyzer]
>>> (DefaultQuartzScheduler_Worker-39) [] Adding VM
>>> '2be4938e-f4a3-4322-bae3-8a9628b81835' to re-run list
>>> 2016-03-18 14:31:28,114 ERROR
>>> [org.ovirt.engine.core.vdsbroker.VmsMonitoring]
>>> (DefaultQuartzScheduler_Worker-39) [] Rerun VM
>>> '2be4938e-f4a3-4322-bae3-8a9628b81835'. Called from VDS 'oVirt-01'
>>> 2016-03-18 14:31:28,202 INFO
>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-46) [] START,
>>> MigrateStatusVDSCommand(HostName = oVirt-01,
>>> MigrateStatusVDSCommandParameters:{runAsync='true',
>>> hostId='cbfd733b-8ced-487d-8754-a2217ce1210f',
>>> vmId='2be4938e-f4a3-4322-bae3-8a9628b81835'}), log id: 618baaa6
>>> 2016-03-18 14:31:29,209 INFO
>>> [org.ovirt.engine.core.vdsbroker.vdsbroker.MigrateStatusVDSCommand]
>>> (org.ovirt.thread.pool-8-thread-46) [] FINISH, MigrateStatusVDSCommand,
>>> log
>>> id: 618baaa6
>>> 2016-03-18 14:31:29,250 ERROR
>>> [org.ovirt.engine.core.dal.dbbroker.auditloghandling.AuditLogDirector]
>>> (org.ovirt.thread.pool-8-thread-46) [] Correlation ID: 549545f2, Job ID:
>>> 3f97f737-d32a-4ffa-b213-122dcd3ca048, Call Stack: null, Custom Event ID:
>>> -1,
>>> Message: Migration failed (VM: cdTest02, Source: oVirt-01, Destination:
>>> oVirt-02).
>>> 2016-03-18 14:31:29,262 INFO
>>> [org.ovirt.engine.core.bll.MigrateVmToServerCommand]
>>> (org.ovirt.thread.pool-8-thread-46) [] Lock freed to object
>>> 'EngineLock:{exclusiveLocks='[2be4938e-f4a3-4322-bae3-8a9628b81835=<VM,
>>> ACTION_TYPE_FAILED_VM_IS_BEING_MIGRATED$VmName cdTest02>]',
>>> sharedLocks='null'}'
>>> _______________________________________________
>>> Users mailing list
>>> Users at ovirt.org
>>> http://lists.ovirt.org/mailman/listinfo/users
>
>
> _______________________________________________
> Users mailing list
> Users at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/users
More information about the Users
mailing list