[JIRA] (OVIRT-609) Jenkins snapshot creation failed

Evgheni Dereveanchin (oVirt JIRA) jira at ovirt-jira.atlassian.net
Fri Jun 24 09:26:00 UTC 2016


    [ https://ovirt-jira.atlassian.net/browse/OVIRT-609?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=17605#comment-17605 ] 

Evgheni Dereveanchin commented on OVIRT-609:
--------------------------------------------

The memory dump file is 33498670080 bytes (32GB) in size.
It took 11 minutes to copy it means the average speed was
around 40 megabytes per second. As the logs show storage
latency errors on other hosts during this time, it means
the storage was overwhelmed again - just not by builds,
but by this single consecutive write during snapshotting.

Similar messages can be seen during snapshotting artifactory
earlier the same day, but as that VM has less RAM it managed
to dump RAM within 3 minutes and succeeded.

> Jenkins snapshot creation failed
> --------------------------------
>
>                 Key: OVIRT-609
>                 URL: https://ovirt-jira.atlassian.net/browse/OVIRT-609
>             Project: oVirt - virtualization made easy
>          Issue Type: Bug
>            Reporter: Evgheni Dereveanchin
>            Assignee: infra
>
> [~ngoldin at redhat.com] issued a live snapshot creation on the Jenkins VM to prepare it for cluster move. This failed and it's not really clear why. Relevant event logs below, suggesting that the hypervisor  started dumping VM memory to the snapshot which caused a storage slowdown.
> {quote}2016-Jun-23, 18:06 Snapshot 'ngoldin_before_cluster_move' creation for VM 'jenkins-phx-ovirt-org' was initiated by admin.
> 2016-Jun-23, 18:09 Failed to create live snapshot 'ngoldin_before_cluster_move' for VM 'jenkins-phx-ovirt-org'. VM restart is recommended. Note that using the created snapshot might cause data inconsistency.
> 2016-Jun-23, 18:13 Host ovirt-srv02 has network interface which exceeded the defined threshold [95%] (em1: transmit rate[100%], receive rate [0%])
> 2016-Jun-23, 18:13 Storage domain Production experienced a high latency of 18.7802 seconds from host ovirt-srv11. This may cause performance and functional issues. Please consult your Storage Administrator.{quote}



--
This message was sent by Atlassian JIRA
(v1000.98.4#100004)



More information about the Infra mailing list