[JIRA] (OVIRT-1763) Increase entropy for hosts

Evgheni Dereveanchin (oVirt JIRA) jira at ovirt-jira.atlassian.net
Wed Nov 15 13:27:54 UTC 2017


    [ https://ovirt-jira.atlassian.net/browse/OVIRT-1763?page=com.atlassian.jira.plugin.system.issuetabpanels:comment-tabpanel&focusedCommentId=35342#comment-35342 ] 

Evgheni Dereveanchin commented on OVIRT-1763:
---------------------------------------------

>From the provided job log I see an upgrade suite failure during add-host. 
In engine.log [1] there's a long update sequence at the end, that contains 570 packages including the kernel, systemd, glibc, rpm and looks like a full "yum update" is being run on the hypervisor. Is this expected? Shouldn't we just install VDSM and friends?

On the host itself [2] I see the upgrade progressing normally, no severe hangups. So I cannot see any direct proof of lack of enropy causing this. On successful runs the "yum update" with 570 packages takes 5 minutes, not 15 so will continue investigating.

In general, the timeout is not happening on lago hosts themselves but in VMs where OST is running. Those should have 'haveged' installed and running inside to provide entropy. If they don't - it needs to be installed and running. [~gbenhaim at redhat.com] - could you please confirm if we have haveged in lago VMs?

[1] http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/artifact/exported-artifacts/upgrade-from-release-suit-master-el7/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/lago-upgrade-from-release-suite-master-engine/_var_log/ovirt-engine/engine.log
[2] http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/artifact/exported-artifacts/upgrade-from-release-suit-master-el7/test_logs/upgrade-from-release-suite-master/post-002_bootstrap.py/lago-upgrade-from-release-suite-master-host0/_var_log/messages

> Increase entropy for hosts
> --------------------------
>
>                 Key: OVIRT-1763
>                 URL: https://ovirt-jira.atlassian.net/browse/OVIRT-1763
>             Project: oVirt - virtualization made easy
>          Issue Type: Bug
>            Reporter: Dafna Ron
>            Assignee: infra
>
> we had a failure in ost that was really hard to debug: http://jenkins.ovirt.org/job/ovirt-master_change-queue-tester/3795/
> There are no failures in the logs and the test itself was terminated by a timeout.
> It took the vms a long time to download packages and install and didi seems to think that this is due to limited entropy on the physical host. 
> we need to review this issue and increase the entropy on the hosts. 



--
This message was sent by Atlassian Jira
(v1001.0.0-SNAPSHOT#100071)
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.ovirt.org/pipermail/infra/attachments/20171115/ef6311f3/attachment-0001.html>


More information about the Infra mailing list