ngn build jobs take more than twice (x) as long as in the last days

[oVirt Jenkins]...

Jenkins build is back to stable :...

Fabian Deutsch

Tuesday, 24 May 2016 Tue, 24 May '16

10:57 a.m.

Hey, $subj says it all. Affected jobs are: http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ I.e. 3.6 - before: ~46min, now 1:23hrs In master it's even worse: >1:30hrs Can someone help to idnetify the reason? - fabian -- Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat

Show replies by date

David Caro

Tuesday, 24 May Tue, 24 May

11:03 a.m.

--q8dntDJTu318bll0 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/24 17:57, Fabian Deutsch wrote:

...

Hey, =20 $subj says it all. =20 Affected jobs are: http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ =20 I.e. 3.6 - before: ~46min, now 1:23hrs =20 In master it's even worse: >1:30hrs =20 Can someone help to idnetify the reason?

I see that this is where there's a big jump in time: 06:39:38 Domain installation still in progress. You can reconnect to=20 06:39:38 the console to complete the installation process. 07:21:51 ..................................................................= =2E........................................................................= =2E........................................................................= =2E........................................2016-05-24 03:21:51,341: Install= finished. Or at least virt shut down. So it looks as if the code that checks if the domain is shut down is not working properly, or maybe the virt-install is taking very very long to wor= k.

...

=20 - fabian =20 --=20 Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat _______________________________________________ Infra mailing list Infra(a)ovirt.org http://lists.ovirt.org/mailman/listinfo/infra

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --q8dntDJTu318bll0 Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--q8dntDJTu318bll0--

David Caro

11:06 a.m.

--i6vqABX3nJKXLk01 Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/24 18:03, David Caro wrote:

...

On 05/24 17:57, Fabian Deutsch wrote: > Hey, >=20 > $subj says it all. >=20 > Affected jobs are: > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ >=20 > I.e. 3.6 - before: ~46min, now 1:23hrs >=20 > In master it's even worse: >1:30hrs >=20 > Can someone help to idnetify the reason? =20 =20 I see that this is where there's a big jump in time: =20 06:39:38 Domain installation still in progress. You can reconnect to=20 06:39:38 the console to complete the installation process. 07:21:51 ................................................................=

=2E........................................................................= =2E........................................................................= =2E..........................................2016-05-24 03:21:51,341: Insta= ll finished. Or at least virt shut down.

...

=20 So it looks as if the code that checks if the domain is shut down is not working properly, or maybe the virt-install is taking very very long to w=

ork. It seems that the virt-install log is not being archived, and the workdir h= as already been cleaned up so I can check the logfile: /home/jenkins/workspace/ovirt-node-ng_ovirt-3.6_build-artifacts-fc22-x86= _64/ovirt-node-ng/virt-install.log Maybe you can archive it too on the next run to debug

...

=20 >=20 > - fabian >=20 > --=20 > Fabian Deutsch <fdeutsch(a)redhat.com> > RHEV Hypervisor > Red Hat > _______________________________________________ > Infra mailing list > Infra(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/infra =20 --=20 David Caro =20 Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D =20 Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --i6vqABX3nJKXLk01 Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--i6vqABX3nJKXLk01--

Eyal Edri

11:50 a.m.

Is it running on the same slave as before? Can we see a changelog of things changed from the Last time it took less time? On May 24, 2016 7:06 PM, "David Caro" <dcaro(a)redhat.com> wrote:

...

On 05/24 18:03, David Caro wrote: > On 05/24 17:57, Fabian Deutsch wrote: > > Hey, > > > > $subj says it all. > > > > Affected jobs are: > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > > > > I.e. 3.6 - before: ~46min, now 1:23hrs > > > > In master it's even worse: >1:30hrs > > > > Can someone help to idnetify the reason? > > > I see that this is where there's a big jump in time: > > 06:39:38 Domain installation still in progress. You can reconnect to > 06:39:38 the console to complete the installation process. > 07:21:51 .............................................................................................................................................................................................................................................................2016-05-24 03:21:51,341: Install finished. Or at least virt shut down. > > So it looks as if the code that checks if the domain is shut down is not > working properly, or maybe the virt-install is taking very very long to work. It seems that the virt-install log is not being archived, and the workdir has already been cleaned up so I can check the logfile: /home/jenkins/workspace/ovirt-node-ng_ovirt-3.6_build-artifacts-fc22-x86_64/ovirt-node-ng/virt-install.log Maybe you can archive it too on the next run to debug > > > > > - fabian > > > > -- > > Fabian Deutsch <fdeutsch(a)redhat.com> > > RHEV Hypervisor > > Red Hat > > _______________________________________________ > > Infra mailing list > > Infra(a)ovirt.org > > http://lists.ovirt.org/mailman/listinfo/infra > > -- > David Caro > > Red Hat S.L. > Continuous Integration Engineer - EMEA ENG Virtualization R&D > > Tel.: +420 532 294 605 > Email: dcaro(a)redhat.com > IRC: dcaro|dcaroest@{freenode|oftc|redhat} > Web: www.redhat.com > RHT Global #: 82-62605 -- David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 _______________________________________________ Infra mailing list Infra(a)ovirt.org http://lists.ovirt.org/mailman/listinfo/infra

Fabian Deutsch

1:30 p.m.

...

Is it running on the same slave as before? Can we see a changelog of things changed from the Last time it took less time? On May 24, 2016 7:06 PM, "David Caro" <dcaro(a)redhat.com> wrote: > > On 05/24 18:03, David Caro wrote: > > On 05/24 17:57, Fabian Deutsch wrote: > > > Hey, > > > > > > $subj says it all. > > > > > > Affected jobs are: > > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > > > > > > I.e. 3.6 - before: ~46min, now 1:23hrs > > > > > > In master it's even worse: >1:30hrs > > > > > > Can someone help to idnetify the reason? > > > > > > I see that this is where there's a big jump in time: > > > > 06:39:38 Domain installation still in progress. You can reconnect to > > 06:39:38 the console to complete the installation process. > > 07:21:51 > > .............................................................................................................................................................................................................................................................2016-05-24 > > 03:21:51,341: Install finished. Or at least virt shut down. > > > > So it looks as if the code that checks if the domain is shut down is not > > working properly, or maybe the virt-install is taking very very long to > > work. > > > It seems that the virt-install log is not being archived, and the workdir > has > already been cleaned up so I can check the logfile: > > > /home/jenkins/workspace/ovirt-node-ng_ovirt-3.6_build-artifacts-fc22-x86_64/ovirt-node-ng/virt-install.log > > > Maybe you can archive it too on the next run to debug > > > > > > > > > - fabian > > > > > > -- > > > Fabian Deutsch <fdeutsch(a)redhat.com> > > > RHEV Hypervisor > > > Red Hat > > > _______________________________________________ > > > Infra mailing list > > > Infra(a)ovirt.org > > > http://lists.ovirt.org/mailman/listinfo/infra > > > > -- > > David Caro > > > > Red Hat S.L. > > Continuous Integration Engineer - EMEA ENG Virtualization R&D > > > > Tel.: +420 532 294 605 > > Email: dcaro(a)redhat.com > > IRC: dcaro|dcaroest@{freenode|oftc|redhat} > > Web: www.redhat.com > > RHT Global #: 82-62605 > > > > -- > David Caro > > Red Hat S.L. > Continuous Integration Engineer - EMEA ENG Virtualization R&D > > Tel.: +420 532 294 605 > Email: dcaro(a)redhat.com > IRC: dcaro|dcaroest@{freenode|oftc|redhat} > Web: www.redhat.com > RHT Global #: 82-62605 > > _______________________________________________ > Infra mailing list > Infra(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/infra >

-- Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat

Eyal Edri

1:49 p.m.

btw, feel free to increase history for node builds now, we have the space for it. I wouldn't got crazy with keeping history of artifacts, but build info with console output there is no problem keeping a month old. I'm adding Barak as this weekly infra owner to help debug this tomorrow if needed. Also, Evgheni is working on network configuration, might be relevant as well. On Tue, May 24, 2016 at 9:30 PM, Fabian Deutsch <fdeutsch(a)redhat.com> wrote:

...

I don't know if it was quicker on different salves, because I can not look at the history. There were no Node sided changes, maybe the package set in master got larger - well see that on the first regular build. But: I enabled kvm in mock, and this seems to help. The first job post this chnage looks good. My assumption is that previous builds were running on a beefier slave. - fabian On Tue, May 24, 2016 at 6:50 PM, Eyal Edri <eedri(a)redhat.com> wrote: > Is it running on the same slave as before? Can we see a changelog of things > changed from the Last time it took less time? > > On May 24, 2016 7:06 PM, "David Caro" <dcaro(a)redhat.com> wrote: >> >> On 05/24 18:03, David Caro wrote: >> > On 05/24 17:57, Fabian Deutsch wrote: >> > > Hey, >> > > >> > > $subj says it all. >> > > >> > > Affected jobs are: >> > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ >> > > >> > > I.e. 3.6 - before: ~46min, now 1:23hrs >> > > >> > > In master it's even worse: >1:30hrs >> > > >> > > Can someone help to idnetify the reason? >> > >> > >> > I see that this is where there's a big jump in time: >> > >> > 06:39:38 Domain installation still in progress. You can reconnect to >> > 06:39:38 the console to complete the installation process. >> > 07:21:51 >> > .............................................................................................................................................................................................................................................................2016-05-24 >> > 03:21:51,341: Install finished. Or at least virt shut down. >> > >> > So it looks as if the code that checks if the domain is shut down is not >> > working properly, or maybe the virt-install is taking very very long to >> > work. >> >> >> It seems that the virt-install log is not being archived, and the workdir >> has >> already been cleaned up so I can check the logfile: >> >> >> /home/jenkins/workspace/ovirt-node-ng_ovirt-3.6_build-artifacts-fc22-x86_64/ovirt-node-ng/virt-install.log >> >> >> Maybe you can archive it too on the next run to debug >> >> > >> > > >> > > - fabian >> > > >> > > -- >> > > Fabian Deutsch <fdeutsch(a)redhat.com> >> > > RHEV Hypervisor >> > > Red Hat >> > > _______________________________________________ >> > > Infra mailing list >> > > Infra(a)ovirt.org >> > > http://lists.ovirt.org/mailman/listinfo/infra >> > >> > -- >> > David Caro >> > >> > Red Hat S.L. >> > Continuous Integration Engineer - EMEA ENG Virtualization R&D >> > >> > Tel.: +420 532 294 605 >> > Email: dcaro(a)redhat.com >> > IRC: dcaro|dcaroest@{freenode|oftc|redhat} >> > Web: www.redhat.com >> > RHT Global #: 82-62605 >> >> >> >> -- >> David Caro >> >> Red Hat S.L. >> Continuous Integration Engineer - EMEA ENG Virtualization R&D >> >> Tel.: +420 532 294 605 >> Email: dcaro(a)redhat.com >> IRC: dcaro|dcaroest@{freenode|oftc|redhat} >> Web: www.redhat.com >> RHT Global #: 82-62605 >> >> _______________________________________________ >> Infra mailing list >> Infra(a)ovirt.org >> http://lists.ovirt.org/mailman/listinfo/infra >> > -- Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Fabian Deutsch

2:10 p.m.

On Tue, May 24, 2016 at 8:49 PM, Eyal Edri <eedri(a)redhat.com> wrote:

...

~2GB * 3 * 30 == 180GB of disk space, ok?

...

I'm adding Barak as this weekly infra owner to help debug this tomorrow if needed. Also, Evgheni is working on network configuration, might be relevant as well.

Actually, one thought: I think it might make sense to change the standard job template to support nesting. Currently i need to create /dev/kvm to get at least that acceleration. (If the app inside mock s speaking to /dev/kvm directly). Also for libvirt a mount is needed. My take would be: Change the standard ci job template so that every job can use libvirt + geustfosh (w/ acceleration) right away without needing these hacks. If that sounds viable, then I can open a ticket. - fabain

...

On Tue, May 24, 2016 at 9:30 PM, Fabian Deutsch <fdeutsch(a)redhat.com> wrote: > > I don't know if it was quicker on different salves, because I can not > look at the history. > > There were no Node sided changes, maybe the package set in master got > larger - well see that on the first regular build. > > But: I enabled kvm in mock, and this seems to help. The first job post > this chnage looks good. > > My assumption is that previous builds were running on a beefier slave. > > - fabian > > On Tue, May 24, 2016 at 6:50 PM, Eyal Edri <eedri(a)redhat.com> wrote: > > Is it running on the same slave as before? Can we see a changelog of > > things > > changed from the Last time it took less time? > > > > On May 24, 2016 7:06 PM, "David Caro" <dcaro(a)redhat.com> wrote: > >> > >> On 05/24 18:03, David Caro wrote: > >> > On 05/24 17:57, Fabian Deutsch wrote: > >> > > Hey, > >> > > > >> > > $subj says it all. > >> > > > >> > > Affected jobs are: > >> > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > >> > > > >> > > I.e. 3.6 - before: ~46min, now 1:23hrs > >> > > > >> > > In master it's even worse: >1:30hrs > >> > > > >> > > Can someone help to idnetify the reason? > >> > > >> > > >> > I see that this is where there's a big jump in time: > >> > > >> > 06:39:38 Domain installation still in progress. You can reconnect to > >> > 06:39:38 the console to complete the installation process. > >> > 07:21:51 > >> > > >> > .............................................................................................................................................................................................................................................................2016-05-24 > >> > 03:21:51,341: Install finished. Or at least virt shut down. > >> > > >> > So it looks as if the code that checks if the domain is shut down is > >> > not > >> > working properly, or maybe the virt-install is taking very very long > >> > to > >> > work. > >> > >> > >> It seems that the virt-install log is not being archived, and the > >> workdir > >> has > >> already been cleaned up so I can check the logfile: > >> > >> > >> > >> /home/jenkins/workspace/ovirt-node-ng_ovirt-3.6_build-artifacts-fc22-x86_64/ovirt-node-ng/virt-install.log > >> > >> > >> Maybe you can archive it too on the next run to debug > >> > >> > > >> > > > >> > > - fabian > >> > > > >> > > -- > >> > > Fabian Deutsch <fdeutsch(a)redhat.com> > >> > > RHEV Hypervisor > >> > > Red Hat > >> > > _______________________________________________ > >> > > Infra mailing list > >> > > Infra(a)ovirt.org > >> > > http://lists.ovirt.org/mailman/listinfo/infra > >> > > >> > -- > >> > David Caro > >> > > >> > Red Hat S.L. > >> > Continuous Integration Engineer - EMEA ENG Virtualization R&D > >> > > >> > Tel.: +420 532 294 605 > >> > Email: dcaro(a)redhat.com > >> > IRC: dcaro|dcaroest@{freenode|oftc|redhat} > >> > Web: www.redhat.com > >> > RHT Global #: 82-62605 > >> > >> > >> > >> -- > >> David Caro > >> > >> Red Hat S.L. > >> Continuous Integration Engineer - EMEA ENG Virtualization R&D > >> > >> Tel.: +420 532 294 605 > >> Email: dcaro(a)redhat.com > >> IRC: dcaro|dcaroest@{freenode|oftc|redhat} > >> Web: www.redhat.com > >> RHT Global #: 82-62605 > >> > >> _______________________________________________ > >> Infra mailing list > >> Infra(a)ovirt.org > >> http://lists.ovirt.org/mailman/listinfo/infra > >> > > > > > > -- > Fabian Deutsch <fdeutsch(a)redhat.com> > RHEV Hypervisor > Red Hat -- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

-- Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat

Sandro Bonazzola

3:19 p.m.

Il 24/Mag/2016 17:57, "Fabian Deutsch" <fdeutsch(a)redhat.com> ha scritto:

...

I have no numbers but I have the feeling that all jobs are getting slower since a couple of weeks ago. Yum install phase takes ages. I thoughtit was some temporary storage i/o peak but looks like it's not temporary.

...

- fabian -- Fabian Deutsch <fdeutsch(a)redhat.com> RHEV Hypervisor Red Hat _______________________________________________ Infra mailing list Infra(a)ovirt.org http://lists.ovirt.org/mailman/listinfo/infra

Eyal Edri

Wednesday, 25 May Wed, 25 May

2:31 a.m.

...

Il 24/Mag/2016 17:57, "Fabian Deutsch" <fdeutsch(a)redhat.com> ha scritto: > > Hey, > > $subj says it all. > > Affected jobs are: > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > > I.e. 3.6 - before: ~46min, now 1:23hrs > > In master it's even worse: >1:30hrs > > Can someone help to idnetify the reason? I have no numbers but I have the feeling that all jobs are getting slower since a couple of weeks ago. Yum install phase takes ages. I thoughtit was some temporary storage i/o peak but looks like it's not temporary. > > - fabian > > -- > Fabian Deutsch <fdeutsch(a)redhat.com> > RHEV Hypervisor > Red Hat > _______________________________________________ > Infra mailing list > Infra(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/infra _______________________________________________ Infra mailing list Infra(a)ovirt.org http://lists.ovirt.org/mailman/listinfo/infra

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Eyal Edri

2:33 a.m.

Fabian, Please open a ticket to enable all VMs to support nested, I don't see a reason why not to. Evgheni - we'll need to make sure all hosts supports nested, it will require reboot of a server if its not enabled now. On Wed, May 25, 2016 at 10:31 AM, Eyal Edri <eedri(a)redhat.com> wrote:

...

It might be more load on the storage servers with now running much more jobs. Evgheni - can you check if the load on the storage servers has changed significantly to justify this degradation of service? We need to expedite the enablement of SSDs in the hypervisors and move to local hooks. Anton - do we have a test VM that uses a local DISK we can use to test if it improves the runtime? On Tue, May 24, 2016 at 11:19 PM, Sandro Bonazzola <sbonazzo(a)redhat.com> wrote: > > Il 24/Mag/2016 17:57, "Fabian Deutsch" <fdeutsch(a)redhat.com> ha scritto: > > > > Hey, > > > > $subj says it all. > > > > Affected jobs are: > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > > > > I.e. 3.6 - before: ~46min, now 1:23hrs > > > > In master it's even worse: >1:30hrs > > > > Can someone help to idnetify the reason? > > I have no numbers but I have the feeling that all jobs are getting slower > since a couple of weeks ago. Yum install phase takes ages. I thoughtit was > some temporary storage i/o peak but looks like it's not temporary. > > > > > - fabian > > > > -- > > Fabian Deutsch <fdeutsch(a)redhat.com> > > RHEV Hypervisor > > Red Hat > > _______________________________________________ > > Infra mailing list > > Infra(a)ovirt.org > > http://lists.ovirt.org/mailman/listinfo/infra > > _______________________________________________ > Infra mailing list > Infra(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/infra > > -- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Barak Korren

2:59 a.m.

On 25 May 2016 at 10:33, Eyal Edri <eedri(a)redhat.com> wrote:

...

The point is not to make VMs support nesting (they all already do probably) Its to configure std-ci to pre-configure KVM access for mock (without having the devs specify that in the automation dir) I'm not sure I'm in favour of that.

Eyal Edri

3:02 a.m.

If that is the case, and the required need can be set in the automation dir the I agree. Is that possible to set via scripts in the automation dir? On Wed, May 25, 2016 at 10:59 AM, Barak Korren <bkorren(a)redhat.com> wrote:

...

On 25 May 2016 at 10:33, Eyal Edri <eedri(a)redhat.com> wrote: > Fabian, > Please open a ticket to enable all VMs to support nested, I don't see a > reason why not to. > Evgheni - we'll need to make sure all hosts supports nested, it will require > reboot of a server if its not enabled now. > The point is not to make VMs support nesting (they all already do probably) Its to configure std-ci to pre-configure KVM access for mock (without having the devs specify that in the automation dir) I'm not sure I'm in favour of that.

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Barak Korren

3:11 a.m.

On 25 May 2016 at 11:02, Eyal Edri <eedri(a)redhat.com> wrote:

...

If that is the case, and the required need can be set in the automation dir the I agree. Is that possible to set via scripts in the automation dir?

Yes. that is how Lago does it. -- Barak Korren bkorren(a)redhat.com RHEV-CI Team

Evgheni Dereveanchin

4:38 a.m.

...

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Eyal Edri

4:44 a.m.

OK, I suggest to test using a VM with local disk (preferably on a host with SSD configured), if its working, lets expedite moving all VMs or at least a large amount of VMs to it until we see network load reduced. e. On Wed, May 25, 2016 at 12:38 PM, Evgheni Dereveanchin <ederevea(a)redhat.com> wrote:

...

Hi, I checked SAR data on storage servers and compared loads yesterday and three weeks ago (May 3). Load values are in pretty much the same range, yet now most of the time they are around the "high" mark so we may be nearing a bottleneck, specifically on I/O where we mostly do writes to the NAS, not reads and there's quite a bit of overhead: VM -> QCOW -> file -> network -> NFS -> DRBD -> disk Surely, using local scratch disks stored on SSDs should greatly improve performance as at least half of the above steps will be gone. We don't really need to centrally store (NFS) or mirror (DRBD) data that slaves write to their disks all the time anyways. For VMs where we do need redundancy, I'd suggest using iSCSI storage domains in the long run. Regards, Evgheni Dereveanchin ----- Original Message ----- From: "Eyal Edri" <eedri(a)redhat.com> To: "Sandro Bonazzola" <sbonazzo(a)redhat.com>, "Evgheni Dereveanchin" < ederevea(a)redhat.com>, "Anton Marchukov" <amarchuk(a)redhat.com> Cc: "Fabian Deutsch" <fdeutsch(a)redhat.com>, "infra" <infra(a)ovirt.org> Sent: Wednesday, 25 May, 2016 9:31:43 AM Subject: Re: ngn build jobs take more than twice (x) as long as in the last days It might be more load on the storage servers with now running much more jobs. Evgheni - can you check if the load on the storage servers has changed significantly to justify this degradation of service? We need to expedite the enablement of SSDs in the hypervisors and move to local hooks. Anton - do we have a test VM that uses a local DISK we can use to test if it improves the runtime? On Tue, May 24, 2016 at 11:19 PM, Sandro Bonazzola <sbonazzo(a)redhat.com> wrote: > > Il 24/Mag/2016 17:57, "Fabian Deutsch" <fdeutsch(a)redhat.com> ha scritto: > > > > Hey, > > > > $subj says it all. > > > > Affected jobs are: > > http://jenkins.ovirt.org/user/fabiand/my-views/view/ovirt-node-ng/ > > > > I.e. 3.6 - before: ~46min, now 1:23hrs > > > > In master it's even worse: >1:30hrs > > > > Can someone help to idnetify the reason? > > I have no numbers but I have the feeling that all jobs are getting slower > since a couple of weeks ago. Yum install phase takes ages. I thoughtit was > some temporary storage i/o peak but looks like it's not temporary. > > > > > - fabian > > > > -- > > Fabian Deutsch <fdeutsch(a)redhat.com> > > RHEV Hypervisor > > Red Hat > > _______________________________________________ > > Infra mailing list > > Infra(a)ovirt.org > > http://lists.ovirt.org/mailman/listinfo/infra > > _______________________________________________ > Infra mailing list > Infra(a)ovirt.org > http://lists.ovirt.org/mailman/listinfo/infra > > -- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Barak Korren

6:42 a.m.

On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote:

...

This is not that easy, oVirt doesn't support mixing local disk and storage in the same cluster, so we will need to move hosts to a new cluster for this. Also we will lose the ability to use templates, or otherwise have to create the templates on each and every disk. The scratch disk is a good solution for this, where you can have the OS image on the central storage and the ephemeral data on the local disk. WRT to the storage architecture - a single huge (10.9T) ext4 is used as the FS on top of the DRBD, this is probably not the most efficient thing one can do (XFS would probably have been better, RAW via iSCSI - even better). I'm guessing that those 10/9TB are not made from a single disk but with a hardware RAID of some sort. In this case deactivating the hardware RAID and re-exposing it as multiple separate iSCSI LUNs (That are then re-joined to a single sotrage domain in oVirt) will enable different VMs to concurrently work on different disks. This should lower the per-vm storage latency. Looking at the storage machine I see strong indication it is IO bound - the load average is ~12 while there are just 1-5 working processes and the CPU is ~80% idle and the rest is IO wait. Running 'du *' at: /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/images one can see that most images are ~40G in size (that is _real_ 40G not sparse!). This means that despite having most VMs created based on templates, the VMs are full template copies rather then COW clones. What this means is that using pools (where all VMs are COW copies of the single pool template) is expected to significantly reduce the storage utilization and therefore the IO load on it (the less you store, the less you need to read back). -- Barak Korren bkorren(a)redhat.com RHEV-CI Team

David Caro

6:52 a.m.

--jigfid2yHjNFZUTO Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/25 14:42, Barak Korren wrote:

...

On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote: > OK, > I suggest to test using a VM with local disk (preferably on a host with=

SSD

...

> configured), if its working, > lets expedite moving all VMs or at least a large amount of VMs to it un=

til

...

> we see network load reduced. > =20 This is not that easy, oVirt doesn't support mixing local disk and storage in the same cluster, so we will need to move hosts to a new cluster for this. Also we will lose the ability to use templates, or otherwise have to create the templates on each and every disk. =20 The scratch disk is a good solution for this, where you can have the OS image on the central storage and the ephemeral data on the local disk. =20 WRT to the storage architecture - a single huge (10.9T) ext4 is used as the FS on top of the DRBD, this is probably not the most efficient thing one can do (XFS would probably have been better, RAW via iSCSI - even better).

That was done >3 years ago, xfs was not quite stable and widely used and supported back then.

...

=20 I'm guessing that those 10/9TB are not made from a single disk but with a hardware RAID of some sort. In this case deactivating the hardware RAID and re-exposing it as multiple separate iSCSI LUNs (That are then re-joined to a single sotrage domain in oVirt) will enable different VMs to concurrently work on different disks. This should lower the per-vm storage latency.

That would get rid of the drbd too, it's a totally different setup, from scratch (no nfs either).

...

=20 Looking at the storage machine I see strong indication it is IO bound - the load average is ~12 while there are just 1-5 working processes and the CPU is ~80% idle and the rest is IO wait. =20 Running 'du *' at: /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/images one can see that most images are ~40G in size (that is _real_ 40G not sparse!). This means that despite having most VMs created based on templates, the VMs are full template copies rather then COW clones.

That should not be like that, maybe the templates are wrongly configured? or foreman images?

...

What this means is that using pools (where all VMs are COW copies of the single pool template) is expected to significantly reduce the storage utilization and therefore the IO load on it (the less you store, the less you need to read back).

That should happen too without pools, with normal qcow templates. And in any case, that will not lower the normal io, when not actually creat= ing vms, as any read and write will still hit the disk anyhow, it only alleviat= es the io when creating new vms. The local disk (scratch disk) is the best opt= ion imo, now and for the foreseeable future.

...

=20 --=20 Barak Korren bkorren(a)redhat.com RHEV-CI Team _______________________________________________ Infra mailing list Infra(a)ovirt.org http://lists.ovirt.org/mailman/listinfo/infra

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --jigfid2yHjNFZUTO Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--jigfid2yHjNFZUTO--

Barak Korren

8:09 a.m.

On 25 May 2016 at 14:52, David Caro <dcaro(a)redhat.com> wrote:

...

On 05/25 14:42, Barak Korren wrote: > On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote: > > OK, > > I suggest to test using a VM with local disk (preferably on a host with SSD > > configured), if its working, > > lets expedite moving all VMs or at least a large amount of VMs to it until > > we see network load reduced. > > > > This is not that easy, oVirt doesn't support mixing local disk and > storage in the same cluster, so we will need to move hosts to a new > cluster for this. > Also we will lose the ability to use templates, or otherwise have to > create the templates on each and every disk. > > The scratch disk is a good solution for this, where you can have the > OS image on the central storage and the ephemeral data on the local > disk. > > WRT to the storage architecture - a single huge (10.9T) ext4 is used > as the FS on top of the DRBD, this is probably not the most efficient > thing one can do (XFS would probably have been better, RAW via iSCSI - > even better). That was done >3 years ago, xfs was not quite stable and widely used and supported back then.

AFAIK it pre-dates EXT4, in any case this does not detract from the fact that the current configuration in not as efficient as we can make it.

...

> > I'm guessing that those 10/9TB are not made from a single disk but > with a hardware RAID of some sort. In this case deactivating the > hardware RAID and re-exposing it as multiple separate iSCSI LUNs (That > are then re-joined to a single sotrage domain in oVirt) will enable > different VMs to concurrently work on different disks. This should > lower the per-vm storage latency. That would get rid of the drbd too, it's a totally different setup, from scratch (no nfs either).

We can and should still use DRBD, just setup a device for each disk. But yeah, NFS should probably go away. (We are seeing dramatically better performance for iSCSI in integration-engine)

...

> > Looking at the storage machine I see strong indication it is IO bound > - the load average is ~12 while there are just 1-5 working processes > and the CPU is ~80% idle and the rest is IO wait. > > Running 'du *' at: > /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/images > one can see that most images are ~40G in size (that is _real_ 40G not > sparse!). This means that despite having most VMs created based on > templates, the VMs are full template copies rather then COW clones. That should not be like that, maybe the templates are wrongly configured? or foreman images?

This is the expected behaviour when creating a VM from template in the oVirt admin UI. I thought Foreman might behave differently, but it seems it does not. This behaviour is determined by the parameters you pass to the engine API when instantiating a VM, so it most probably doesn't have anything to do with the template configuration.

...

> What this means is that using pools (where all VMs are COW copies of > the single pool template) is expected to significantly reduce the > storage utilization and therefore the IO load on it (the less you > store, the less you need to read back). That should happen too without pools, with normal qcow templates.

Not unless you create all the VMs via the API and pass the right parameters. Pools are the easiest way to ensure you never mess that up...

...

And in any case, that will not lower the normal io, when not actually creating vms, as any read and write will still hit the disk anyhow, it only alleviates the io when creating new vms.

Since you are reading the same bits over and over (for different VMs) you enable the various buffer caches along the way (in the storage machines and in the hypevirsors) to do what they are supposed to.

...

The local disk (scratch disk) is the best option imo, now and for the foreseeable future.

This is not an either/or thing, IMO we need to do both. -- Barak Korren bkorren(a)redhat.com RHEV-CI Team

David Caro

10:06 a.m.

--qjXtncIm5b3tWrFJ Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/25 16:09, Barak Korren wrote:

...

On 25 May 2016 at 14:52, David Caro <dcaro(a)redhat.com> wrote: > On 05/25 14:42, Barak Korren wrote: >> On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote: >> > OK, >> > I suggest to test using a VM with local disk (preferably on a host w=

ith SSD

...

>> > configured), if its working, >> > lets expedite moving all VMs or at least a large amount of VMs to it=

until

...

>> > we see network load reduced. >> > >> >> This is not that easy, oVirt doesn't support mixing local disk and >> storage in the same cluster, so we will need to move hosts to a new >> cluster for this. >> Also we will lose the ability to use templates, or otherwise have to >> create the templates on each and every disk. >> >> The scratch disk is a good solution for this, where you can have the >> OS image on the central storage and the ephemeral data on the local >> disk. >> >> WRT to the storage architecture - a single huge (10.9T) ext4 is used >> as the FS on top of the DRBD, this is probably not the most efficient >> thing one can do (XFS would probably have been better, RAW via iSCSI - >> even better). > > That was done >3 years ago, xfs was not quite stable and widely used and > supported back then. > AFAIK it pre-dates EXT4

It does, but for el6, it was performing way poorly, and with more bugs (for what the reviews of it said at the time).

...

in any case this does not detract from the fact that the current configuration in not as efficient as we can make it. =20

It does not, I agree to better focus on what we can do now on, now what sho= uld have been done then.

...

=20 >> >> I'm guessing that those 10/9TB are not made from a single disk but >> with a hardware RAID of some sort. In this case deactivating the >> hardware RAID and re-exposing it as multiple separate iSCSI LUNs (That >> are then re-joined to a single sotrage domain in oVirt) will enable >> different VMs to concurrently work on different disks. This should >> lower the per-vm storage latency. > > That would get rid of the drbd too, it's a totally different setup, from > scratch (no nfs either). =20 We can and should still use DRBD, just setup a device for each disk. But yeah, NFS should probably go away. (We are seeing dramatically better performance for iSCSI in integration-engine)

I don't understand then what you said about splitting the hardware raids, y= ou mean to setup one drdb device on top of each hard drive instead?=20 btw. I think that the nfs is used also for something more than just the eng= ine storage domain (just to keep it in mind that it has to be checked if we are going to get rid of it)

...

=20 > >> >> Looking at the storage machine I see strong indication it is IO bound >> - the load average is ~12 while there are just 1-5 working processes >> and the CPU is ~80% idle and the rest is IO wait. >> >> Running 'du *' at: >> /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/ima=

ges

...

>> one can see that most images are ~40G in size (that is _real_ 40G not >> sparse!). This means that despite having most VMs created based on >> templates, the VMs are full template copies rather then COW clones. > > That should not be like that, maybe the templates are wrongly configure=

d? or

...

> foreman images? =20 This is the expected behaviour when creating a VM from template in the oVirt admin UI. I thought Foreman might behave differently, but it seems it does not. =20 This behaviour is determined by the parameters you pass to the engine API when instantiating a VM, so it most probably doesn't have anything to do with the template configuration.

So maybe a misconfiguration in foreman?

...

=20 > >> What this means is that using pools (where all VMs are COW copies of >> the single pool template) is expected to significantly reduce the >> storage utilization and therefore the IO load on it (the less you >> store, the less you need to read back). > > That should happen too without pools, with normal qcow templates. =20 Not unless you create all the VMs via the API and pass the right parameters. Pools are the easiest way to ensure you never mess that up...

That was the idea

...

=20 > And in any case, that will not lower the normal io, when not actually > creating vms, as any read and write will still hit the disk anyhow, it > only alleviates the io when creating new vms. =20 Since you are reading the same bits over and over (for different VMs) you enable the various buffer caches along the way (in the storage machines and in the hypevirsors) to do what they are supposed to.

Once the vm is started, mostly all that's needed is on ram, so there are not that much reads from disk, unless you start writing down to it, and that's mostly what we are hitting, lots of writes.

...

=20 > The local disk (scratch disk) is the best option > imo, now and for the foreseeable future. =20 This is not an either/or thing, IMO we need to do both.

I think that it's way more useful, because it will solve our current issues faster and for longer, so IMO it should get more attention sooner. Any improvement that does not remove the current bottleneck, is not really giving any value to the overall infra (even if it might become valuable lat= er).

...

=20 --=20 Barak Korren bkorren(a)redhat.com RHEV-CI Team

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --qjXtncIm5b3tWrFJ Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--qjXtncIm5b3tWrFJ--

David Caro

10:17 a.m.

--iiZKCn1f/U0ES2iY Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/25 17:06, David Caro wrote:

...

On 05/25 16:09, Barak Korren wrote: > On 25 May 2016 at 14:52, David Caro <dcaro(a)redhat.com> wrote: > > On 05/25 14:42, Barak Korren wrote: > >> On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote: > >> > OK, > >> > I suggest to test using a VM with local disk (preferably on a host=

with SSD

...

> >> > configured), if its working, > >> > lets expedite moving all VMs or at least a large amount of VMs to =

it until

...

> >> > we see network load reduced. > >> > > >> > >> This is not that easy, oVirt doesn't support mixing local disk and > >> storage in the same cluster, so we will need to move hosts to a new > >> cluster for this. > >> Also we will lose the ability to use templates, or otherwise have to > >> create the templates on each and every disk. > >> > >> The scratch disk is a good solution for this, where you can have the > >> OS image on the central storage and the ephemeral data on the local > >> disk. > >> > >> WRT to the storage architecture - a single huge (10.9T) ext4 is used > >> as the FS on top of the DRBD, this is probably not the most efficient > >> thing one can do (XFS would probably have been better, RAW via iSCSI=

...

> >> even better). > > > > That was done >3 years ago, xfs was not quite stable and widely used =

and

...

> > supported back then. > > > AFAIK it pre-dates EXT4 =20 It does, but for el6, it was performing way poorly, and with more bugs (f=

...

what the reviews of it said at the time). =20 > in any case this does not detract from the > fact that the current configuration in not as efficient as we can make > it. >=20 =20 It does not, I agree to better focus on what we can do now on, now what s=

hould

...

have been done then. =20 >=20 > >> > >> I'm guessing that those 10/9TB are not made from a single disk but > >> with a hardware RAID of some sort. In this case deactivating the > >> hardware RAID and re-exposing it as multiple separate iSCSI LUNs (Th=

...

> >> are then re-joined to a single sotrage domain in oVirt) will enable > >> different VMs to concurrently work on different disks. This should > >> lower the per-vm storage latency. > > > > That would get rid of the drbd too, it's a totally different setup, f=

rom

...

> > scratch (no nfs either). >=20 > We can and should still use DRBD, just setup a device for each disk. > But yeah, NFS should probably go away. > (We are seeing dramatically better performance for iSCSI in > integration-engine) =20 I don't understand then what you said about splitting the hardware raids,=

you

...

mean to setup one drdb device on top of each hard drive instead?=20

Though I really think we should move to gluster/ceph instead for the jenkins vms, anyone knows what's the current status of the hyperconverge? That would allow us for better scalable distributed storage, and properly u= se the hosts local disks (we have more space on the combined hosts right now t= hat on the storage servers).

...

=20 =20 btw. I think that the nfs is used also for something more than just the e=

ngine

...

storage domain (just to keep it in mind that it has to be checked if we a=

...

going to get rid of it) =20 >=20 > > > >> > >> Looking at the storage machine I see strong indication it is IO bound > >> - the load average is ~12 while there are just 1-5 working processes > >> and the CPU is ~80% idle and the rest is IO wait. > >> > >> Running 'du *' at: > >> /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/i=

mages

...

> >> one can see that most images are ~40G in size (that is _real_ 40G not > >> sparse!). This means that despite having most VMs created based on > >> templates, the VMs are full template copies rather then COW clones. > > > > That should not be like that, maybe the templates are wrongly configu=

red? or

...

> > foreman images? >=20 > This is the expected behaviour when creating a VM from template in the > oVirt admin UI. I thought Foreman might behave differently, but it > seems it does not. >=20 > This behaviour is determined by the parameters you pass to the engine > API when instantiating a VM, so it most probably doesn't have anything > to do with the template configuration. =20 So maybe a misconfiguration in foreman? =20 >=20 > > > >> What this means is that using pools (where all VMs are COW copies of > >> the single pool template) is expected to significantly reduce the > >> storage utilization and therefore the IO load on it (the less you > >> store, the less you need to read back). > > > > That should happen too without pools, with normal qcow templates. >=20 > Not unless you create all the VMs via the API and pass the right > parameters. Pools are the easiest way to ensure you never mess that > up... =20 That was the idea =20 >=20 > > And in any case, that will not lower the normal io, when not actually > > creating vms, as any read and write will still hit the disk anyhow, it > > only alleviates the io when creating new vms. >=20 > Since you are reading the same bits over and over (for different VMs) > you enable the various buffer caches along the way (in the storage > machines and in the hypevirsors) to do what they are supposed to. =20 =20 Once the vm is started, mostly all that's needed is on ram, so there are =

not

...

that much reads from disk, unless you start writing down to it, and that's mostly what we are hitting, lots of writes. =20 >=20 > > The local disk (scratch disk) is the best option > > imo, now and for the foreseeable future. >=20 > This is not an either/or thing, IMO we need to do both. =20 I think that it's way more useful, because it will solve our current issu=

...

faster and for longer, so IMO it should get more attention sooner. =20 Any improvement that does not remove the current bottleneck, is not really giving any value to the overall infra (even if it might become valuable l=

ater).

...

=20 >=20 > --=20 > Barak Korren > bkorren(a)redhat.com > RHEV-CI Team =20 --=20 David Caro =20 Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D =20 Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --iiZKCn1f/U0ES2iY Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--iiZKCn1f/U0ES2iY--

Eyal Edri

10:21 a.m.

On Wed, May 25, 2016 at 6:17 PM, David Caro <dcaro(a)redhat.com> wrote:

...

On 05/25 17:06, David Caro wrote: > On 05/25 16:09, Barak Korren wrote: > > On 25 May 2016 at 14:52, David Caro <dcaro(a)redhat.com> wrote: > > > On 05/25 14:42, Barak Korren wrote: > > >> On 25 May 2016 at 12:44, Eyal Edri <eedri(a)redhat.com> wrote: > > >> > OK, > > >> > I suggest to test using a VM with local disk (preferably on a host with SSD > > >> > configured), if its working, > > >> > lets expedite moving all VMs or at least a large amount of VMs to it until > > >> > we see network load reduced. > > >> > > > >> > > >> This is not that easy, oVirt doesn't support mixing local disk and > > >> storage in the same cluster, so we will need to move hosts to a new > > >> cluster for this. > > >> Also we will lose the ability to use templates, or otherwise have to > > >> create the templates on each and every disk. > > >> > > >> The scratch disk is a good solution for this, where you can have the > > >> OS image on the central storage and the ephemeral data on the local > > >> disk. > > >> > > >> WRT to the storage architecture - a single huge (10.9T) ext4 is used > > >> as the FS on top of the DRBD, this is probably not the most efficient > > >> thing one can do (XFS would probably have been better, RAW via iSCSI - > > >> even better). > > > > > > That was done >3 years ago, xfs was not quite stable and widely used and > > > supported back then. > > > > > AFAIK it pre-dates EXT4 > > It does, but for el6, it was performing way poorly, and with more bugs (for > what the reviews of it said at the time). > > > in any case this does not detract from the > > fact that the current configuration in not as efficient as we can make > > it. > > > > It does not, I agree to better focus on what we can do now on, now what should > have been done then. > > > > > >> > > >> I'm guessing that those 10/9TB are not made from a single disk but > > >> with a hardware RAID of some sort. In this case deactivating the > > >> hardware RAID and re-exposing it as multiple separate iSCSI LUNs (That > > >> are then re-joined to a single sotrage domain in oVirt) will enable > > >> different VMs to concurrently work on different disks. This should > > >> lower the per-vm storage latency. > > > > > > That would get rid of the drbd too, it's a totally different setup, from > > > scratch (no nfs either). > > > > We can and should still use DRBD, just setup a device for each disk. > > But yeah, NFS should probably go away. > > (We are seeing dramatically better performance for iSCSI in > > integration-engine) > > I don't understand then what you said about splitting the hardware raids, you > mean to setup one drdb device on top of each hard drive instead? Though I really think we should move to gluster/ceph instead for the jenkins vms, anyone knows what's the current status of the hyperconverge?

Neither Gluster nor Hyper-converge I think is stable enough to move all production infra into. Hyperconverged is not supported yet for oVirt as well (might be a 4.x feature)

...

That would allow us for better scalable distributed storage, and properly use the hosts local disks (we have more space on the combined hosts right now that on the storage servers).

I agree a stable distributed storage solution is the way to go if we can find one :)

...

> > > btw. I think that the nfs is used also for something more than just the engine > storage domain (just to keep it in mind that it has to be checked if we are > going to get rid of it) > > > > > > > > >> > > >> Looking at the storage machine I see strong indication it is IO bound > > >> - the load average is ~12 while there are just 1-5 working processes > > >> and the CPU is ~80% idle and the rest is IO wait. > > >> > > >> Running 'du *' at: > > >> /srv/ovirt_storage/jenkins-dc/658e5b87-1207-4226-9fcc-4e5fa02b86b4/images > > >> one can see that most images are ~40G in size (that is _real_ 40G not > > >> sparse!). This means that despite having most VMs created based on > > >> templates, the VMs are full template copies rather then COW clones. > > > > > > That should not be like that, maybe the templates are wrongly configured? or > > > foreman images? > > > > This is the expected behaviour when creating a VM from template in the > > oVirt admin UI. I thought Foreman might behave differently, but it > > seems it does not. > > > > This behaviour is determined by the parameters you pass to the engine > > API when instantiating a VM, so it most probably doesn't have anything > > to do with the template configuration. > > So maybe a misconfiguration in foreman? > > > > > > > > >> What this means is that using pools (where all VMs are COW copies of > > >> the single pool template) is expected to significantly reduce the > > >> storage utilization and therefore the IO load on it (the less you > > >> store, the less you need to read back). > > > > > > That should happen too without pools, with normal qcow templates. > > > > Not unless you create all the VMs via the API and pass the right > > parameters. Pools are the easiest way to ensure you never mess that > > up... > > That was the idea > > > > > > And in any case, that will not lower the normal io, when not actually > > > creating vms, as any read and write will still hit the disk anyhow, it > > > only alleviates the io when creating new vms. > > > > Since you are reading the same bits over and over (for different VMs) > > you enable the various buffer caches along the way (in the storage > > machines and in the hypevirsors) to do what they are supposed to. > > > Once the vm is started, mostly all that's needed is on ram, so there are not > that much reads from disk, unless you start writing down to it, and that's > mostly what we are hitting, lots of writes. > > > > > > The local disk (scratch disk) is the best option > > > imo, now and for the foreseeable future. > > > > This is not an either/or thing, IMO we need to do both. > > I think that it's way more useful, because it will solve our current issues > faster and for longer, so IMO it should get more attention sooner. > > Any improvement that does not remove the current bottleneck, is not really > giving any value to the overall infra (even if it might become valuable later). > > > > > -- > > Barak Korren > > bkorren(a)redhat.com > > RHEV-CI Team > > -- > David Caro > > Red Hat S.L. > Continuous Integration Engineer - EMEA ENG Virtualization R&D > > Tel.: +420 532 294 605 > Email: dcaro(a)redhat.com > IRC: dcaro|dcaroest@{freenode|oftc|redhat} > Web: www.redhat.com > RHT Global #: 82-62605 -- David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605

-- Eyal Edri Associate Manager RHEV DevOps EMEA ENG Virtualization R&D Red Hat Israel phone: +972-9-7692018 irc: eedri (on #tlv #rhev-dev #rhev-integ)

Barak Korren

Thursday, 26 May Thu, 26 May

2:20 a.m.

...

I agree a stable distributed storage solution is the way to go if we can find one :)

Distributed storages usually suffer from a large overhead because: 1. They try to be resilient to node failure, which means keeping two or more copies of the same file, which results in I/O overhead. 2. They need to coordinate metadata access for large amounts of files. Bottlenecks in the metadata management system are a common issue for distributes FS storages. Since most of our data is ephemeral anyway I don't think we need to pay this overhead. -- Barak Korren bkorren(a)redhat.com RHEV-CI Team

David Caro

2:49 a.m.

--+VeMz1SRxFChf6vr Content-Type: text/plain; charset=us-ascii Content-Disposition: inline Content-Transfer-Encoding: quoted-printable On 05/26 10:20, Barak Korren wrote:

...

> > > I agree a stable distributed storage solution is the way to go if we can > find one :) > =20 Distributed storages usually suffer from a large overhead because: 1. They try to be resilient to node failure, which means keeping two or more copies of the same file, which results in I/O overhead. 2. They need to coordinate metadata access for large amounts of files. Bottlenecks in the metadata management system are a common issue for distributes FS storages. =20 Since most of our data is ephemeral anyway I don't think we need to pay this overhead.

The solution for our current temporary ephemeral data would be for each node to create the vms locally, that's the scratch disks solution we started wit= h. The distributed storage would be used to store the jenkins machines templat= es, that mostly would be read by the hosts, and thus, properly cached locally w= ith a low miss rate (as they don't usually change). To actually not use at all = the central storage, whose extra levels of redundancy are only useful for more critical data (aka production datacenter machines).

...

=20 =20 --=20 Barak Korren bkorren(a)redhat.com RHEV-CI Team

--=20 David Caro Red Hat S.L. Continuous Integration Engineer - EMEA ENG Virtualization R&D Tel.: +420 532 294 605 Email: dcaro(a)redhat.com IRC: dcaro|dcaroest@{freenode|oftc|redhat} Web: www.redhat.com RHT Global #: 82-62605 --+VeMz1SRxFChf6vr Content-Type: application/pgp-signature; name="signature.asc"

...PGP SIGNATURE...

--+VeMz1SRxFChf6vr--

3322

days inactive

3324

days old

infra@ovirt.org

Manage subscription

22 comments

6 participants

tags (0)

participants (6)

Barak Korren
David Caro
Evgheni Dereveanchin
Eyal Edri
Fabian Deutsch
Sandro Bonazzola

2025

2024

2023

2022

2021

2020

2019

2018

2017

2016

2015

2014

2013

2012

2011

ngn build jobs take more than twice (x) as long as in the last days