Infra issu retrospective
Eyal Edri
eedri at redhat.com
Wed Jan 22 16:14:20 UTC 2014
one more pending item: upgrading jenkins to latest LTS + updating jenkins plugins.
seems like lots of issues are fixed and we should upgrade.
eyal.
----- Original Message -----
> From: "Eyal Edri" <eedri at redhat.com>
> To: "R P Herrold" <herrold at owlriver.com>, "Kiril Nesenko" <knesenko at redhat.com>, "David Caro Estevez"
> <dcaroest at redhat.com>
> Cc: "oVirt infrastructure ML" <infra at ovirt.org>
> Sent: Wednesday, January 22, 2014 6:05:29 PM
> Subject: Re: Infra issu retrospective
>
>
>
> ----- Original Message -----
> > From: "R P Herrold" <herrold at owlriver.com>
> > To: "oVirt infrastructure ML" <infra at ovirt.org>
> > Sent: Wednesday, January 22, 2014 5:35:31 PM
> > Subject: Infra issu retrospective
> >
> >
> > for the weekly sync, I see the following matters
> >
> > I was absent Monday for an appt, and do not see an email with
> > minutes.
>
> i was absent as well, but i think there was a meeting held,
> maybe summary wasn't sent, kiril/dcaro?
>
> > Prior week was skipped becuase of member availability issues
> > as well.
> > So this is a summary from the list traffic for the last few
> > days
> >
> > In no particular order:
> > - Kimchi asks for jenkins coverage #105
>
> you mixed up 2 requests:
> #105 Jenkins server for oVirt Kimchi incubator project
> personally, i'm not familiar with Kimchi, but considering our very limited
> resources now on ovirt
> (both physical resources like servers/storage/etc... and especially human
> resources which right now is
> mostly dcaro handling multiple failures on infra issues on jenkins).
>
> but if they are willing to pitch in with resources such as hosts and people
> to support jenkins failures,
> we can consider integrating them into jenkins.ovirt.org, otherwise i think we
> can mostly give support
> in knowledge
>
> #107 Enable coverage report during vdsm unit and functional tests,
> again, will be handles after most issues will be resolved with infra,
> unless someone from the vdsm team power users is ready to take this on.
>
> > - ditto standing up an Ubuntu test instance was requested
>
> at first, a minidell was thought to be added, but due to the lack of
> resources
> for running findbugs/other per patch tests it was decided to allocate it to
> fedora/centos for now.
> we can reinstall on of the rackspace vms for that.
>
> >
> > - Disk space issues on lists were hit on a transient basis
> > Sunday
>
> this is a well known hurting issue, i think we should address that ASAP,
> either buy purchasing a storage server running SSD's from softlayer or
> expanding the 50GB disk we have now
> on linode (how much it costs to add a 100-200 GB disk there?)
>
> >
> > - I have observed wink outages on gerrit, and lists of less
> > than an hour's duration
>
> gerrit is a major issue which we suffer almost on a daily basis, not sure if
> it's from tlv slow network
> or the VM itself needs to migrate to a much more strong infra with
> high-availability feature.
>
> >
> > - linnode PTR and it turns out A and AAAA record have not
> > proceded, as the request was being 'sat on'
> > This is really needed to solve an email filtering issue at
> > Comcast, and one assumes other ISPs, They also examine this data,
> > along with _SPF TXT records.
> >
> > DNS management is weak as responsibility and capability to
> > solve are not unified here
> >
> > - the gerrit is sluggish for unknown reasons doing version
> > control CO's (several reports)
> > ... possible BW limitations on some link paths?
>
> possible post the upgrade to 2.8?
>
> >
> > - jenkins got a 'just in case' reboot last week, but no root
> > cause analysis was performed
>
> i belive i know the reason for the reboot, which was jobs being stuck and
> running for hours.
> this issue was solved by dcaro for finding root cause on findbugs job running
> for more than one hour
> due to an option enabled on the job, comparing to older builds results,
> disabling that reduced time to 15 min.
>
> >
> > Personally I am building a 'knock-off' iscsi and NFS unit,
> > based on the QNAP doco and git content, for oVirt testing
> > locally, ... particularly performance timing trials
>
>
> all in all, we're suffering from a limited infra on jenkins due to slow
> network connection i belive to tlv office (mini dells)
> and high load on rackspace servers (which we are schedule to migrate from).
> decision on migration to softlayer is on halt due to limited budget and
> consideration on the optimal layout,
> i'll try to bring up a suggestion on the next meeting.
>
> Eyal.
>
> >
> > --
> > --
> > end
> > ==================================
> > .-- -... ---.. ... -.- -.--
> > Copyright (C) 2014 R P Herrold
> > herrold at owlriver.com
> > My words are not deathless prose,
> > but they are mine.
> > _______________________________________________
> > Infra mailing list
> > Infra at ovirt.org
> > http://lists.ovirt.org/mailman/listinfo/infra
> >
> _______________________________________________
> Infra mailing list
> Infra at ovirt.org
> http://lists.ovirt.org/mailman/listinfo/infra
>
More information about the Infra
mailing list