
This is a multi-part message in MIME format. ------------MIME-3653757893-462597306-delim Content-Type: text/plain; charset="windows-1252" Content-Transfer-Encoding: quoted-printable On 07/29/2015 12=3A12 PM=2C NUNIN Roberto wrote=3A =3E=3E -----Messaggio originale----- =3E=3E Da=3A Michael S=2E Tsirkin =5Bmailto=3Amst=40redhat=2Ecom=5D =3E=3E Inviato=3A mercoled=EC=20=329 luglio 2015 12=3A03 =3E=3E A=3A NUNIN Roberto =3E=3E Cc=3A Fabian Deutsch=3B users=40ovirt=2Eorg =3E=3E Oggetto=3A Re=3A R=3A =5Bovirt-users=5D R=3A R=3A R=3A R=3A R=3A R= =3A PXE boot of a VM on vdsm don=27t =3E=3E read DHCP offer =3E=3E =3E=3E On Wed=2C Jul 29=2C 2015 at 12=3A00=3A38PM +0200=2C NUNIN Roberto wr= ote=3A =3E=3E=3E=3E -----Messaggio originale----- =3E=3E=3E=3E Da=3A users-bounces=40ovirt=2Eorg =5Bmailto=3Ausers-bounces=40= ovirt=2Eorg=5D Per conto =3E=3E di =3E=3E=3E=3E Michael S=2E Tsirkin =3E=3E=3E=3E Inviato=3A gioved=EC=20=39 luglio 2015 15=3A15 =3E=3E=3E=3E A=3A Fabian Deutsch =3E=3E=3E=3E Cc=3A users=40ovirt=2Eorg =3E=3E=3E=3E Oggetto=3A Re=3A =5Bovirt-users=5D R=3A R=3A R=3A R=3A R=3A R= =3A PXE boot of a VM on vdsm don=27t =3E=3E read =3E=3E=3E=3E DHCP offer =3E=3E=3E=3E =3E=3E=3E=3E On Thu=2C Jul 09=2C 2015 at 08=3A57=3A50AM -0400=2C Fabian Deu= tsch wrote=3A =3E=3E=3E=3E=3E ----- Original Message ----- =3E=3E=3E=3E=3E=3E On Wed=2C Jul 08=2C 2015 at 09=3A11=3A42AM +0300=2C Mich= ael S=2E Tsirkin wrote=3A =3E=3E=3E=3E=3E=3E=3E On Tue=2C Jul 07=2C 2015 at 05=3A13=3A28PM +0100=2C D= an Kenigsberg wrote=3A =3E=3E=3E=3E=3E=3E=3E=3E On Tue=2C Jul 07=2C 2015 at 10=3A14=3A54AM +0200= =2C NUNIN Roberto wrote=3A =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E On Mon=2C Jul 06=2C 2015 at 10=3A33=3A59AM += 0200=2C NUNIN Roberto =3E=3E wrote=3A =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E Hi Dan =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E Sorry for question=3A what do you mean fo= r interface vnetxxxx =3F =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E Currently our path is =3A =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E eno1 - eno2 ---- bond0 ----- bond=2E3500= =28VLAN=29 ------ bridge ----- =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E vm=2E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E Which one of these =3F =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E Moreover=2C reading Fabian statements abo= ut bonding limits=2C =3E=3E=3E=3E today I =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E=3E can try =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E to switch to a config without bonding=2E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =22vm=22 is a complicated term=2E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E =60brctl show=60 would not show you a =22vm= =22 connected to a bridge=2E =3E=3E=3E=3E When =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E you =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E WOULD see is a vnet888 tap device=2E The=20= =22other side=22 of this =3E=3E device =3E=3E=3E=3E is =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E held by qemu=2C which implement the VM=2E =3E=3E=3E=3E=3E=3E=3E=3E=3E Ok=2C understood and found it=2C vnet2 =3E=3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E=3E=3E I=27m asking if the dhcp offer has reached t= hat tap device=2E =3E=3E=3E=3E=3E=3E=3E=3E=3E No=2C the DHCP offer packet do not reach the vn= et2 interface=2C I can =3E=3E see =3E=3E=3E=3E=3E=3E=3E=3E=3E only DHCP DISCOVER=2E =3E=3E=3E=3E=3E=3E=3E=3E Ok=2C so it seems that we have a problem in the ho= st bridging=2E =3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E Is it the latest kernel-3=2E10=2E0-229=2E7=2E2=2Ee= l7=2Ex86=5F64 =3F =3E=3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E=3E Michael=2C a DHCP DISCOVER is sent out of a just-b= ooted guest=2C and =3E=3E=3E=3E OFFER =3E=3E=3E=3E=3E=3E=3E=3E returns to the bridge=2C but is not propagated to= the tap device=2E =3E=3E=3E=3E=3E=3E=3E=3E Can you suggest how to debug this further=3F =3E=3E=3E=3E=3E=3E=3E Dump packets including the ethernet headers=2E =3E=3E=3E=3E=3E=3E=3E Likely something interfered with them so the eth addr= ess is wrong=2E =3E=3E=3E=3E=3E=3E=3E =3E=3E=3E=3E=3E=3E=3E Since bonding does this sometimes=2C this is the most= likely culprit=2E =3E=3E=3E=3E=3E=3E We=27ve ruled this out already - Roberto reproduces the= issue without a =3E=3E=3E=3E=3E=3E bond=2E =3E=3E=3E=3E=3E To me this looks like either a regression in the host side= bridging=2E But =3E=3E otoh it =3E=3E=3E=3E doesn=27t look =3E=3E=3E=3E=3E like it=27s happening always=2C because otherwise I=27d exp= ect more noise =3E=3E around =3E=3E=3E=3E this issue=2E =3E=3E=3E=3E=3E - fabian =3E=3E=3E=3E Hard to say=2E E=2Eg=2E forwarding delay would do this for a w= hile=2E =3E=3E=3E=3E If eth address of the packets is okay=2C poke at the fbd=2C ma= ybe there=27s =3E=3E=3E=3E something wrong there=2E Maybe stp is detecting a loop - try c= hecking that=2E =3E=3E=3E Someone is checking this =3F =3E=3E=3E In tested config SPT was off=2E =3E=3E Then maybe you have a loop =3A=29 =3E That was already checked=2C the MAC was unique in the VLAN=2E =3E =3E RN =3E Did you also try a reboot of the VM=3F We have the same issue with foreman= and both Libvirt and oVirt=2E On second boot PXE boots properly from DHCP= =2E Haven=27t had the time to investigate yet so we=27re using mostly image based provisioning on oVirt at the moment=2E Met vriendelijke groet=2C With kind regards=2C Jorick Astrego Netbulae Virtualization Experts=20 ---------------- =09Tel=3A 053 20 30 270 =09info=40netbulae=2Eeu =09Staalsteden 4-3A =09KvK= 08198180 =09Fax=3A 053 20 30 271 =09www=2Enetbulae=2Eeu =097547 TA Enschede =09BTW= NL821234584B01 ---------------- ------------MIME-3653757893-462597306-delim Content-Type: text/html; charset="utf-8" Content-Transfer-Encoding: quoted-printable =3Chtml=3E =3Cbody=3E <br> <br> On 07/29/2015 12:12 PM, NUNIN Roberto wrote: <br> <font color=3D"#000000">>> -----Messaggio originale----- </font><= br> <font color=3D"#000000">>> Da: Michael S. Tsirkin [mailto:mst@<a href= =3D"mailto:redhat.com">redhat.com</a>] </font><br> <font color=3D"#000000">>> Inviato: mercoledì 29 luglio 2015 12:= 03 </font><br> <font color=3D"#000000">>> A: NUNIN Roberto </font><br> <font color=3D"#000000">>> Cc: Fabian Deutsch; users@<a href=3D"mailt= o:ovirt.org">ovirt.org</a> </font><br> <font color=3D"#000000">>> Oggetto: Re: R: [ovirt-users] R: R: R: R: = R: R: PXE boot of a VM on vdsm don't </font><br> <font color=3D"#000000">>> read DHCP offer </font><br> <font color=3D"#000000">>> </font><br> <font color=3D"#000000">>> On Wed, Jul 29, 2015 at 12:00:38PM +0200, = NUNIN Roberto wrote: </font><br> <font color=3D"#000000">>>>> -----Messaggio originale----- = </font><br> <font color=3D"#000000">>>>> Da: users-bounces@<a href=3D"mailt= o:ovirt.org">ovirt.org</a> [mailto:users-bounces@<a href=3D"mailto:ovi= rt.org">ovirt.org</a>] Per conto </font><br> <font color=3D"#000000">>> di </font><br> <font color=3D"#000000">>>>> Michael S. Tsirkin </font><br> <font color=3D"#000000">>>>> Inviato: giovedì 9 luglio 201= 5 15:15 </font><br> <font color=3D"#000000">>>>> A: Fabian Deutsch </font><br> <font color=3D"#000000">>>>> Cc: users@<a href=3D"mailto:ovirt.= org">ovirt.org</a> </font><br> <font color=3D"#000000">>>>> Oggetto: Re: [ovirt-users] R: R: R= : R: R: R: PXE boot of a VM on vdsm don't </font><br> <font color=3D"#000000">>> read </font><br> <font color=3D"#000000">>>>> DHCP offer </font><br> <font color=3D"#000000">>>>> </font><br> <font color=3D"#000000">>>>> On Thu, Jul 09, 2015 at 08:57:50AM= -0400, Fabian Deutsch wrote: </font><br> <font color=3D"#000000">>>>>> ----- Original Message -----= 13;</font><br> <font color=3D"#000000">>>>>>> On Wed, Jul 08, 2015 at 09= :11:42AM +0300, Michael S. Tsirkin wrote: </font><br> <font color=3D"#000000">>>>>>>> On Tue, Jul 07, 2015 a= t 05:13:28PM +0100, Dan Kenigsberg wrote: </font><br> <font color=3D"#000000">>>>>>>>> On Tue, Jul 07, 20= 15 at 10:14:54AM +0200, NUNIN Roberto wrote: </font><br> <font color=3D"#000000">>>>>>>>>>> On Mon, Ju= l 06, 2015 at 10:33:59AM +0200, NUNIN Roberto </font><br> <font color=3D"#000000">>> wrote: </font><br> <font color=3D"#000000">>>>>>>>>>>> Hi Dan= </font><br> <font color=3D"#000000">>>>>>>>>>>> </= font><br> <font color=3D"#000000">>>>>>>>>>>> Sorry = for question: what do you mean for interface vnetxxxx ? </font><br> <font color=3D"#000000">>>>>>>>>>>> Curren= tly our path is : </font><br> <font color=3D"#000000">>>>>>>>>>>> eno1 -= eno2 ---- bond0 ----- bond.3500 (VLAN) ------ bridge ----- </fon= t><br> <font color=3D"#000000">>>>>>>>>>>> vm.= 3;</font><br> <font color=3D"#000000">>>>>>>>>>>> </= font><br> <font color=3D"#000000">>>>>>>>>>>> Which = one of these ? </font><br> <font color=3D"#000000">>>>>>>>>>>> Moreov= er, reading Fabian statements about bonding limits, </font><br> <font color=3D"#000000">>>>> today I </font><br> <font color=3D"#000000">>>>>>>>>>>> can tr= y </font><br> <font color=3D"#000000">>>>>>>>>>> to switch = to a config without bonding. </font><br> <font color=3D"#000000">>>>>>>>>>> </font=
<br> <font color=3D"#000000">>>>>>>>>>> "vm&q= uot; is a complicated term. </font><br> <font color=3D"#000000">>>>>>>>>>> </font= <br> <font color=3D"#000000">>>>>>>>>>> `brctl sho= w` would not show you a "vm" connected to a bridge. </font><b= r> <font color=3D"#000000">>>>> When </font><br> <font color=3D"#000000">>>>>>>>>>> you </= font><br> <font color=3D"#000000">>>>>>>>>>> WOULD see = is a vnet888 tap device. The "other side" of this </font><br> <font color=3D"#000000">>> device </font><br> <font color=3D"#000000">>>>> is </font><br> <font color=3D"#000000">>>>>>>>>>> held by qe= mu, which implement the VM. </font><br> <font color=3D"#000000">>>>>>>>>> Ok, understood= and found it, vnet2 </font><br> <font color=3D"#000000">>>>>>>>>> </font><br=
<font color=3D"#000000">>>>>>>>>>> I'm asking= if the dhcp offer has reached that tap device. </font><br> <font color=3D"#000000">>>>>>>>>> No, the DHCP o= ffer packet do not reach the vnet2 interface, I can </font><br> <font color=3D"#000000">>> see </font><br> <font color=3D"#000000">>>>>>>>>> only DHCP DISC= OVER. </font><br> <font color=3D"#000000">>>>>>>>> Ok, so it seems th= at we have a problem in the host bridging. </font><br> <font color=3D"#000000">>>>>>>>> </font><br> <font color=3D"#000000">>>>>>>>> Is it the latest k= ernel-3.10.0-229.7.2.el7.x86_64 ? </font><br> <font color=3D"#000000">>>>>>>>> </font><br> <font color=3D"#000000">>>>>>>>> Michael, a DHCP DI= SCOVER is sent out of a just-booted guest, and </font><br> <font color=3D"#000000">>>>> OFFER </font><br> <font color=3D"#000000">>>>>>>>> returns to the bri= dge, but is not propagated to the tap device. </font><br> <font color=3D"#000000">>>>>>>>> Can you suggest ho= w to debug this further? </font><br> <font color=3D"#000000">>>>>>>> Dump packets including= the ethernet headers. </font><br> <font color=3D"#000000">>>>>>>> Likely something inter= fered with them so the eth address is wrong. </font><br> <font color=3D"#000000">>>>>>>> </font><br> <font color=3D"#000000">>>>>>>> Since bonding does thi= s sometimes, this is the most likely culprit. </font><br> <font color=3D"#000000">>>>>>> We've ruled this out alrea= dy - Roberto reproduces the issue without a </font><br> <font color=3D"#000000">>>>>>> bond. </font><br> <font color=3D"#000000">>>>>> To me this looks like either a= regression in the host side bridging. But </font><br> <font color=3D"#000000">>> otoh it </font><br> <font color=3D"#000000">>>>> doesn't look </font><br> <font color=3D"#000000">>>>>> like it's happening always, be= cause otherwise I'd expect more noise </font><br> <font color=3D"#000000">>> around </font><br> <font color=3D"#000000">>>>> this issue. </font><br> <font color=3D"#000000">>>>>> - fabian </font><br> <font color=3D"#000000">>>>> Hard to say. E.g. forwarding delay= would do this for a while. </font><br> <font color=3D"#000000">>>>> If eth address of the packets is o= kay, poke at the fbd, maybe there's </font><br> <font color=3D"#000000">>>>> something wrong there. Maybe stp i= s detecting a loop - try checking that. </font><br> <font color=3D"#000000">>>> Someone is checking this ? </font>= <br> <font color=3D"#000000">>>> In tested config SPT was off. </fo= nt><br> <font color=3D"#000000">>> Then maybe you have a loop :) </font><= br> <font color=3D"#000000">> That was already checked, the MAC was unique i= n the VLAN. </font><br> <font color=3D"#000000">> </font><br> <font color=3D"#000000">> RN </font><br> <font color=3D"#000000">> </font><br> <br> Did you also try a reboot of the VM? We have the same issue with foreman= 3;<br> and both Libvirt and oVirt. On second boot PXE boots properly from DHCP.= 3;<br> <br> Haven't had the time to investigate yet so we're using mostly image <br=
based provisioning on oVirt at the moment. <br> <br> <br> = =3CBR /=3E =3CBR /=3E =3Cb style=3D=22color=3A=23604c78=22=3E=3C/b=3E=3Cbr=3E=3Cspan style=3D=22c= olor=3A=23604c78=3B=22=3E=3Cfont color=3D=22000000=22=3E=3Cspan style=3D=22= mso-fareast-language=3Aen-gb=3B=22 lang=3D=22NL=22=3EMet vriendelijke groet= =2C With kind regards=2C=3Cbr=3E=3Cbr=3E=3C/span=3EJorick Astrego=3C/font= =3E=3C/span=3E=3Cb style=3D=22color=3A=23604c78=22=3E=3Cbr=3E=3Cbr=3ENetbul= ae Virtualization Experts =3C/b=3E=3Cbr=3E=3Chr style=3D=22border=3Anone=3B= border-top=3A1px solid =23ccc=3B=22=3E=3Ctable style=3D=22width=3A 522px=22= =3E=3Ctbody=3E=3Ctr=3E=3Ctd style=3D=22width=3A 130px=3Bfont-size=3A 10px= =22=3ETel=3A 053 20 30 270=3C/td=3E =3Ctd style=3D=22width=3A 130px=3Bf= ont-size=3A 10px=22=3Einfo=40netbulae=2Eeu=3C/td=3E =3Ctd style=3D=22wid= th=3A 130px=3Bfont-size=3A 10px=22=3EStaalsteden 4-3A=3C/td=3E =3Ctd sty= le=3D=22width=3A 130px=3Bfont-size=3A 10px=22=3EKvK 08198180=3C/td=3E=3C/tr= =3E=3Ctr=3E =3Ctd style=3D=22width=3A 130px=3Bfont-size=3A 10px=22=3EFax= =3A 053 20 30 271=3C/td=3E =3Ctd style=3D=22width=3A 130px=3Bfont-size= =3A 10px=22=3Ewww=2Enetbulae=2Eeu=3C/td=3E =3Ctd style=3D=22width=3A 130= px=3Bfont-size=3A 10px=22=3E7547 TA Enschede=3C/td=3E =3Ctd style=3D=22w= idth=3A 130px=3Bfont-size=3A 10px=22=3EBTW NL821234584B01=3C/td=3E=3C/tr=3E= =3C/tbody=3E=3C/table=3E=3Cbr=3E=3Chr style=3D=22border=3Anone=3Bborder-top= =3A1px solid =23ccc=3B=22=3E=3CBR /=3E =3C/body=3E =3C/html=3E ------------MIME-3653757893-462597306-delim--