Re: [ovirt-users] Hosted engine deployment error

I made another try with Cockpit, it is the same.</span></div>=0A<div>&n= bsp;</div>=0A<div><span style=3D"font-family: arial, helvetica, sans-ser= if; font-size: 10pt; color: #000000;">Am I doing something wrong or is t= here a bug ?</span></div>=0A</blockquote>=0A<div> </div>=0A<div>I s= uppose that your host was condifured with DHCP, if so it's this one:</di= v>=0A<div><a href=3D"https://bugzilla.redhat.com/1549642" target=3D"_bla= nk" rel=3D"noreferrer noopener">https://bugzilla.redhat.com/1549642</a><= /div>=0A<div> </div>=0A<div>The fix will come with 4.2.2.</div>=0A<=
--=_3724be82c6c86ae6590b11a116c8d205 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Hi,=0A In fact it is a workaround coming from you I found in the b= ugtrack that helped me : =0A=0Achmod 644 /var/cache/vdsm/schema/= * =0A=0AAs the only thing looking like a weird error I have found was= : =0A=0AERROR Exception raised#012Traceback (most recent call last)= :#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, i= n run#012 serve_clients(log)#012 File "/usr/lib/python2.7/site-packages/= vdsm/vdsmd.py", line 103, in serve_clients#012 cif =3D clientIF.getInsta= nce(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm= /clientIF.py", line 250, in getInstance#012 cls._instance =3D clientIF(i= rs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clie= ntIF.py", line 144, in __init__#012 self._prepareJSONRPCServer()#012 Fil= e "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 307, in _pre= pareJSONRPCServer#012 bridge =3D Bridge.DynamicBridge()#012 File "/usr/l= ib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 67, in __init__#012= self._schema =3D vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/= lib/python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#= 012 raise SchemaNotFound("Unable to find API schema file")#012SchemaNotF= ound: Unable to find API schema file So I can go one step futher, bu= t the installation still fails in the end, with file permission problems= in datastore files (i chose NFS 4.1). I can't indeed touch or get infor= mations even logged in root. But I can create and delete files in the sa= me directory. Is there a workaround for this too ? Regards =0A=0A Le= 19-Mar-2018 17:48:41 +0100, stirabos@redhat.com a crit: =0A=0A On= Mon, Mar 19, 2018 at 4:56 PM, wrote:=0A=0A Hi,=0A I wanted to rebuil= d a new hosted engine setup, as the old was corrupted (too much violent= poweroff !) So the server was not reinstalled, I just runned "ovirt-h= osted-engine-cleanup". The network setup generated by vdsm seems to be s= till in place, so I haven't changed anything there. Then I decided to= update the packages to the latest versions avaible, rebooted the server= and run "ovirt-hosted-engine-setup". But the process never succeeds,= as I get an error after a long time spent in "[ INFO ] TASK [Wait for t= he host to be up]" [ ERROR ] fatal: [localhost]: FAILED! =3D> {"ansi= ble_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad.pfm.loc"= , "affinity_labels": [], "auto_numa_status": "unknown", "certificate": {= "organization": "pfm.loc", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv-virt-1.p= fm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/clusters/d6c9358= e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-00163e152= 701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "device_pas= sthrough": {"enabled": false}, "devices": [], "external_network_provider= _configurations": [], "external_status": "ok", "hardware_information": {= "supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engine/api/ho= sts/542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85-4398-940= 2-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown", "ksm":= {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name": "p= fm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics": [], "n= uma_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdline":= ""}, "permissions": [], "port": 54321, "power_management": {"automatic_= pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_proxie= s": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority": 5, "st= atus": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosXzaxCRnuIY= cOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "non_respo= nsive", "storage_connection_extensions": [], "summary": {"total": 0}, "t= ags": [], "transparent_huge_pages": {"enabled": false}, "type": "rhel",= "unmanaged_networks": [], "update_available": false}]}, "attempts": 120= , "changed": false}=0A[ INFO ] TASK [Remove local vm dir]=0A[ INFO ] TAS= K [Notify the user about a failure]=0A[ ERROR ] fatal: [localhost]: FAIL= ED! =3D> {"changed": false, "msg": "The system may not be provisioned ac= cording to the playbook results: please check the logs for the issue, fi= x accordingly or re-deploy from scratch.n"} I made another try with= Cockpit, it is the same. Am I doing something wrong or is there a bug= ? I suppose that your host was condifured with DHCP, if so it's this= one: https://bugzilla.redhat.com/1549642 The fix will come with 4.2.2= . =0A Regards =0A=0A--------------------------------------------= -----------------------------------------------------=0AFreeMail powered= by mail.fr =0A_______________________________________________=0A Users= mailing list=0AUsers@ovirt.org=0Ahttp://lists.ovirt.org/mailman/listinf= o/users=0A=0A-----------------------------------------------------------= --------------------------------------=0AFreeMail powered by mail.fr --=_3724be82c6c86ae6590b11a116c8d205 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <div><span style=3D"font-family: arial, helvetica,sans-serif; font-size:= 10pt; color: #000000;"> </span></div>=0A<div><span style=3D"font-f= amily: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">H= i,<br /></span></div>=0A<div> </div>=0A<div> </div>=0A<div>&nb= sp;</div>=0A<div><span style=3D"font-family: arial, helvetica, sans-seri= f; font-size: 10pt; color: #000000;">In fact it is a workaround coming f= rom you I found in the bugtrack that helped me : </span></div>=0A<div>&n= bsp;</div>=0A<div> </div>=0A<div> </div>=0A<div>=0A<pre id=3D"= comment_text_8" class=3D"bz_comment_text bz_wrap_comment_text">chmod 644= /var/cache/vdsm/schema/*</pre>=0A</div>=0A<div> </div>=0A<p>As the= only thing looking like a weird error I have found was :</p>=0A<div>&nb= sp;</div>=0A<div> </div>=0A<p>ERROR Exception raised#012Traceback (= most recent call last):#012 File "/usr/lib/python2.7/site-packages= /vdsm/vdsmd.py", line 156, in run#012 serve_clients(lo= g)#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line= 103, in serve_clients#012 cif =3D clientIF.getInstanc= e(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/= vdsm/clientIF.py", line 250, in getInstance#012 cls._i= nstance =3D clientIF(irs, log, scheduler)#012 File "/usr/lib/pytho= n2.7/site-packages/vdsm/clientIF.py", line 144, in __init__#012 &nb= sp; self._prepareJSONRPCServer()#012 File "/usr/lib/python2.= 7/site-packages/vdsm/clientIF.py", line 307, in _prepareJSONRPCServer#01= 2 bridge =3D Bridge.DynamicBridge()#012 File "/u= sr/lib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 67, in __init__= #012 self._schema =3D vdsmapi.Schema(paths, api_strict= _mode)#012 File "/usr/lib/python2.7/site-packages/vdsm/api/vdsmapi= .py", line 217, in __init__#012 raise SchemaNotFound("= Unable to find API schema file")#012SchemaNotFound: Unable to find API s= chema file</p>=0A<div> </div>=0A<div> </div>=0A<div>So I can g= o one step futher, but the installation still fails in the end, with fil= e permission problems in datastore files (i chose NFS 4.1). I can't inde= ed touch or get informations even logged in root. But I can create and d= elete files in the same directory.</div>=0A<div> </div>=0A<div>Is t= here a workaround for this too ?</div>=0A<div> </div>=0A<div>Regard= s</div>=0A<p><br /><br /> Le 19-Mar-2018 17:48:41 +0100, stirabos@redhat= .com a écrit:</p>=0A<div> </div>=0A<div> </div>=0A<div>= </div>=0A<blockquote style=3D"margin-left: 0; padding-left: 5px; b= order-left: 2px solid #000080;">=0A<div dir=3D"ltr"><br />=0A<div class= =3D"gmail_extra"><br />=0A<div class=3D"gmail_quote">On Mon, Mar 19, 201= 8 at 4:56 PM, <span dir=3D"ltr"><<a href=3D"mailto:spfma.tech@e.mail.= fr" target=3D"_blank" rel=3D"noreferrer noopener">spfma.tech@e.mail.fr</= a>></span> wrote:<br />=0A<blockquote class=3D"gmail_quote" style=3D"= margin: 0px 0px 0px .8ex; border-left: 1px solid #cccccc; padding-left:= 1ex;">=0A<div><span style=3D"font-family: arial, helvetica, sans-serif;= font-size: 10pt; color: #000000;">Hi,<br /></span></div>=0A<div> <= /div>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif; f= ont-size: 10pt; color: #000000;">I wanted to rebuild a new hosted engine= setup, as the old was corrupted (too much violent poweroff !)</span></d= iv>=0A<div> </div>=0A<div><span style=3D"font-family: arial, helvet= ica, sans-serif; font-size: 10pt; color: #000000;">So the server was not= reinstalled, I just runned "ovirt-hosted-engine-cleanup". The network s= etup generated by vdsm seems to be still in place, so I haven't changed= anything there.</span></div>=0A<div> </div>=0A<div><span style=3D"= font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #0000= 00;">Then I decided to update the packages to the latest versions avaibl= e, rebooted the server and run "ovirt-hosted-engine-setup".</span></div>= =0A<div> </div>=0A<div><span style=3D"font-family: arial, helvetica= , sans-serif; font-size: 10pt; color: #000000;">But the process never su= cceeds, as I get an error after a long time spent in "<span class=3D"gma= il-m_-3726384503116450878ansible-output-line">[ INFO ] TASK [Wait for th= e host to be up]</span>"</span></div>=0A<div> </div>=0A<div> <= /div>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif; f= ont-size: 10pt; color: #000000;"><span class=3D"gmail-m_-372638450311645= 0878ansible-output-line">[ ERROR ] fatal: [localhost]: FAILED! =3D> {= "ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad.pfm= .loc", "affinity_labels": [], "auto_numa_status": "unknown", "certificat= e": {"organization": "pfm.loc", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv-vir= t-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/clusters/d6= c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-0016= 3e152701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "devic= e_passthrough": {"enabled": false}, "devices": [], "external_network_pro= vider_configurations": [], "external_status": "ok", "hardware_informatio= n": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engine/a= pi/hosts/542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85-439= 8-9402-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown", "= ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name= ": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics": [= ], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdl= ine": ""}, "permissions": [], "port": 54321, "power_management": {"autom= atic_pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_p= roxies": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority": 5= , "status": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosXzaxC= RnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "non_= responsive", "storage_connection_extensions": [], "summary": {"total": 0= }, "tags": [], "transparent_huge_pages": {"enabled": false}, "type": "rh= el", "unmanaged_networks": [], "update_available": false}]}, "attempts":= 120, "changed": false}<br /></span><span class=3D"gmail-m_-372638450311= 6450878ansible-output-line">[ INFO ] TASK [Remove local vm dir]<br /></s= pan><span class=3D"gmail-m_-3726384503116450878ansible-output-line">[ IN= FO ] TASK [Notify the user about a failure]<br /></span><span class=3D"g= mail-m_-3726384503116450878ansible-output-line">[ ERROR ] fatal: [localh= ost]: FAILED! =3D> {"changed": false, "msg": "The system may not be p= rovisioned according to the playbook results: please check the logs for= the issue, fix accordingly or re-deploy from scratch.n"}</span></span><= /div>=0A<div> </div>=0A<div> </div>=0A<div><span style=3D"font= -family: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;"= div> </div>=0A<blockquote class=3D"gmail_quote" style=3D"margin: 0p= x 0px 0px .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<= div> </div>=0A<div><span style=3D"font-family: arial, helvetica, sa= ns-serif; font-size: 10pt; color: #000000;">Regards</span></div>=0A<div>= </div>=0A<div> </div>=0A<br /><hr />FreeMail powered by <a hr= ef=3D"https://mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">mai= l.fr</a> <br />_______________________________________________<br /> Use= rs mailing list<br /><a href=3D"mailto:Users@ovirt.org" target=3D"_blank= " rel=3D"noreferrer noopener">Users@ovirt.org</a><br /><a href=3D"http:/= /lists.ovirt.org/mailman/listinfo/users" target=3D"_blank" rel=3D"norefe= rrer noopener">http://lists.ovirt.org/mailman/listinfo/users</a><br /><b= r /></blockquote>=0A</div>=0A</div>=0A</div>=0A</blockquote>=0A = <br/><hr>FreeMail powered by <a href=3D"https://mail.fr" tar= get=3D"_blank">mail.fr</a>=0A --=_3724be82c6c86ae6590b11a116c8d205--

On Tue, Mar 20, 2018 at 11:44 AM, <spfma.tech@e.mail.fr> wrote:
Hi,
In fact it is a workaround coming from you I found in the bugtrack that helped me :
chmod 644 /var/cache/vdsm/schema/*
As the only thing looking like a weird error I have found was :
ERROR Exception raised#012Traceback (most recent call last):#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, in run#012 serve_clients(log)#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 103, in serve_clients#012 cif = clientIF.getInstance(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 250, in getInstance#012 cls._instance = clientIF(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 144, in __init__#012 self._prepareJSONRPCServer()#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 307, in _prepareJSONRPCServer#012 bridge = Bridge.DynamicBridge()#012 File "/usr/lib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 67, in __init__#012 self._schema = vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/lib/python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#012 raise SchemaNotFound("Unable to find API schema file")#012SchemaNotFound: Unable to find API schema file
Thanks, it's tracked here: https://bugzilla.redhat.com/1552565 A fix will come in the next build.
So I can go one step futher, but the installation still fails in the end, with file permission problems in datastore files (i chose NFS 4.1). I can't indeed touch or get informations even logged in root. But I can create and delete files in the same directory.
Is there a workaround for this too ?
Everything should get wrote and read on the NFS export as vdsm:kvm (36:36); can you please ensure that everything is fine with that?
Regards
Le 19-Mar-2018 17:48:41 +0100, stirabos@redhat.com a écrit:
On Mon, Mar 19, 2018 at 4:56 PM, <spfma.tech@e.mail.fr> wrote:
Hi,
I wanted to rebuild a new hosted engine setup, as the old was corrupted (too much violent poweroff !)
So the server was not reinstalled, I just runned "ovirt-hosted-engine-cleanup". The network setup generated by vdsm seems to be still in place, so I haven't changed anything there.
Then I decided to update the packages to the latest versions avaible, rebooted the server and run "ovirt-hosted-engine-setup".
But the process never succeeds, as I get an error after a long time spent in "[ INFO ] TASK [Wait for the host to be up]"
[ ERROR ] fatal: [localhost]: FAILED! => {"ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad.pfm.loc", "affinity_labels": [], "auto_numa_status": "unknown", "certificate": {"organization": "pfm.loc", "subject": "O=pfm.loc,CN=pfm-srv-virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/clusters/d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-00163e152701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled": false}, "devices": [], "external_network_provider_configurations": [], "external_status": "ok", "hardware_information": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engine/api/hosts/ 542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85-4398-9402-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown", "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics": [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdline": ""}, "permissions": [], "port": 54321, "power_management": {"automatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_proxies": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "non_responsive", "storage_connection_extensions": [], "summary": {"total": 0}, "tags": [], "transparent_huge_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks": [], "update_available": false}]}, "attempts": 120, "changed": false} [ INFO ] TASK [Remove local vm dir] [ INFO ] TASK [Notify the user about a failure] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The system may not be provisioned according to the playbook results: please check the logs for the issue, fix accordingly or re-deploy from scratch.n"}
I made another try with Cockpit, it is the same.
Am I doing something wrong or is there a bug ?
I suppose that your host was condifured with DHCP, if so it's this one: https://bugzilla.redhat.com/1549642
The fix will come with 4.2.2.
Regards
------------------------------ FreeMail powered by mail.fr _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
------------------------------ FreeMail powered by mail.fr
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

{"ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad.=
spfma.tech@e.mail.fr</a>></span> wrote:<br /></span>=0A<blockquote c= lass=3D"gmail_quote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px= solid #cccccc; padding-left: 1ex;">=0A<div><span style=3D"font-family:= arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">Hi,<br= /></span></div>=0A<div> </div>=0A<div><span style=3D"font-family:= arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">I wante= d to rebuild a new hosted engine setup, as the old was corrupted (too mu= ch violent poweroff !)</span></div>=0A<div> </div>=0A<div><span sty= le=3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; color:= #000000;">So the server was not reinstalled, I just runned "ovirt-hoste= d-engine-cleanup". The network setup generated by vdsm seems to be still= in place, so I haven't changed anything there.</span></div>=0A<div>&nbs=
--=_2627f23f400ebb389cc57c156f3cb9f7 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable I tried to make a cleaner install : after cleanup, I recreated "/rhev/da= ta-center/mnt/" and ran the installer again.=0A As you can see, it cra= shed again with the same access denied error on this file : [ INFO ]= TASK [Copy configuration archive to storage]=0A[ ERROR ] fatal: [localh= ost]: FAILED! =3D> {"changed": true, "cmd": ["dd", "bs=3D20480", "count= =3D1", "oflag=3Ddirect", "if=3D/var/tmp/localvmVBRLpL/b1884198-69e6-4096= -939d-03c87112de10", "of=3D/rhev/data-center/mnt/10.100.2.132:_volume3_o= virt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/5= 89d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10= "], "delta": "0:00:00.004468", "end": "2018-03-20 15:57:34.199405", "msg= ": "non-zero return code", "rc": 1, "start": "2018-03-20 15:57:34.194937= ", "stderr": "dd: impossible d'ouvrir /rhev/data-center/mnt/10.100.2.132= :_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db33= 87/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-0= 3c87112de10 : Permission non accorde", "stderr_lines": ["dd: impossible= d'ouvrir /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__sel= f__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495= -aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 : Permission non= accorde"], "stdout": "", "stdout_lines": []}=0A[ ERROR ] Failed to exec= ute stage 'Closing up': Failed executing ansible-playbook=0A But the f= ile permissions look ok to me : -rw-rw----. 1 vdsm kvm 1,0G 20 mars 2= 018 /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hos= ted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-= 45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10=0A=0A So I decided to= test something : I set a shell for "vdsm", so I could login : su - v= dsm -c "touch /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine_= _self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-= 4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10" && echo "OK= "=0AOK As far as I can see,still no permission problem =0A=0ABut if I= try the same as "root" : =0A=0Atouch /rhev/data-center/mnt/10.100.2.132= :_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db33= 87/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-0= 3c87112de10 && echo "OK"=0Atouch: impossible de faire un touch /rhev/dat= a-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-= af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/= b1884198-69e6-4096-939d-03c87112de10 : Permission non accorde =0A=0AOf c= ourse, "root" and "vdsm" can create, touch and delete other files flawle= ssly in this share. =0A=0AIt looks like some kind of immutable file, but= is is not suppose to exist on NFS, does it ? =0A=0ARegards =0A=0A Le 20= -Mar-2018 12:22:50 +0100, stirabos@redhat.com a crit: =0A=0A On Tue, M= ar 20, 2018 at 11:44 AM, wrote:=0A=0A Hi,=0A In fact it is a wo= rkaround coming from you I found in the bugtrack that helped me : = =0A=0Achmod 644 /var/cache/vdsm/schema/* =0A=0AAs the only thing l= ooking like a weird error I have found was : =0A=0AERROR Exception r= aised#012Traceback (most recent call last):#012 File "/usr/lib/python2.7= /site-packages/vdsm/vdsmd.py", line 156, in run#012 serve_clients(log)#0= 12 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 103, in s= erve_clients#012 cif =3D clientIF.getInstance(irs, log, scheduler)#012 F= ile "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 250, in ge= tInstance#012 cls._instance =3D clientIF(irs, log, scheduler)#012 File "= /usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 144, in __init_= _#012 self._prepareJSONRPCServer()#012 File "/usr/lib/python2.7/site-pac= kages/vdsm/clientIF.py", line 307, in _prepareJSONRPCServer#012 bridge= =3D Bridge.DynamicBridge()#012 File "/usr/lib/python2.7/site-packages/v= dsm/rpc/Bridge.py", line 67, in __init__#012 self._schema =3D vdsmapi.Sc= hema(paths, api_strict_mode)#012 File "/usr/lib/python2.7/site-packages/= vdsm/api/vdsmapi.py", line 217, in __init__#012 raise SchemaNotFound("Un= able to find API schema file")#012SchemaNotFound: Unable to find API sch= ema file Thanks, it's tracked here: https://bugzilla.redhat.com/15525= 65 A fix will come in the next build. =0A So I can go one step f= uther, but the installation still fails in the end, with file permission= problems in datastore files (i chose NFS 4.1). I can't indeed touch or= get informations even logged in root. But I can create and delete files= in the same directory. Is there a workaround for this too ? Everyt= hing should get wrote and read on the NFS export as vdsm:kvm (36:36); ca= n you please ensure that everything is fine with that? =0A Regards= =0A=0A Le 19-Mar-2018 17:48:41 +0100, stirabos@redhat.com a crit: = =0A=0A On Mon, Mar 19, 2018 at 4:56 PM, wrote:=0A=0A Hi,=0A I wante= d to rebuild a new hosted engine setup, as the old was corrupted (too mu= ch violent poweroff !) So the server was not reinstalled, I just runne= d "ovirt-hosted-engine-cleanup". The network setup generated by vdsm see= ms to be still in place, so I haven't changed anything there. Then I d= ecided to update the packages to the latest versions avaible, rebooted t= he server and run "ovirt-hosted-engine-setup". But the process never s= ucceeds, as I get an error after a long time spent in "[ INFO ] TASK [Wa= it for the host to be up]" [ ERROR ] fatal: [localhost]: FAILED! =3D= pfm.loc", "affinity_labels": [], "auto_numa_status": "unknown", "certifi= cate": {"organization": "pfm.loc", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv-= virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/clusters= /d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-0= 0163e152701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "de= vice_passthrough": {"enabled": false}, "devices": [], "external_network_= provider_configurations": [], "external_status": "ok", "hardware_informa= tion": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engin= e/api/hosts/542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85-= 4398-9402-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown"= , "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "n= ame": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics"= : [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_c= mdline": ""}, "permissions": [], "port": 54321, "power_management": {"au= tomatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "p= m_proxies": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority"= : 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosXz= axCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "n= on_responsive", "storage_connection_extensions": [], "summary": {"total"= : 0}, "tags": [], "transparent_huge_pages": {"enabled": false}, "type":= "rhel", "unmanaged_networks": [], "update_available": false}]}, "attemp= ts": 120, "changed": false}=0A[ INFO ] TASK [Remove local vm dir]=0A[ IN= FO ] TASK [Notify the user about a failure]=0A[ ERROR ] fatal: [localhos= t]: FAILED! =3D> {"changed": false, "msg": "The system may not be provis= ioned according to the playbook results: please check the logs for the i= ssue, fix accordingly or re-deploy from scratch.n"} I made another t= ry with Cockpit, it is the same. Am I doing something wrong or is ther= e a bug ? I suppose that your host was condifured with DHCP, if so it= 's this one: https://bugzilla.redhat.com/1549642 The fix will come wit= h 4.2.2. =0A Regards =0A=0A-------------------------------------= ------------------------------------------------------------=0AFreeMail= powered by mail.fr =0A_______________________________________________= =0A Users mailing list=0AUsers@ovirt.org=0Ahttp://lists.ovirt.org/mailma= n/listinfo/users=0A=0A--------------------------------------------------= -----------------------------------------------=0AFreeMail powered by ma= il.fr =0A_______________________________________________=0A Users maili= ng list=0AUsers@ovirt.org=0Ahttp://lists.ovirt.org/mailman/listinfo/user= s=0A=0A-----------------------------------------------------------------= --------------------------------=0AFreeMail powered by mail.fr --=_2627f23f400ebb389cc57c156f3cb9f7 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <div><span style=3D"font-family: arial, helvetica,sans-serif; font-size:= 10pt; color: #000000;">I tried to make a cleaner install : after cleanu= p, I recreated "/rhev/data-center/mnt/" and ran the installer again.<br= /></span></div>=0A<div> </div>=0A<div><span style=3D"font-family:= arial, helvetica,sans-serif; font-size: 10pt; color: #000000;">As you c= an see, it crashed again with the same access denied error on this file= : </span></div>=0A<div> </div>=0A<div><span style=3D"font-family:= arial, helvetica,sans-serif; font-size: 10pt; color: #000000;">[ INFO&n= bsp; ] TASK [Copy configuration archive to storage]<br />[ ERROR ] fatal= : [localhost]: FAILED! =3D> {"changed": true, "cmd": ["dd", "bs=3D204= 80", "count=3D1", "oflag=3Ddirect", "if=3D/var/tmp/localvmVBRLpL/b188419= 8-69e6-4096-939d-03c87112de10", "of=3D/rhev/data-center/mnt/10.100.2.132= :_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db33= 87/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-0= 3c87112de10"], "delta": "0:00:00.004468", "end": "2018-03-20 15:57:34.19= 9405", "msg": "non-zero return code", "rc": 1, "start": "2018-03-20 15:5= 7:34.194937", "stderr": "dd: impossible d'ouvrir « /rhev/data= -center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-a= f01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b= 1884198-69e6-4096-939d-03c87112de10 »: Permission non accord&= eacute;e", "stderr_lines": ["dd: impossible d'ouvrir « /rhev/= data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d95= 46-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a185= 26/b1884198-69e6-4096-939d-03c87112de10 »: Permission non acc= ordée"], "stdout": "", "stdout_lines": []}<br />[ ERROR ] Failed= to execute stage 'Closing up': Failed executing ansible-playbook<br /><= /span></div>=0A<div> </div>=0A<div><span style=3D"font-family: aria= l, helvetica,sans-serif; font-size: 10pt; color: #000000;">But the file= permissions look ok to me : </span></div>=0A<div> </div>=0A<div><s= pan style=3D"font-family: arial, helvetica,sans-serif; font-size: 10pt;= color: #000000;">-rw-rw----. 1 vdsm kvm 1,0G 20 mars 2018 /= rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/0= 15d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b= 2a18526/b1884198-69e6-4096-939d-03c87112de10<br /><br /></span></div>=0A= <div><span style=3D"font-family: arial, helvetica,sans-serif; font-size:= 10pt; color: #000000;">So I decided to test something : I set a s= hell for "vdsm", so I could login : </span></div>=0A<div> </di= v>=0A<div>su - vdsm -c "touch /rhev/data-center/mnt/10.100.2.132:_volume= 3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/image= s/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112d= e10" && echo "OK"<br />OK</div>=0A<div> </div>=0A<div>As fa= r as I can see,still no permission problem</div>=0A<p>But if I try= the same as "root" :</p>=0A<p>touch /rhev/data-center/mnt/10.100.2.132:= _volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db338= 7/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03= c87112de10 && echo "OK"<br />touch: impossible de faire un touch= « /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine_= _self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-= 4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »= : Permission non accordée</p>=0A<p>Of course, "root" and "vdsm" c= an create, touch and delete other files flawlessly in this share.</p>=0A= <p>It looks like some kind of immutable file, but is is not suppose to e= xist on NFS, does it ?</p>=0A<p>Regards</p>=0A<p> </p>=0A<p><br /><= br /> Le 20-Mar-2018 12:22:50 +0100, stirabos@redhat.com a écrit:= </p>=0A<div> </div>=0A<blockquote style=3D"margin-left: 0; padding-= left: 5px; border-left: 2px solid navy;">=0A<div dir=3D"ltr"><br />=0A<d= iv class=3D"gmail_extra"><br />=0A<div class=3D"gmail_quote">On Tue, Mar= 20, 2018 at 11:44 AM, <span dir=3D"ltr"><<a href=3D"mailto:spfma.tec= h@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">spfma.tech@e.= mail.fr</a>></span> wrote:<br />=0A<blockquote class=3D"gmail_quote"= style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #cccccc; padd= ing-left: 1ex;">=0A<div><span style=3D"font-family: arial, helvetica, sa= ns-serif; font-size: 10pt; color: #000000;"> </span></div>=0A<div><= span style=3D"font-family: arial, helvetica, sans-serif; font-size: 10pt= ; color: #000000;">Hi,<br /></span></div>=0A<div> </div>=0A<div>&nb= sp;</div>=0A<div> </div>=0A<div><span style=3D"font-family: arial,= helvetica, sans-serif; font-size: 10pt; color: #000000;">In fact it is= a workaround coming from you I found in the bugtrack that helped me : <= /span></div>=0A<div> </div>=0A<div> </div>=0A<div> </div>= =0A<div>=0A<pre id=3D"gmail-m_-4123427470926593816comment_text_8" class= =3D"gmail-m_-4123427470926593816bz_comment_text gmail-m_-412342747092659= 3816bz_wrap_comment_text">chmod 644 /var/cache/vdsm/schema/*</pre>=0A</d= iv>=0A<div> </div>=0A<p>As the only thing looking like a weird erro= r I have found was :</p>=0A<div> </div>=0A<div> </div>=0A<p>ER= ROR Exception raised#012Traceback (most recent call last):#012 Fil= e "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, in run#012= serve_clients(log)#012 File "/usr/lib/python2.7= /site-packages/vdsm/vdsmd.py", line 103, in serve_clients#012  = ; cif =3D clientIF.getInstance(irs, log, scheduler)#012 File= "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 250, in getIn= stance#012 cls._instance =3D clientIF(irs, log, schedu= ler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py",= line 144, in __init__#012 self._prepareJSONRPCServer(= )#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", li= ne 307, in _prepareJSONRPCServer#012 bridge =3D Bridge= .DynamicBridge()#012 File "/usr/lib/python2.7/site-packages/vdsm/r= pc/Bridge.py", line 67, in __init__#012 self._schema= =3D vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/lib/pyt= hon2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#012&nbs= p; raise SchemaNotFound("Unable to find API schema file")#01= 2SchemaNotFound: Unable to find API schema file</p>=0A</blockquote>=0A<d= iv> </div>=0A<div>Thanks, it's tracked here:</div>=0A<div><a href= =3D"https://bugzilla.redhat.com/1552565" target=3D"_blank">https://bugzi= lla.redhat.com/1552565</a></div>=0A<div> </div>=0A<div>A fix will c= ome in the next build.</div>=0A<div> </div>=0A<blockquote class=3D"= gmail_quote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #= cccccc; padding-left: 1ex;">=0A<div> </div>=0A<div> </div>=0A<= div>So I can go one step futher, but the installation still fails in the= end, with file permission problems in datastore files (i chose NFS 4.1)= . I can't indeed touch or get informations even logged in root. But I ca= n create and delete files in the same directory.</div>=0A<div> </di= v>=0A<div>Is there a workaround for this too ?</div>=0A</blockquote>=0A<= div> </div>=0A<div>Everything should get wrote and read on the NFS= export as vdsm:kvm (36:36); can you please ensure that everything is fi= ne with that?</div>=0A<div> </div>=0A<blockquote class=3D"gmail_quo= te" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #cccccc; p= adding-left: 1ex;">=0A<div> </div>=0A<div>Regards</div>=0A<p><br />= <br /> Le 19-Mar-2018 17:48:41 +0100, <a href=3D"mailto:stirabos@redhat.= com" target=3D"_blank" rel=3D"noreferrer noopener">stirabos@redhat.com</= a> a écrit:</p>=0A<div> </div>=0A<div> </div>=0A<div>&n= bsp;</div>=0A<blockquote style=3D"margin-left: 0px; padding-left: 5px; b= order-left: 2px solid #000080;">=0A<div dir=3D"ltr"><br />=0A<div class= =3D"gmail_extra"><br />=0A<div class=3D"gmail_quote"><span class=3D"gmai= l-">On Mon, Mar 19, 2018 at 4:56 PM, <span dir=3D"ltr"><<a href=3D"ma= ilto:spfma.tech@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noopener"= p;</div>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif= ; font-size: 10pt; color: #000000;">Then I decided to update the package= s to the latest versions avaible, rebooted the server and run "ovirt-hos= ted-engine-setup".</span></div>=0A<div> </div>=0A<div><span style= =3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #= 000000;">But the process never succeeds, as I get an error after a long= time spent in "<span class=3D"gmail-m_-4123427470926593816gmail-m_-3726= 384503116450878ansible-output-line">[ INFO ] TASK [Wait for the host to= be up]</span>"</span></div>=0A<div> </div>=0A<div> </div>=0A<= div><span style=3D"font-family: arial, helvetica, sans-serif; font-size:= 10pt; color: #000000;"><span class=3D"gmail-"><span class=3D"gmail-m_-4= 123427470926593816gmail-m_-3726384503116450878ansible-output-line">[ ERR= OR ] fatal: [localhost]: FAILED! =3D> {"ansible_facts": {"ovirt_hosts= ": [{"address": "pfm-srv-virt-1.pfm-ad.pfm.loc", "affinity_labels": [],= "auto_numa_status": "unknown", "certificate": {"organization": "pfm.loc= ", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv-virt-1.pfm-ad.pfm.loc"}, "cluste= r": {"href": "/ovirt-engine/api/clusters/d6c9358e-2b8b-11e8-bc86-00163e1= 52701", "id": "d6c9358e-2b8b-11e8-bc86-00163e152701"}, "comment": "", "c= pu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled": f= alse}, "devices": [], "external_network_provider_configurations": [], "e= xternal_status": "ok", "hardware_information": {"supported_rng_sources":= []}, "hooks": [], "href": "/ovirt-engine/api/hosts/542566c4-fc85-4398-9= 402-10c8adaa9554", "id": "542566c4-fc85-4398-9402-10c8adaa9554", "katell= o_errata": [], "kdump_status": "unknown", "ksm": {"enabled": false}, "ma= x_scheduling_memory": 0, "memory": 0, "name": "pfm-srv-virt-1.pfm-ad.pfm= .loc", "network_attachments": [], "nics": [], "numa_nodes": [], "numa_su= pported": false, "os": {"custom_kernel_cmdline": ""}, "permissions": [],= "port": 54321, "power_management": {"automatic_pm_enabled": true, "enab= led": false, "kdump_detection": true, "pm_proxies": []}, "protocol": "st= omp", "se_linux": {}, "spm": {"priority": 5, "status": "none"}, "ssh": {= "fingerprint": "SHA256:J75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8", "po= rt": 22}, "statistics": [], "status": "non_responsive", "storage_connect= ion_extensions": [], "summary": {"total": 0}, "tags": [], "transparent_h= uge_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks": []= , "update_available": false}]}, "attempts": 120, "changed": false}<br />= </span><span class=3D"gmail-m_-4123427470926593816gmail-m_-3726384503116= 450878ansible-output-line">[ INFO ] TASK [Remove local vm dir]<br /></sp= an><span class=3D"gmail-m_-4123427470926593816gmail-m_-37263845031164508= 78ansible-output-line">[ INFO ] TASK [Notify the user about a failure]<b= r /></span></span><span class=3D"gmail-m_-4123427470926593816gmail-m_-37= 26384503116450878ansible-output-line">[ ERROR ] fatal: [localhost]: FAIL= ED! =3D> {"changed": false, "msg": "The system may not be provisioned= according to the playbook results: please check the logs for the issue,= fix accordingly or re-deploy from scratch.n"}</span></span></div>=0A<di= v> </div>=0A<div> </div>=0A<div><span style=3D"font-family: ar= ial, helvetica, sans-serif; font-size: 10pt; color: #000000;">I made ano= ther try with Cockpit, it is the same.</span></div>=0A<div> </div>= =0A<div><span style=3D"font-family: arial, helvetica, sans-serif; font-s= ize: 10pt; color: #000000;">Am I doing something wrong or is there a bug= ?</span></div>=0A</blockquote>=0A<div> </div>=0A<div>I suppose tha= t your host was condifured with DHCP, if so it's this one:</div>=0A<div>= <a href=3D"https://bugzilla.redhat.com/1549642" target=3D"_blank" rel=3D= "noreferrer noopener">https://bugzilla.redhat.com/1549642</a></div>=0A<d= iv> </div>=0A<div>The fix will come with 4.2.2.</div>=0A<div> = </div>=0A<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px= .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<div> = ;</div>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif;= font-size: 10pt; color: #000000;">Regards</span></div>=0A<div> </d= iv>=0A<div> </div>=0A<br /><hr />FreeMail powered by <a href=3D"htt= ps://mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">mail.fr</a>= <br /><span class=3D"gmail-">__________________________________________= _____<br /> Users mailing list<br /><a href=3D"mailto:Users@ovirt.org" t= arget=3D"_blank" rel=3D"noreferrer noopener">Users@ovirt.org</a><br /><a= href=3D"http://lists.ovirt.org/mailman/listinfo/users" target=3D"_blank= " rel=3D"noreferrer noopener">http://lists.ovirt.org/mailman/listinfo/us= ers</a><br /><br /></span></blockquote>=0A</div>=0A</div>=0A</div>=0A</b= lockquote>=0A<div class=3D"gmail-HOEnZb">=0A<div class=3D"gmail-h5"><br= /><hr />FreeMail powered by <a href=3D"https://mail.fr" target=3D"_blan= k" rel=3D"noreferrer noopener">mail.fr</a></div>=0A</div>=0A<br />______= _________________________________________<br /> Users mailing list<br />= <a href=3D"mailto:Users@ovirt.org" target=3D"_blank">Users@ovirt.org</a>= <br /><a href=3D"http://lists.ovirt.org/mailman/listinfo/users" target= =3D"_blank" rel=3D"noreferrer noopener">http://lists.ovirt.org/mailman/l= istinfo/users</a><br /><br /></blockquote>=0A</div>=0A</div>=0A</div>=0A= </blockquote>=0A <br/><hr>FreeMail powered by <a href= =3D"https://mail.fr" target=3D"_blank">mail.fr</a>=0A --=_2627f23f400ebb389cc57c156f3cb9f7--

On Tue, Mar 20, 2018 at 4:12 PM, <spfma.tech@e.mail.fr> wrote:
I tried to make a cleaner install : after cleanup, I recreated "/rhev/data-center/mnt/" and ran the installer again.
It should be automatically created by vdsm, can you please avoid that?
As you can see, it crashed again with the same access denied error on this file :
[ INFO ] TASK [Copy configuration archive to storage] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": true, "cmd": ["dd", "bs=20480", "count=1", "oflag=direct", "if=/var/tmp/localvmVBRLpL/ b1884198-69e6-4096-939d-03c87112de10", "of=/rhev/data-center/mnt/10. 100.2.132:_volume3_ovirt__engine__self__hosted/015d9546- af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495- aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10"], "delta": "0:00:00.004468", "end": "2018-03-20 15:57:34.199405", "msg": "non-zero return code", "rc": 1, "start": "2018-03-20 15:57:34.194937", "stderr": "dd: impossible d'ouvrir « /rhev/data-center/mnt/10. 100.2.132:_volume3_ovirt__engine__self__hosted/015d9546- af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495- aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »: Permission non accordée", "stderr_lines": ["dd: impossible d'ouvrir « /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__ engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/ images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »: Permission non accordée"], "stdout": "", "stdout_lines": []} [ ERROR ] Failed to execute stage 'Closing up': Failed executing ansible-playbook
But the file permissions look ok to me :
-rw-rw----. 1 vdsm kvm 1,0G 20 mars 2018 /rhev/data-center/mnt/10.100. 2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01- 4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57- 45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10
So I decided to test something : I set a shell for "vdsm", so I could login :
su - vdsm -c "touch /rhev/data-center/mnt/10.100. 2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01- 4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57- 45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10" && echo "OK" OK
As far as I can see,still no permission problem
But if I try the same as "root" :
touch /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__ self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/ 589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 && echo "OK" touch: impossible de faire un touch « /rhev/data-center/mnt/10. 100.2.132:_volume3_ovirt__engine__self__hosted/015d9546- af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495- aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »: Permission non accordée
Of course, "root" and "vdsm" can create, touch and delete other files flawlessly in this share.
It looks like some kind of immutable file, but is is not suppose to exist on NFS, does it ?
Regards
Le 20-Mar-2018 12:22:50 +0100, stirabos@redhat.com a écrit:
On Tue, Mar 20, 2018 at 11:44 AM, <spfma.tech@e.mail.fr> wrote:
Hi,
In fact it is a workaround coming from you I found in the bugtrack that helped me :
chmod 644 /var/cache/vdsm/schema/*
As the only thing looking like a weird error I have found was :
ERROR Exception raised#012Traceback (most recent call last):#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, in run#012 serve_clients(log)#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 103, in serve_clients#012 cif = clientIF.getInstance(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 250, in getInstance#012 cls._instance = clientIF(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 144, in __init__#012 self._prepareJSONRPCServer()#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 307, in _prepareJSONRPCServer#012 bridge = Bridge.DynamicBridge()#012 File "/usr/lib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 67, in __init__#012 self._schema = vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/lib/python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#012 raise SchemaNotFound("Unable to find API schema file")#012SchemaNotFound: Unable to find API schema file
Thanks, it's tracked here: https://bugzilla.redhat.com/1552565
A fix will come in the next build.
So I can go one step futher, but the installation still fails in the end, with file permission problems in datastore files (i chose NFS 4.1). I can't indeed touch or get informations even logged in root. But I can create and delete files in the same directory.
Is there a workaround for this too ?
Everything should get wrote and read on the NFS export as vdsm:kvm (36:36); can you please ensure that everything is fine with that?
Regards
Le 19-Mar-2018 17:48:41 +0100, stirabos@redhat.com a écrit:
On Mon, Mar 19, 2018 at 4:56 PM, <spfma.tech@e.mail.fr> wrote:
Hi,
I wanted to rebuild a new hosted engine setup, as the old was corrupted (too much violent poweroff !)
So the server was not reinstalled, I just runned "ovirt-hosted-engine-cleanup". The network setup generated by vdsm seems to be still in place, so I haven't changed anything there.
Then I decided to update the packages to the latest versions avaible, rebooted the server and run "ovirt-hosted-engine-setup".
But the process never succeeds, as I get an error after a long time spent in "[ INFO ] TASK [Wait for the host to be up]"
[ ERROR ] fatal: [localhost]: FAILED! => {"ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad.pfm.loc", "affinity_labels": [], "auto_numa_status": "unknown", "certificate": {"organization": "pfm.loc", "subject": "O=pfm.loc,CN=pfm-srv-virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/clusters/d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-00163e152701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled": false}, "devices": [], "external_network_provider_configurations": [], "external_status": "ok", "hardware_information": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engine/api/hosts/ 542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85-4398-9402-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown", "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "name": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics": [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_cmdline": ""}, "permissions": [], "port": 54321, "power_management": {"automatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "pm_proxies": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "non_responsive", "storage_connection_extensions": [], "summary": {"total": 0}, "tags": [], "transparent_huge_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks": [], "update_available": false}]}, "attempts": 120, "changed": false} [ INFO ] TASK [Remove local vm dir] [ INFO ] TASK [Notify the user about a failure] [ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "The system may not be provisioned according to the playbook results: please check the logs for the issue, fix accordingly or re-deploy from scratch.n"}
I made another try with Cockpit, it is the same.
Am I doing something wrong or is there a bug ?
I suppose that your host was condifured with DHCP, if so it's this one: https://bugzilla.redhat.com/1549642
The fix will come with 4.2.2.
Regards
------------------------------ FreeMail powered by mail.fr _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
------------------------------ FreeMail powered by mail.fr
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users
------------------------------ FreeMail powered by mail.fr
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

<a href=3D"https://bugzilla.redhat.com/1552565" target=3D"_blank" rel= =3D"noreferrer noopener">https://bugzilla.redhat.com/1552565</a></div>= =0A<div> </div>=0A<div>A fix will come in the next build.</div>=0A<=
<span class=3D"m_3642752428400018059gmail-m_-4123427470926593816gmail-m= _-3726384503116450878ansible-output-line">[ INFO ] TASK [Notify the user= about a failure]<br /></span></span><span class=3D"m_364275242840001805= 9gmail-m_-4123427470926593816gmail-m_-3726384503116450878ansible-output-=
<hr />FreeMail powered by <a href=3D"https://mail.fr" target=3D"_blank"= rel=3D"noreferrer noopener">mail.fr</a> <br /><span class=3D"m_36427524= 28400018059gmail-">_______________________________________________<br />= Users mailing list<br /><a href=3D"mailto:Users@ovirt.org" target=3D"_b= lank" rel=3D"noreferrer noopener">Users@ovirt.org</a><br /><a href=3D"ht= tp://lists.ovirt.org/mailman/listinfo/users" target=3D"_blank" rel=3D"no= referrer noopener">http://lists.ovirt.org/mailman/listinfo/users</a><br= /><br /></span></blockquote>=0A</div>=0A</div>=0A</div>=0A</blockquote>= =0A<div class=3D"m_3642752428400018059gmail-HOEnZb">=0A<div class=3D"m_3= 642752428400018059gmail-h5"><br /><hr />FreeMail powered by <a href=3D"h= ttps://mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">mail.fr</a= </div>=0A</div>=0A<br />_______________________________________________= <br /> Users mailing list<br /><a href=3D"mailto:Users@ovirt.org" target= =3D"_blank" rel=3D"noreferrer noopener">Users@ovirt.org</a><br /><a href= =3D"http://lists.ovirt.org/mailman/listinfo/users" target=3D"_blank" rel= =3D"noreferrer noopener">http://lists.ovirt.org/mailman/listinfo/users</= a><br /><br /></blockquote>=0A</div>=0A</div>=0A</div>=0A</blockquote>= =0A<br /><hr />FreeMail powered by <a href=3D"https://mail.fr" target=3D= "_blank" rel=3D"noreferrer noopener">mail.fr</a></div>=0A</div>=0A<br />= _______________________________________________<br /> Users mailing list= <br /><a href=3D"mailto:Users@ovirt.org" target=3D"_blank">Users@ovirt.o= rg</a><br /><a href=3D"http://lists.ovirt.org/mailman/listinfo/users" ta= rget=3D"_blank" rel=3D"noreferrer noopener">http://lists.ovirt.org/mailm= an/listinfo/users</a><br /><br /></blockquote>=0A</div>=0A</div>=0A</div= =0A</blockquote>=0A <br/><hr>FreeMail powered by <a=
--=_1ba088308412248e0037104062e7fe21 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Just to be sure I hadn't altered something, I renamed "mnt" in something= else and it was indeed recreated.=0A=0A Le 20-Mar-2018 16:57:22 +0100,= stirabos@redhat.com a crit: =0A=0A On Tue, Mar 20, 2018 at 4:12 PM, = wrote:=0A=0A I tried to make a cleaner install : after cleanup, I recre= ated "/rhev/data-center/mnt/" and ran the installer again.=0A It shou= ld be automatically created by vdsm, can you please avoid that? =0A = As you can see, it crashed again with the same access denied error on= this file : [ INFO ] TASK [Copy configuration archive to storage]=0A= [ ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": true, "cmd": ["dd= ", "bs=3D20480", "count=3D1", "oflag=3Ddirect", "if=3D/var/tmp/localvmVB= RLpL/b1884198-69e6-4096-939d-03c87112de10", "of=3D/rhev/data-center/mnt/= 10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891= e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6= -4096-939d-03c87112de10"], "delta": "0:00:00.004468", "end": "2018-03-20= 15:57:34.199405", "msg": "non-zero return code", "rc": 1, "start": "201= 8-03-20 15:57:34.194937", "stderr": "dd: impossible d'ouvrir /rhev/data-= center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af= 01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1= 884198-69e6-4096-939d-03c87112de10 : Permission non accorde", "stderr_li= nes": ["dd: impossible d'ouvrir /rhev/data-center/mnt/10.100.2.132:_volu= me3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/ima= ges/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c8711= 2de10 : Permission non accorde"], "stdout": "", "stdout_lines": []}=0A[= ERROR ] Failed to execute stage 'Closing up': Failed executing ansible-= playbook=0A But the file permissions look ok to me : -rw-rw----. 1= vdsm kvm 1,0G 20 mars 2018 /rhev/data-center/mnt/10.100.2.132:_volume3_= ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/= 589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de1= 0=0A=0A So I decided to test something : I set a shell for "vdsm", so I= could login : su - vdsm -c "touch /rhev/data-center/mnt/10.100.2.132= :_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db33= 87/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-0= 3c87112de10" && echo "OK"=0AOK As far as I can see,still no permission= problem =0A=0ABut if I try the same as "root" : =0A=0Atouch /rhev/data-= center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af= 01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1= 884198-69e6-4096-939d-03c87112de10 && echo "OK"=0Atouch: impossible de f= aire un touch /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine_= _self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-= 4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 : Permission= non accorde =0A=0AOf course, "root" and "vdsm" can create, touch and de= lete other files flawlessly in this share. =0A=0AIt looks like some kind= of immutable file, but is is not suppose to exist on NFS, does it ? =0A= =0ARegards =0A=0A Le 20-Mar-2018 12:22:50 +0100, stirabos@redhat.com a= crit: =0A=0A On Tue, Mar 20, 2018 at 11:44 AM, wrote:=0A=0A Hi,=0A= In fact it is a workaround coming from you I found in the bugtrac= k that helped me : =0A=0Achmod 644 /var/cache/vdsm/schema/* = =0A=0AAs the only thing looking like a weird error I have found was : = =0A=0AERROR Exception raised#012Traceback (most recent call last):#0= 12 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, in r= un#012 serve_clients(log)#012 File "/usr/lib/python2.7/site-packages/vds= m/vdsmd.py", line 103, in serve_clients#012 cif =3D clientIF.getInstance= (irs, log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/cl= ientIF.py", line 250, in getInstance#012 cls._instance =3D clientIF(irs,= log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/clientI= F.py", line 144, in __init__#012 self._prepareJSONRPCServer()#012 File "= /usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 307, in _prepar= eJSONRPCServer#012 bridge =3D Bridge.DynamicBridge()#012 File "/usr/lib/= python2.7/site-packages/vdsm/rpc/Bridge.py", line 67, in __init__#012 se= lf._schema =3D vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/lib= /python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#012= raise SchemaNotFound("Unable to find API schema file")#012SchemaNotFoun= d: Unable to find API schema file Thanks, it's tracked here: https://= bugzilla.redhat.com/1552565 A fix will come in the next build. =0A = So I can go one step futher, but the installation still fails in the= end, with file permission problems in datastore files (i chose NFS 4.1)= . I can't indeed touch or get informations even logged in root. But I ca= n create and delete files in the same directory. Is there a workaround= for this too ? Everything should get wrote and read on the NFS expor= t as vdsm:kvm (36:36); can you please ensure that everything is fine wit= h that? =0A Regards =0A=0A Le 19-Mar-2018 17:48:41 +0100, stirabos@r= edhat.com a crit: =0A=0A On Mon, Mar 19, 2018 at 4:56 PM, wrote:= =0A=0A Hi,=0A I wanted to rebuild a new hosted engine setup, as the ol= d was corrupted (too much violent poweroff !) So the server was not re= installed, I just runned "ovirt-hosted-engine-cleanup". The network setu= p generated by vdsm seems to be still in place, so I haven't changed any= thing there. Then I decided to update the packages to the latest versi= ons avaible, rebooted the server and run "ovirt-hosted-engine-setup". = But the process never succeeds, as I get an error after a long time spe= nt in "[ INFO ] TASK [Wait for the host to be up]" [ ERROR ] fatal:= [localhost]: FAILED! =3D> {"ansible_facts": {"ovirt_hosts": [{"address"= : "pfm-srv-virt-1.pfm-ad.pfm.loc", "affinity_labels": [], "auto_numa_sta= tus": "unknown", "certificate": {"organization": "pfm.loc", "subject": "= O=3Dpfm.loc,CN=3Dpfm-srv-virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/= ovirt-engine/api/clusters/d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "= d6c9358e-2b8b-11e8-bc86-00163e152701"}, "comment": "", "cpu": {"speed":= 0.0, "topology": {}}, "device_passthrough": {"enabled": false}, "device= s": [], "external_network_provider_configurations": [], "external_status= ": "ok", "hardware_information": {"supported_rng_sources": []}, "hooks":= [], "href": "/ovirt-engine/api/hosts/542566c4-fc85-4398-9402-10c8adaa95= 54", "id": "542566c4-fc85-4398-9402-10c8adaa9554", "katello_errata": [],= "kdump_status": "unknown", "ksm": {"enabled": false}, "max_scheduling_m= emory": 0, "memory": 0, "name": "pfm-srv-virt-1.pfm-ad.pfm.loc", "networ= k_attachments": [], "nics": [], "numa_nodes": [], "numa_supported": fals= e, "os": {"custom_kernel_cmdline": ""}, "permissions": [], "port": 54321= , "power_management": {"automatic_pm_enabled": true, "enabled": false, "= kdump_detection": true, "pm_proxies": []}, "protocol": "stomp", "se_linu= x": {}, "spm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint":= "SHA256:J75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "sta= tistics": [], "status": "non_responsive", "storage_connection_extensions= ": [], "summary": {"total": 0}, "tags": [], "transparent_huge_pages": {"= enabled": false}, "type": "rhel", "unmanaged_networks": [], "update_avai= lable": false}]}, "attempts": 120, "changed": false}=0A[ INFO ] TASK [Re= move local vm dir]=0A[ INFO ] TASK [Notify the user about a failure]=0A[= ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": false, "msg": "The= system may not be provisioned according to the playbook results: please= check the logs for the issue, fix accordingly or re-deploy from scratch= .n"} I made another try with Cockpit, it is the same. Am I doing s= omething wrong or is there a bug ? I suppose that your host was condi= fured with DHCP, if so it's this one: https://bugzilla.redhat.com/154964= 2 The fix will come with 4.2.2. =0A Regards =0A=0A------------= ------------------------------------------------------------------------= -------------=0AFreeMail powered by mail.fr =0A_________________________= ______________________=0A Users mailing list=0AUsers@ovirt.org=0Ahttp://= lists.ovirt.org/mailman/listinfo/users=0A=0A----------------------------= ---------------------------------------------------------------------=0A= FreeMail powered by mail.fr =0A________________________________________= _______=0A Users mailing list=0AUsers@ovirt.org=0Ahttp://lists.ovirt.org= /mailman/listinfo/users=0A=0A-------------------------------------------= ------------------------------------------------------=0AFreeMail powere= d by mail.fr =0A_______________________________________________=0A User= s mailing list=0AUsers@ovirt.org=0Ahttp://lists.ovirt.org/mailman/listin= fo/users=0A=0A----------------------------------------------------------= ---------------------------------------=0AFreeMail powered by mail.fr --=_1ba088308412248e0037104062e7fe21 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <div><span style=3D"font-family: arial, helvetica,sans-serif; font-size:= 10pt; color: #000000;">Just to be sure I hadn't altered something, I re= named "mnt" in something else and it was indeed recreated.<br /></span><= /div>=0A<p><br /><br /> Le 20-Mar-2018 16:57:22 +0100, stirabos@redhat.c= om a écrit:</p>=0A<div> </div>=0A<blockquote style=3D"margin= -left: 0; padding-left: 5px; border-left: 2px solid navy;">=0A<div dir= =3D"ltr"><br />=0A<div class=3D"gmail_extra"><br />=0A<div class=3D"gmai= l_quote">On Tue, Mar 20, 2018 at 4:12 PM, <span dir=3D"ltr"><<a href= =3D"mailto:spfma.tech@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noo= pener">spfma.tech@e.mail.fr</a>></span> wrote:<br />=0A<blockquote cl= ass=3D"gmail_quote" style=3D"margin: 0 0 0 .8ex; border-left: 1px #ccc s= olid; padding-left: 1ex;">=0A<div><span style=3D"font-family: arial, hel= vetica, sans-serif; font-size: 10pt; color: #000000;">I tried to make a= cleaner install : after cleanup, I recreated "/rhev/data-center/mnt/" a= nd ran the installer again.<br /></span></div>=0A</blockquote>=0A<div>&n= bsp;</div>=0A<div>It should be automatically created by vdsm, can you pl= ease avoid that?</div>=0A<div> </div>=0A<blockquote class=3D"gmail_= quote" style=3D"margin: 0 0 0 .8ex; border-left: 1px #ccc solid; padding= -left: 1ex;">=0A<div> </div>=0A<div> </div>=0A<div><span style= =3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #= 000000;">As you can see, it crashed again with the same access denied er= ror on this file : </span></div>=0A<div> </div>=0A<div><span style= =3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #= 000000;">[ INFO ] TASK [Copy configuration archive to storage]<br= />[ ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": true, "cmd"= : ["dd", "bs=3D20480", "count=3D1", "oflag=3Ddirect", "if=3D/var/tmp/loc= alvmVBRLpL/b1884198-69e6-4096-939d-03c87112de10", "of=3D/rhev/data-cente= r/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4f= b2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b188419= 8-69e6-4096-939d-03c87112de10"], "delta": "0:00:00.004468", "end": "2018= -03-20 15:57:34.199405", "msg": "non-zero return code", "rc": 1, "start"= : "2018-03-20 15:57:34.194937", "stderr": "dd: impossible d'ouvrir &laqu= o; /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self_= _hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-a= a57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »: Perm= ission non accordée", "stderr_lines": ["dd: impossible d'ouvrir &= laquo; /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__s= elf__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-44= 95-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »:= Permission non accordée"], "stdout": "", "stdout_lines": []}<br= />[ ERROR ] Failed to execute stage 'Closing up': Failed executing ansi= ble-playbook<br /></span></div>=0A<div> </div>=0A<div><span style= =3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #= 000000;">But the file permissions look ok to me : </span></div>=0A<div>&= nbsp;</div>=0A<div><span style=3D"font-family: arial, helvetica, sans-se= rif; font-size: 10pt; color: #000000;">-rw-rw----. 1 vdsm kvm 1,0G 20 ma= rs 2018 /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__e= ngine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768= -c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10<br /><= br /></span></div>=0A<div><span style=3D"font-family: arial, helvetica,= sans-serif; font-size: 10pt; color: #000000;">So I decided to test some= thing : I set a shell for "vdsm", so I could login : </span><= /div>=0A<div> </div>=0A<div>su - vdsm -c "touch /rhev/data-center/m= nt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-= 891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-6= 9e6-4096-939d-03c87112de10" && echo "OK"<br />OK</div>=0A<div>&n= bsp;</div>=0A<div>As far as I can see,still no permission problem<= /div>=0A<p>But if I try the same as "root" :</p>=0A<p>touch /rhev/data-c= enter/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af0= 1-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b18= 84198-69e6-4096-939d-03c87112de10 && echo "OK"<br />touch: impos= sible de faire un touch « /rhev/data-center/mnt/10.100.2.132:= _volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db338= 7/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03= c87112de10 »: Permission non accordée</p>=0A<p>Of cour= se, "root" and "vdsm" can create, touch and delete other files flawlessl= y in this share.</p>=0A<p>It looks like some kind of immutable file, but= is is not suppose to exist on NFS, does it ?</p>=0A<p>Regards</p>=0A<di= v class=3D"HOEnZb">=0A<div class=3D"h5">=0A<p> </p>=0A<p><br /><br= /> Le 20-Mar-2018 12:22:50 +0100, <a href=3D"mailto:stirabos@redhat.com= " target=3D"_blank" rel=3D"noreferrer noopener">stirabos@redhat.com</a>= a écrit:</p>=0A<div> </div>=0A<blockquote style=3D"margin-l= eft: 0; padding-left: 5px; border-left: 2px solid #000080;">=0A<div dir= =3D"ltr"><br />=0A<div class=3D"gmail_extra"><br />=0A<div class=3D"gmai= l_quote">On Tue, Mar 20, 2018 at 11:44 AM, <span dir=3D"ltr"><<a href= =3D"mailto:spfma.tech@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noo= pener">spfma.tech@e.mail.fr</a>></span> wrote:<br />=0A<blockquote cl= ass=3D"gmail_quote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px= solid #cccccc; padding-left: 1ex;">=0A<div><span style=3D"font-family:= arial, helvetica, sans-serif; font-size: 10pt; color: #000000;"> <= /span></div>=0A<div><span style=3D"font-family: arial, helvetica, sans-s= erif; font-size: 10pt; color: #000000;">Hi,<br /></span></div>=0A<div>&n= bsp;</div>=0A<div> </div>=0A<div> </div>=0A<div><span style=3D= "font-family: arial, helvetica, sans-serif; font-size: 10pt; color: #000= 000;">In fact it is a workaround coming from you I found in the bugtrack= that helped me : </span></div>=0A<div> </div>=0A<div> </div>= =0A<div> </div>=0A<div>=0A<pre id=3D"m_3642752428400018059gmail-m_-= 4123427470926593816comment_text_8" class=3D"m_3642752428400018059gmail-m= _-4123427470926593816bz_comment_text m_3642752428400018059gmail-m_-41234= 27470926593816bz_wrap_comment_text">chmod 644 /var/cache/vdsm/schema/*</= pre>=0A</div>=0A<div> </div>=0A<p>As the only thing looking like a= weird error I have found was :</p>=0A<div> </div>=0A<div> </d= iv>=0A<p>ERROR Exception raised#012Traceback (most recent call last):#01= 2 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156,= in run#012 serve_clients(log)#012 File "/usr/li= b/python2.7/site-packages/vdsm/vdsmd.py", line 103, in serve_clients#012= cif =3D clientIF.getInstance(irs, log, scheduler)#012= File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 25= 0, in getInstance#012 cls._instance =3D clientIF(irs,= log, scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/c= lientIF.py", line 144, in __init__#012 self._prepareJS= ONRPCServer()#012 File "/usr/lib/python2.7/site-packages/vdsm/clie= ntIF.py", line 307, in _prepareJSONRPCServer#012 bridg= e =3D Bridge.DynamicBridge()#012 File "/usr/lib/python2.7/site-pac= kages/vdsm/rpc/Bridge.py", line 67, in __init__#012 se= lf._schema =3D vdsmapi.Schema(paths, api_strict_mode)#012 File "/u= sr/lib/python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init= __#012 raise SchemaNotFound("Unable to find API schema= file")#012SchemaNotFound: Unable to find API schema file</p>=0A</blockq= uote>=0A<div> </div>=0A<div>Thanks, it's tracked here:</div>=0A<div= div> </div>=0A<blockquote class=3D"gmail_quote" style=3D"margin: 0p= x 0px 0px .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<= div> </div>=0A<div> </div>=0A<div>So I can go one step futher,= but the installation still fails in the end, with file permission probl= ems in datastore files (i chose NFS 4.1). I can't indeed touch or get in= formations even logged in root. But I can create and delete files in the= same directory.</div>=0A<div> </div>=0A<div>Is there a workaround= for this too ?</div>=0A</blockquote>=0A<div> </div>=0A<div>Everyth= ing should get wrote and read on the NFS export as vdsm:kvm (36:36); can= you please ensure that everything is fine with that?</div>=0A<div> = ;</div>=0A<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px= .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<div> = ;</div>=0A<div>Regards</div>=0A<p><br /><br /> Le 19-Mar-2018 17:48:41 += 0100, <a href=3D"mailto:stirabos@redhat.com" target=3D"_blank" rel=3D"no= referrer noopener">stirabos@redhat.com</a> a écrit:</p>=0A<div>&n= bsp;</div>=0A<div> </div>=0A<div> </div>=0A<blockquote style= =3D"margin-left: 0px; padding-left: 5px; border-left: 2px solid #000080;= ">=0A<div dir=3D"ltr"><br />=0A<div class=3D"gmail_extra"><br />=0A<div= class=3D"gmail_quote"><span class=3D"m_3642752428400018059gmail-">On Mo= n, Mar 19, 2018 at 4:56 PM, <span dir=3D"ltr"><<a href=3D"mailto:spfm= a.tech@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">spfma.te= ch@e.mail.fr</a>></span> wrote:<br /></span>=0A<blockquote class=3D"g= mail_quote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #c= ccccc; padding-left: 1ex;">=0A<div><span style=3D"font-family: arial, he= lvetica, sans-serif; font-size: 10pt; color: #000000;">Hi,<br /></span><= /div>=0A<div> </div>=0A<div><span style=3D"font-family: arial, helv= etica, sans-serif; font-size: 10pt; color: #000000;">I wanted to rebuild= a new hosted engine setup, as the old was corrupted (too much violent p= oweroff !)</span></div>=0A<div> </div>=0A<div><span style=3D"font-f= amily: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">S= o the server was not reinstalled, I just runned "ovirt-hosted-engine-cle= anup". The network setup generated by vdsm seems to be still in place, s= o I haven't changed anything there.</span></div>=0A<div> </div>=0A<= div><span style=3D"font-family: arial, helvetica, sans-serif; font-size:= 10pt; color: #000000;">Then I decided to update the packages to the lat= est versions avaible, rebooted the server and run "ovirt-hosted-engine-s= etup".</span></div>=0A<div> </div>=0A<div><span style=3D"font-famil= y: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">But t= he process never succeeds, as I get an error after a long time spent in= "<span class=3D"m_3642752428400018059gmail-m_-4123427470926593816gmail-= m_-3726384503116450878ansible-output-line">[ INFO ] TASK [Wait for the h= ost to be up]</span>"</span></div>=0A<div> </div>=0A<div> </di= v>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif; font= -size: 10pt; color: #000000;"><span class=3D"m_3642752428400018059gmail-= "><span class=3D"m_3642752428400018059gmail-m_-4123427470926593816gmail-= m_-3726384503116450878ansible-output-line">[ ERROR ] fatal: [localhost]:= FAILED! =3D> {"ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv= -virt-1.pfm-ad.pfm.loc", "affinity_labels": [], "auto_numa_status": "unk= nown", "certificate": {"organization": "pfm.loc", "subject": "O=3Dpfm.lo= c,CN=3Dpfm-srv-virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engi= ne/api/clusters/d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2= b8b-11e8-bc86-00163e152701"}, "comment": "", "cpu": {"speed": 0.0, "topo= logy": {}}, "device_passthrough": {"enabled": false}, "devices": [], "ex= ternal_network_provider_configurations": [], "external_status": "ok", "h= ardware_information": {"supported_rng_sources": []}, "hooks": [], "href"= : "/ovirt-engine/api/hosts/542566c4-fc85-4398-9402-10c8adaa9554", "id":= "542566c4-fc85-4398-9402-10c8adaa9554", "katello_errata": [], "kdump_st= atus": "unknown", "ksm": {"enabled": false}, "max_scheduling_memory": 0,= "memory": 0, "name": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachme= nts": [], "nics": [], "numa_nodes": [], "numa_supported": false, "os": {= "custom_kernel_cmdline": ""}, "permissions": [], "port": 54321, "power_m= anagement": {"automatic_pm_enabled": true, "enabled": false, "kdump_dete= ction": true, "pm_proxies": []}, "protocol": "stomp", "se_linux": {}, "s= pm": {"priority": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:J= 75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics":= [], "status": "non_responsive", "storage_connection_extensions": [], "s= ummary": {"total": 0}, "tags": [], "transparent_huge_pages": {"enabled":= false}, "type": "rhel", "unmanaged_networks": [], "update_available": f= alse}]}, "attempts": 120, "changed": false}<br /></span><span class=3D"m= _3642752428400018059gmail-m_-4123427470926593816gmail-m_-372638450311645= 0878ansible-output-line">[ INFO ] TASK [Remove local vm dir]<br /></span= line">[ ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": false, "= msg": "The system may not be provisioned according to the playbook resul= ts: please check the logs for the issue, fix accordingly or re-deploy fr= om scratch.n"}</span></span></div>=0A<div> </div>=0A<div> </di= v>=0A<div><span style=3D"font-family: arial, helvetica, sans-serif; font= -size: 10pt; color: #000000;">I made another try with Cockpit, it is the= same.</span></div>=0A<div> </div>=0A<div><span style=3D"font-famil= y: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">Am I= doing something wrong or is there a bug ?</span></div>=0A</blockquote>= =0A<div> </div>=0A<div>I suppose that your host was condifured with= DHCP, if so it's this one:</div>=0A<div><a href=3D"https://bugzilla.red= hat.com/1549642" target=3D"_blank" rel=3D"noreferrer noopener">https://b= ugzilla.redhat.com/1549642</a></div>=0A<div> </div>=0A<div>The fix= will come with 4.2.2.</div>=0A<div> </div>=0A<blockquote class=3D"= gmail_quote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #= cccccc; padding-left: 1ex;">=0A<div> </div>=0A<div><span style=3D"f= ont-family: arial, helvetica, sans-serif; font-size: 10pt; color: #00000= 0;">Regards</span></div>=0A<div> </div>=0A<div> </div>=0A<br /= href=3D"https://mail.fr" target=3D"_blank">mail.fr</a>=0A --=_1ba088308412248e0037104062e7fe21--

I get a lot of errors like this one :</span></div>=0A<div> </div>= =0A<div><span style=3D"font-family: arial, helvetica,sans-serif; font-si= ze: 10pt; color: #000000;">vdsm[3008]: ERROR ssl handshake: SSLError, ad= dress: ::ffff:10.100.1.100 <br /></span></div>=0A<p><span style=3D"font-= family: arial, helvetica,sans-serif; font-size: 10pt; color: #000000;">1= 0.100.1.100 is the IP of the engine vm.</span></p>=0A<p>vdsm.log i= s not more helpful :</p>=0A<p>2018-03-21 17:10:10,769+0100 ERROR (Reacto= r thread) [ProtocolDetector.SSLHandshakeDispatcher] ssl handshake: SSLEr= ror, address: ::ffff:10.100.1.100 (sslutils:258)</p>=0A<p>Is there somet= hing to update or generate after a restore ? I don't know whether keys a= nd certificates were kept or if new ones are now used.</p>=0A<p>I also t= ried to add the SSH public key showed in the GUI to the authorized_keys= on a node, even reboot, but no change.</p>=0A<p> </p>=0A<p>Regards= </p>=0A<p><br /><br /><br /> Le 20-Mar-2018 16:12:40 +0100, spfma.tech@e= .mail.fr a écrit:</p>=0A<div> </div>=0A<blockquote style=3D"= margin-left: 0; padding-left: 5px; border-left: 2px solid navy;">=0A<div= <span style=3D"font-family: arial, helvetica, sans-serif; font-size: 10=
ERROR Exception raised#012Traceback (most recent call last):#012 = File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py", line 156, in run= #012 serve_clients(log)#012 File "/usr/lib/pytho= n2.7/site-packages/vdsm/vdsmd.py", line 103, in serve_clients#012 &= nbsp; cif =3D clientIF.getInstance(irs, log, scheduler)#012 = File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", line 250, in= getInstance#012 cls._instance =3D clientIF(irs, log,= scheduler)#012 File "/usr/lib/python2.7/site-packages/vdsm/client= IF.py", line 144, in __init__#012 self._prepareJSONRPC= Server()#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.=
=0A<div>Regards</div>=0A<p><br /><br /> Le 19-Mar-2018 17:48:41 +0100,= <a href=3D"mailto:stirabos@redhat.com" target=3D"_blank" rel=3D"norefer= rer noopener">stirabos@redhat.com</a> a écrit:</p>=0A<div> <= /div>=0A<div> </div>=0A<div> </div>=0A<blockquote style=3D"mar= gin-left: 0px; padding-left: 5px; border-left: 2px solid #000080;">=0A<d= iv dir=3D"ltr"><br />=0A<div class=3D"gmail_extra"><br />=0A<div class= =3D"gmail_quote"><span class=3D"gmail-">On Mon, Mar 19, 2018 at 4:56 PM,= <span dir=3D"ltr"><<a href=3D"mailto:spfma.tech@e.mail.fr" target=3D= "_blank" rel=3D"noreferrer noopener">spfma.tech@e.mail.fr</a>></span>= wrote:<br /></span>=0A<blockquote class=3D"gmail_quote" style=3D"margin= : 0px 0px 0px .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">= =0A<div><span style=3D"font-family: arial, helvetica, sans-serif; font-s= ize: 10pt; color: #000000;">Hi,<br /></span></div>=0A<div> </div>= =0A<div><span style=3D"font-family: arial, helvetica, sans-serif; font-s= ize: 10pt; color: #000000;">I wanted to rebuild a new hosted engine setu=
Then I decided to update the packages to the latest versions avaible, r= ebooted the server and run "ovirt-hosted-engine-setup".</span></div>=0A<=
</div>=0A<div>I suppose that your host was condifured with DHCP,= if so it's this one:</div>=0A<div><a href=3D"https://bugzilla.redhat.co= m/1549642" target=3D"_blank" rel=3D"noreferrer noopener">https://bugzill= a.redhat.com/1549642</a></div>=0A<div> </div>=0A<div>The fix will c= ome with 4.2.2.</div>=0A<div> </div>=0A<blockquote class=3D"gmail_q= uote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #cccccc;=
--=_b8dfb35fb3bd146ef1bbc5d31d7bc513 Content-Type: text/plain; charset=utf-8 Content-Transfer-Encoding: quoted-printable Hi,=0A I made some progress : by allowing my NAS to map any user to ad= min (not the best for security, but it is a dedicated infrastructure), t= his weird permissions problem disappeared. Maybe a NFS bug somewhere ? I= don't know. I was able to redeploy a new hosted engine, and after a c= leanup and some other manual cleaning tasks, restore my latest backup. = So the new engine vm is able to startup, but it seems there is a probl= em for communicating with hosts. I get a lot of errors like this one := vdsm[3008]: ERROR ssl handshake: SSLError, address: ::ffff:10.100.1.1= 00 =0A=0A10.100.1.100 is the IP of the engine vm. =0A=0Avdsm.log is not= more helpful : =0A=0A2018-03-21 17:10:10,769+0100 ERROR (Reactor thread= ) [ProtocolDetector.SSLHandshakeDispatcher] ssl handshake: SSLError, add= ress: ::ffff:10.100.1.100 (sslutils:258) =0A=0AIs there something to upd= ate or generate after a restore ? I don't know whether keys and certific= ates were kept or if new ones are now used. =0A=0AI also tried to add th= e SSH public key showed in the GUI to the authorized_keys on a node, eve= n reboot, but no change. =0A=0ARegards =0A=0A Le 20-Mar-2018 16:12:40 +0= 100, spfma.tech@e.mail.fr a crit: =0A I tried to make a cleaner instal= l : after cleanup, I recreated "/rhev/data-center/mnt/" and ran the inst= aller again.=0A As you can see, it crashed again with the same access= denied error on this file : [ INFO ] TASK [Copy configuration archiv= e to storage]=0A[ ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": t= rue, "cmd": ["dd", "bs=3D20480", "count=3D1", "oflag=3Ddirect", "if=3D/v= ar/tmp/localvmVBRLpL/b1884198-69e6-4096-939d-03c87112de10", "of=3D/rhev/= data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d95= 46-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a185= 26/b1884198-69e6-4096-939d-03c87112de10"], "delta": "0:00:00.004468", "e= nd": "2018-03-20 15:57:34.199405", "msg": "non-zero return code", "rc":= 1, "start": "2018-03-20 15:57:34.194937", "stderr": "dd: impossible d'o= uvrir /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__h= osted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa5= 7-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 : Permission non acc= orde", "stderr_lines": ["dd: impossible d'ouvrir /rhev/data-center/mnt/1= 0.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e= -e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-= 4096-939d-03c87112de10 : Permission non accorde"], "stdout": "", "stdout= _lines": []}=0A[ ERROR ] Failed to execute stage 'Closing up': Failed ex= ecuting ansible-playbook=0A But the file permissions look ok to me : = -rw-rw----. 1 vdsm kvm 1,0G 20 mars 2018 /rhev/data-center/mnt/10.100= .2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e286= 83db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-= 939d-03c87112de10=0A=0A So I decided to test something : I set a shell f= or "vdsm", so I could login : su - vdsm -c "touch /rhev/data-center/m= nt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-= 891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-6= 9e6-4096-939d-03c87112de10" && echo "OK"=0AOK As far as I can see,stil= l no permission problem =0A=0ABut if I try the same as "root" : =0A=0Ato= uch /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hos= ted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-= 45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 && echo "OK"=0Atouch:= impossible de faire un touch /rhev/data-center/mnt/10.100.2.132:_volume= 3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/image= s/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112d= e10 : Permission non accorde =0A=0AOf course, "root" and "vdsm" can crea= te, touch and delete other files flawlessly in this share. =0A=0AIt look= s like some kind of immutable file, but is is not suppose to exist on NF= S, does it ? =0A=0ARegards =0A=0A Le 20-Mar-2018 12:22:50 +0100, stirabo= s@redhat.com a crit: =0A=0A On Tue, Mar 20, 2018 at 11:44 AM, wrote:= =0A=0A Hi,=0A In fact it is a workaround coming from you I found= in the bugtrack that helped me : =0A=0Achmod 644 /var/cache/vds= m/schema/* =0A=0AAs the only thing looking like a weird error I have= found was : =0A=0AERROR Exception raised#012Traceback (most recent= call last):#012 File "/usr/lib/python2.7/site-packages/vdsm/vdsmd.py",= line 156, in run#012 serve_clients(log)#012 File "/usr/lib/python2.7/si= te-packages/vdsm/vdsmd.py", line 103, in serve_clients#012 cif =3D clien= tIF.getInstance(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-p= ackages/vdsm/clientIF.py", line 250, in getInstance#012 cls._instance= =3D clientIF(irs, log, scheduler)#012 File "/usr/lib/python2.7/site-pac= kages/vdsm/clientIF.py", line 144, in __init__#012 self._prepareJSONRPCS= erver()#012 File "/usr/lib/python2.7/site-packages/vdsm/clientIF.py", li= ne 307, in _prepareJSONRPCServer#012 bridge =3D Bridge.DynamicBridge()#0= 12 File "/usr/lib/python2.7/site-packages/vdsm/rpc/Bridge.py", line 67,= in __init__#012 self._schema =3D vdsmapi.Schema(paths, api_strict_mode)= #012 File "/usr/lib/python2.7/site-packages/vdsm/api/vdsmapi.py", line 2= 17, in __init__#012 raise SchemaNotFound("Unable to find API schema file= ")#012SchemaNotFound: Unable to find API schema file Thanks, it's tra= cked here: https://bugzilla.redhat.com/1552565 A fix will come in the= next build. =0A So I can go one step futher, but the installation= still fails in the end, with file permission problems in datastore file= s (i chose NFS 4.1). I can't indeed touch or get informations even logge= d in root. But I can create and delete files in the same directory. Is= there a workaround for this too ? Everything should get wrote and re= ad on the NFS export as vdsm:kvm (36:36); can you please ensure that eve= rything is fine with that? =0A Regards =0A=0A Le 19-Mar-2018 17:48:4= 1 +0100, stirabos@redhat.com a crit: =0A=0A On Mon, Mar 19, 2018 a= t 4:56 PM, wrote:=0A=0A Hi,=0A I wanted to rebuild a new hosted engin= e setup, as the old was corrupted (too much violent poweroff !) So the= server was not reinstalled, I just runned "ovirt-hosted-engine-cleanup"= . The network setup generated by vdsm seems to be still in place, so I h= aven't changed anything there. Then I decided to update the packages t= o the latest versions avaible, rebooted the server and run "ovirt-hosted= -engine-setup". But the process never succeeds, as I get an error afte= r a long time spent in "[ INFO ] TASK [Wait for the host to be up]" = [ ERROR ] fatal: [localhost]: FAILED! =3D> {"ansible_facts": {"ovirt_ho= sts": [{"address": "pfm-srv-virt-1.pfm-ad.pfm.loc", "affinity_labels": [= ], "auto_numa_status": "unknown", "certificate": {"organization": "pfm.l= oc", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv-virt-1.pfm-ad.pfm.loc"}, "clus= ter": {"href": "/ovirt-engine/api/clusters/d6c9358e-2b8b-11e8-bc86-00163= e152701", "id": "d6c9358e-2b8b-11e8-bc86-00163e152701"}, "comment": "",= "cpu": {"speed": 0.0, "topology": {}}, "device_passthrough": {"enabled"= : false}, "devices": [], "external_network_provider_configurations": [],= "external_status": "ok", "hardware_information": {"supported_rng_source= s": []}, "hooks": [], "href": "/ovirt-engine/api/hosts/542566c4-fc85-439= 8-9402-10c8adaa9554", "id": "542566c4-fc85-4398-9402-10c8adaa9554", "kat= ello_errata": [], "kdump_status": "unknown", "ksm": {"enabled": false},= "max_scheduling_memory": 0, "memory": 0, "name": "pfm-srv-virt-1.pfm-ad= .pfm.loc", "network_attachments": [], "nics": [], "numa_nodes": [], "num= a_supported": false, "os": {"custom_kernel_cmdline": ""}, "permissions":= [], "port": 54321, "power_management": {"automatic_pm_enabled": true, "= enabled": false, "kdump_detection": true, "pm_proxies": []}, "protocol":= "stomp", "se_linux": {}, "spm": {"priority": 5, "status": "none"}, "ssh= ": {"fingerprint": "SHA256:J75BVLFnmGBGFosXzaxCRnuIYcOc75HUBQZ4pOKpDg8",= "port": 22}, "statistics": [], "status": "non_responsive", "storage_con= nection_extensions": [], "summary": {"total": 0}, "tags": [], "transpare= nt_huge_pages": {"enabled": false}, "type": "rhel", "unmanaged_networks"= : [], "update_available": false}]}, "attempts": 120, "changed": false}= =0A[ INFO ] TASK [Remove local vm dir]=0A[ INFO ] TASK [Notify the user= about a failure]=0A[ ERROR ] fatal: [localhost]: FAILED! =3D> {"changed= ": false, "msg": "The system may not be provisioned according to the pla= ybook results: please check the logs for the issue, fix accordingly or r= e-deploy from scratch.n"} I made another try with Cockpit, it is the= same. Am I doing something wrong or is there a bug ? I suppose tha= t your host was condifured with DHCP, if so it's this one: https://bugzi= lla.redhat.com/1549642 The fix will come with 4.2.2. =0A Regards = =0A=0A--------------------------------------------------------------= -----------------------------------=0AFreeMail powered by mail.fr =0A___= ____________________________________________=0A Users mailing list=0AUse= rs@ovirt.org=0Ahttp://lists.ovirt.org/mailman/listinfo/users=0A=0A------= ------------------------------------------------------------------------= -------------------=0AFreeMail powered by mail.fr =0A__________________= _____________________________=0A Users mailing list=0AUsers@ovirt.org=0A= http://lists.ovirt.org/mailman/listinfo/users=0A=0A---------------------= ------------------------------------------------------------------------= ----=0AFreeMail powered by mail.fr =0A=0A-------------------------------= ------------------------------------------------------------------=0AFre= eMail powered by mail.fr --=_b8dfb35fb3bd146ef1bbc5d31d7bc513 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: quoted-printable <div><span style=3D"font-family: arial, helvetica,sans-serif; font-size:= 10pt; color: #000000;">Hi,<br /></span></div>=0A<div> </div>=0A<di= v><span style=3D"font-family: arial, helvetica,sans-serif; font-size: 10= pt; color: #000000;">I made some progress : by allowing my NAS to map an= y user to admin (not the best for security, but it is a dedicated infras= tructure), this weird permissions problem disappeared. Maybe a NFS bug s= omewhere ? I don't know.</span></div>=0A<div> </div>=0A<div><span s= tyle=3D"font-family: arial, helvetica,sans-serif; font-size: 10pt; color= : #000000;">I was able to redeploy a new hosted engine, and after a clea= nup and some other manual cleaning tasks, restore my latest backup.</spa= n></div>=0A<div> </div>=0A<div><span style=3D"font-family: arial, h= elvetica,sans-serif; font-size: 10pt; color: #000000;">So the new engine= vm is able to startup, but it seems there is a problem for communicatin= g with hosts.</span></div>=0A<div> </div>=0A<div><span style=3D"fon= t-family: arial, helvetica,sans-serif; font-size: 10pt; color: #000000;"= pt; color: #000000;">I tried to make a cleaner install : after cleanup,= I recreated "/rhev/data-center/mnt/" and ran the installer again.<br />= </span></div>=0A<div> </div>=0A<div><span style=3D"font-family: ari= al, helvetica, sans-serif; font-size: 10pt; color: #000000;">As you can= see, it crashed again with the same access denied error on this file := </span></div>=0A<div> </div>=0A<div><span style=3D"font-family: ar= ial, helvetica, sans-serif; font-size: 10pt; color: #000000;">[ INFO&nbs= p; ] TASK [Copy configuration archive to storage]<br />[ ERROR ] fatal:= [localhost]: FAILED! =3D> {"changed": true, "cmd": ["dd", "bs=3D2048= 0", "count=3D1", "oflag=3Ddirect", "if=3D/var/tmp/localvmVBRLpL/b1884198= -69e6-4096-939d-03c87112de10", "of=3D/rhev/data-center/mnt/10.100.2.132:= _volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db338= 7/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03= c87112de10"], "delta": "0:00:00.004468", "end": "2018-03-20 15:57:34.199= 405", "msg": "non-zero return code", "rc": 1, "start": "2018-03-20 15:57= :34.194937", "stderr": "dd: impossible d'ouvrir « /rhev/data-= center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d9546-af= 01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a18526/b1= 884198-69e6-4096-939d-03c87112de10 »: Permission non accord&e= acute;e", "stderr_lines": ["dd: impossible d'ouvrir « /rhev/d= ata-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015d954= 6-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a1852= 6/b1884198-69e6-4096-939d-03c87112de10 »: Permission non acco= rdée"], "stdout": "", "stdout_lines": []}<br />[ ERROR ] Failed t= o execute stage 'Closing up': Failed executing ansible-playbook<br /></s= pan></div>=0A<div> </div>=0A<div><span style=3D"font-family: arial,= helvetica, sans-serif; font-size: 10pt; color: #000000;">But the file p= ermissions look ok to me : </span></div>=0A<div> </div>=0A<div><spa= n style=3D"font-family: arial, helvetica, sans-serif; font-size: 10pt; c= olor: #000000;">-rw-rw----. 1 vdsm kvm 1,0G 20 mars 2018 /rh= ev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine__self__hosted/015= d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-4495-aa57-45b9b2a= 18526/b1884198-69e6-4096-939d-03c87112de10<br /><br /></span></div>=0A<d= iv><span style=3D"font-family: arial, helvetica, sans-serif; font-size:= 10pt; color: #000000;">So I decided to test something : I set a s= hell for "vdsm", so I could login : </span></div>=0A<div> </di= v>=0A<div>su - vdsm -c "touch /rhev/data-center/mnt/10.100.2.132:_volume= 3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db3387/image= s/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112d= e10" && echo "OK"<br />OK</div>=0A<div> </div>=0A<div>As fa= r as I can see,still no permission problem</div>=0A<p>But if I try= the same as "root" :</p>=0A<p>touch /rhev/data-center/mnt/10.100.2.132:= _volume3_ovirt__engine__self__hosted/015d9546-af01-4fb2-891e-e28683db338= 7/images/589d0768-c935-4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03= c87112de10 && echo "OK"<br />touch: impossible de faire un touch= « /rhev/data-center/mnt/10.100.2.132:_volume3_ovirt__engine_= _self__hosted/015d9546-af01-4fb2-891e-e28683db3387/images/589d0768-c935-= 4495-aa57-45b9b2a18526/b1884198-69e6-4096-939d-03c87112de10 »= : Permission non accordée</p>=0A<p>Of course, "root" and "vdsm" c= an create, touch and delete other files flawlessly in this share.</p>=0A= <p>It looks like some kind of immutable file, but is is not suppose to e= xist on NFS, does it ?</p>=0A<p>Regards</p>=0A<p> </p>=0A<p><br /><= br /> Le 20-Mar-2018 12:22:50 +0100, stirabos@redhat.com a écrit:= </p>=0A<div> </div>=0A<blockquote style=3D"margin-left: 0; padding-= left: 5px; border-left: 2px solid #000080;">=0A<div dir=3D"ltr"><br />= =0A<div class=3D"gmail_extra"><br />=0A<div class=3D"gmail_quote">On Tue= , Mar 20, 2018 at 11:44 AM, <span dir=3D"ltr"><<a href=3D"mailto:spfm= a.tech@e.mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">spfma.te= ch@e.mail.fr</a>></span> wrote:<br />=0A<blockquote class=3D"gmail_qu= ote" style=3D"margin: 0px 0px 0px .8ex; border-left: 1px solid #cccccc;= padding-left: 1ex;">=0A<div><span style=3D"font-family: arial, helvetic= a, sans-serif; font-size: 10pt; color: #000000;"> </span></div>=0A<= div><span style=3D"font-family: arial, helvetica, sans-serif; font-size:= 10pt; color: #000000;">Hi,<br /></span></div>=0A<div> </div>=0A<di= v> </div>=0A<div> </div>=0A<div><span style=3D"font-family: ar= ial, helvetica, sans-serif; font-size: 10pt; color: #000000;">In fact it= is a workaround coming from you I found in the bugtrack that helped me= : </span></div>=0A<div> </div>=0A<div> </div>=0A<div> </= div>=0A<div>=0A<pre id=3D"gmail-m_-4123427470926593816comment_text_8" cl= ass=3D"gmail-m_-4123427470926593816bz_comment_text gmail-m_-412342747092= 6593816bz_wrap_comment_text">chmod 644 /var/cache/vdsm/schema/*</pre>=0A= </div>=0A<div> </div>=0A<p>As the only thing looking like a weird e= rror I have found was :</p>=0A<div> </div>=0A<div> </div>=0A<p= py", line 307, in _prepareJSONRPCServer#012 bridge =3D= Bridge.DynamicBridge()#012 File "/usr/lib/python2.7/site-packages= /vdsm/rpc/Bridge.py", line 67, in __init__#012 self._s= chema =3D vdsmapi.Schema(paths, api_strict_mode)#012 File "/usr/li= b/python2.7/site-packages/vdsm/api/vdsmapi.py", line 217, in __init__#01= 2 raise SchemaNotFound("Unable to find API schema file= ")#012SchemaNotFound: Unable to find API schema file</p>=0A</blockquote>= =0A<div> </div>=0A<div>Thanks, it's tracked here:</div>=0A<div><a h= ref=3D"https://bugzilla.redhat.com/1552565" target=3D"_blank" rel=3D"nor= eferrer noopener">https://bugzilla.redhat.com/1552565</a></div>=0A<div>&= nbsp;</div>=0A<div>A fix will come in the next build.</div>=0A<div> = ;</div>=0A<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px= .8ex; border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<div> = ;</div>=0A<div> </div>=0A<div>So I can go one step futher, but the= installation still fails in the end, with file permission problems in d= atastore files (i chose NFS 4.1). I can't indeed touch or get informatio= ns even logged in root. But I can create and delete files in the same di= rectory.</div>=0A<div> </div>=0A<div>Is there a workaround for this= too ?</div>=0A</blockquote>=0A<div> </div>=0A<div>Everything shoul= d get wrote and read on the NFS export as vdsm:kvm (36:36); can you plea= se ensure that everything is fine with that?</div>=0A<div> </div>= =0A<blockquote class=3D"gmail_quote" style=3D"margin: 0px 0px 0px .8ex;= border-left: 1px solid #cccccc; padding-left: 1ex;">=0A<div> </div= p, as the old was corrupted (too much violent poweroff !)</span></div>= =0A<div> </div>=0A<div><span style=3D"font-family: arial, helvetica= , sans-serif; font-size: 10pt; color: #000000;">So the server was not re= installed, I just runned "ovirt-hosted-engine-cleanup". The network setu= p generated by vdsm seems to be still in place, so I haven't changed any= thing there.</span></div>=0A<div> </div>=0A<div><span style=3D"font= -family: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;"= div> </div>=0A<div><span style=3D"font-family: arial, helvetica, sa= ns-serif; font-size: 10pt; color: #000000;">But the process never succee= ds, as I get an error after a long time spent in "<span class=3D"gmail-m= _-4123427470926593816gmail-m_-3726384503116450878ansible-output-line">[= INFO ] TASK [Wait for the host to be up]</span>"</span></div>=0A<div>&n= bsp;</div>=0A<div> </div>=0A<div><span style=3D"font-family: arial,= helvetica, sans-serif; font-size: 10pt; color: #000000;"><span class=3D= "gmail-"><span class=3D"gmail-m_-4123427470926593816gmail-m_-37263845031= 16450878ansible-output-line">[ ERROR ] fatal: [localhost]: FAILED! =3D&g= t; {"ansible_facts": {"ovirt_hosts": [{"address": "pfm-srv-virt-1.pfm-ad= .pfm.loc", "affinity_labels": [], "auto_numa_status": "unknown", "certif= icate": {"organization": "pfm.loc", "subject": "O=3Dpfm.loc,CN=3Dpfm-srv= -virt-1.pfm-ad.pfm.loc"}, "cluster": {"href": "/ovirt-engine/api/cluster= s/d6c9358e-2b8b-11e8-bc86-00163e152701", "id": "d6c9358e-2b8b-11e8-bc86-= 00163e152701"}, "comment": "", "cpu": {"speed": 0.0, "topology": {}}, "d= evice_passthrough": {"enabled": false}, "devices": [], "external_network= _provider_configurations": [], "external_status": "ok", "hardware_inform= ation": {"supported_rng_sources": []}, "hooks": [], "href": "/ovirt-engi= ne/api/hosts/542566c4-fc85-4398-9402-10c8adaa9554", "id": "542566c4-fc85= -4398-9402-10c8adaa9554", "katello_errata": [], "kdump_status": "unknown= ", "ksm": {"enabled": false}, "max_scheduling_memory": 0, "memory": 0, "= name": "pfm-srv-virt-1.pfm-ad.pfm.loc", "network_attachments": [], "nics= ": [], "numa_nodes": [], "numa_supported": false, "os": {"custom_kernel_= cmdline": ""}, "permissions": [], "port": 54321, "power_management": {"a= utomatic_pm_enabled": true, "enabled": false, "kdump_detection": true, "= pm_proxies": []}, "protocol": "stomp", "se_linux": {}, "spm": {"priority= ": 5, "status": "none"}, "ssh": {"fingerprint": "SHA256:J75BVLFnmGBGFosX= zaxCRnuIYcOc75HUBQZ4pOKpDg8", "port": 22}, "statistics": [], "status": "= non_responsive", "storage_connection_extensions": [], "summary": {"total= ": 0}, "tags": [], "transparent_huge_pages": {"enabled": false}, "type":= "rhel", "unmanaged_networks": [], "update_available": false}]}, "attemp= ts": 120, "changed": false}<br /></span><span class=3D"gmail-m_-41234274= 70926593816gmail-m_-3726384503116450878ansible-output-line">[ INFO ] TAS= K [Remove local vm dir]<br /></span><span class=3D"gmail-m_-412342747092= 6593816gmail-m_-3726384503116450878ansible-output-line">[ INFO ] TASK [N= otify the user about a failure]<br /></span></span><span class=3D"gmail-= m_-4123427470926593816gmail-m_-3726384503116450878ansible-output-line">[= ERROR ] fatal: [localhost]: FAILED! =3D> {"changed": false, "msg": "= The system may not be provisioned according to the playbook results: ple= ase check the logs for the issue, fix accordingly or re-deploy from scra= tch.n"}</span></span></div>=0A<div> </div>=0A<div> </div>=0A<d= iv><span style=3D"font-family: arial, helvetica, sans-serif; font-size:= 10pt; color: #000000;">I made another try with Cockpit, it is the same.= </span></div>=0A<div> </div>=0A<div><span style=3D"font-family: ari= al, helvetica, sans-serif; font-size: 10pt; color: #000000;">Am I doing= something wrong or is there a bug ?</span></div>=0A</blockquote>=0A<div= padding-left: 1ex;">=0A<div> </div>=0A<div><span style=3D"font-fam= ily: arial, helvetica, sans-serif; font-size: 10pt; color: #000000;">Reg= ards</span></div>=0A<div> </div>=0A<div> </div>=0A<br /><hr />= FreeMail powered by <a href=3D"https://mail.fr" target=3D"_blank" rel=3D= "noreferrer noopener">mail.fr</a> <br /><span class=3D"gmail-">_________= ______________________________________<br /> Users mailing list<br /><a= href=3D"mailto:Users@ovirt.org" target=3D"_blank" rel=3D"noreferrer noo= pener">Users@ovirt.org</a><br /><a href=3D"http://lists.ovirt.org/mailma= n/listinfo/users" target=3D"_blank" rel=3D"noreferrer noopener">http://l= ists.ovirt.org/mailman/listinfo/users</a><br /><br /></span></blockquote=
=0A</div>=0A</div>=0A</div>=0A</blockquote>=0A<div class=3D"gmail-HOEnZ= b">=0A<div class=3D"gmail-h5"><br /><hr />FreeMail powered by <a href=3D= "https://mail.fr" target=3D"_blank" rel=3D"noreferrer noopener">mail.fr<= /a></div>=0A</div>=0A<br />_____________________________________________= __<br /> Users mailing list<br /><a href=3D"mailto:Users@ovirt.org" targ= et=3D"_blank" rel=3D"noreferrer noopener">Users@ovirt.org</a><br /><a hr= ef=3D"http://lists.ovirt.org/mailman/listinfo/users" target=3D"_blank" r= el=3D"noreferrer noopener">http://lists.ovirt.org/mailman/listinfo/users= </a><br /><br /></blockquote>=0A</div>=0A</div>=0A</div>=0A</blockquote>= =0A<br /><hr />FreeMail powered by <a href=3D"https://mail.fr" target=3D= "_blank" rel=3D"noreferrer noopener">mail.fr</a></blockquote>=0A = <br/><hr>FreeMail powered by <a href=3D"https://mail.fr" ta= rget=3D"_blank">mail.fr</a>=0A
--=_b8dfb35fb3bd146ef1bbc5d31d7bc513--
participants (2)
-
Simone Tiraboschi
-
spfma.tech@e.mail.fr