
Hey Gervais, Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on. On 16-05-12 04:49 PM, users-request@ovirt.org wrote:
Message: 1 Date: Thu, 12 May 2016 14:22:12 -0300 From: Gervais de Montbrun <gervais@demontbrun.com> To: Wee Sritippho <wee.s@forest.go.th> Cc: users <users@ovirt.org> Subject: Re: [ovirt-users] Adding another host to my cluster Message-ID: <28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com> Content-Type: text/plain; charset="utf-8"
Hi Wee (and others)
Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(
I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here (https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.
I did: hosted-engine --clean-metadata --force-cleanup --host-id=3 run the bash script to remove all of the ovirt packages and config files reinstalled ovirt-hosted-engine-setup ran "hosted-engine --deploy"
I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?
Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.
Cheers, Gervais
On May 12, 2016, at 11:50 AM, Wee Sritippho <wee.s@forest.go.th> wrote:
Hi,
I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> ) which would be fixed in 3.6.6.
This is what I've done to solve it:
1. In the web UI, set the failed host to maintenance. 2. Remove it. 3. In that host, run a script from https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install> 4. Install ovirt-hosted-engine-setup again. 5. Redeploy again.
Hope that helps
On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <gervais@demontbrun.com> wrote: Hi Folks,
I hate to reply to my own message, but I'm really hoping someone can help me with my issue http://lists.ovirt.org/pipermail/users/2016-May/039690.html <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.
Cheers, Gervais
On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote:
Hi All,
I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.
My setup: CentOS 7 clean install yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm> yum install -y ovirt-hosted-engine-setup yum upgrade -y && reboot systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld hosted-engine --deploy
hosted-engine --deploy always throws an error: [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs. [ ERROR ] Unable to add Cultivar2 to the manager and then echo's [ INFO ] Waiting for VDSM hardware info ... [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds [ INFO ] Stage: Clean up [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf' [ INFO ] Stage: Pre-termination [ INFO ] Stage: Termination [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
Full output of hosted-engine --deploy included in the attached zip file. I've also included vdsm.log (There is more than one tries worth of tries in there). You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.
This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed" <PastedGraphic-1.png>
Trying to reinstall from within the interface just fails again.
The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.
OS: [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (Core)
[root@cultivar2 ovirt-hosted-engine-setup]# uname -a Linux cultivar2.grove.silverorange.com <http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
Versions: [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt libgovirt-0.3.3-1.el7_2.1.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch ovirt-host-deploy-1.4.1-1.el7.centos.noarch ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch ovirt-release36-007-1.noarch ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64 virt-viewer-2.0-6.el7.x86_64 libgovirt-0.3.3-1.el7_2.1.x86_64 libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch fence-virt-0.3.2-2.el7.x86_64 virt-what-1.13-6.el7.x86_64 libvirt-python-1.2.17-2.el7.x86_64 libvirt-daemon-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64 libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64 ovirt-host-deploy-1.4.1-1.el7.centos.noarch virt-v2v-1.28.1-1.55.el7.centos.2.x86_64 ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch libvirt-client-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64 ovirt-release36-007-1.noarch libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64 ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this: <PastedGraphic-2.png>
I'd appreciate any help that can be offered.
Cheers, Gervais
Gervais de Montbrun Systems Administrator / silverorange Inc.
Phone +1 902 367 4532 ext. 104 <tel:+1 902 367 4532 ext. 104> Mobile +1 902 978 0009 <tel:+1 902 978 0009>
<hosted-engine--deploy-logs.zip>
Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
-- Wee

Hi Charles, Thanks for the suggestion. I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup). I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error: [ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding I disabled and stopped NetworkManager and tried again. Same result. :( Any more guesses? Cheers, Gervais On May 12, 2016, at 9:08 PM, Charles Tassell <ctassell@gmail.com> wrote: Hey Gervais, Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on. On 16-05-12 04:49 PM, users-request@ovirt.org wrote: Message: 1 Date: Thu, 12 May 2016 14:22:12 -0300 From: Gervais de Montbrun <gervais@demontbrun.com> To: Wee Sritippho <wee.s@forest.go.th> Cc: users <users@ovirt.org> Subject: Re: [ovirt-users] Adding another host to my cluster Message-ID: <28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com> Content-Type: text/plain; charset="utf-8" Hi Wee (and others) Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-( I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here ( https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... < https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know. I did: hosted-engine --clean-metadata --force-cleanup --host-id=3 run the bash script to remove all of the ovirt packages and config files reinstalled ovirt-hosted-engine-setup ran "hosted-engine --deploy" I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy? Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5. Cheers, Gervais On May 12, 2016, at 11:50 AM, Wee Sritippho <wee.s@forest.go.th> wrote: Hi, I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> ) which would be fixed in 3.6.6. This is what I've done to solve it: 1. In the web UI, set the failed host to maintenance. 2. Remove it. 3. In that host, run a script from https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... < https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail...
4. Install ovirt-hosted-engine-setup again. 5. Redeploy again. Hope that helps On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <gervais@demontbrun.com> wrote: Hi Folks, I hate to reply to my own message, but I'm really hoping someone can help me with my issue http://lists.ovirt.org/pipermail/users/2016-May/039690.html < http://lists.ovirt.org/pipermail/users/2016-May/039690.html> Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise. Cheers, Gervais On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: Hi All, I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine. My setup: CentOS 7 clean install yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm < http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm> yum install -y ovirt-hosted-engine-setup yum upgrade -y && reboot systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld hosted-engine --deploy hosted-engine --deploy always throws an error: [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs. [ ERROR ] Unable to add Cultivar2 to the manager and then echo's [ INFO ] Waiting for VDSM hardware info ... [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds [ INFO ] Stage: Clean up [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf' [ INFO ] Stage: Pre-termination [ INFO ] Stage: Termination [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log Full output of hosted-engine --deploy included in the attached zip file. I've also included vdsm.log (There is more than one tries worth of tries in there). You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above. This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed" <PastedGraphic-1.png> Trying to reinstall from within the interface just fails again. The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt. OS: [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (Core) [root@cultivar2 ovirt-hosted-engine-setup]# uname -a Linux cultivar2.grove.silverorange.com < http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux Versions: [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt libgovirt-0.3.3-1.el7_2.1.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch ovirt-host-deploy-1.4.1-1.el7.centos.noarch ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch ovirt-release36-007-1.noarch ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64 virt-viewer-2.0-6.el7.x86_64 libgovirt-0.3.3-1.el7_2.1.x86_64 libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch fence-virt-0.3.2-2.el7.x86_64 virt-what-1.13-6.el7.x86_64 libvirt-python-1.2.17-2.el7.x86_64 libvirt-daemon-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64 libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64 ovirt-host-deploy-1.4.1-1.el7.centos.noarch virt-v2v-1.28.1-1.55.el7.centos.2.x86_64 ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch libvirt-client-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64 ovirt-release36-007-1.noarch libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64 ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this: <PastedGraphic-2.png> I'd appreciate any help that can be offered. Cheers, Gervais Gervais de Montbrun Systems Administrator / silverorange Inc. Phone +1 902 367 4532 ext. 104 <tel:+1 902 367 4532 ext. 104> Mobile +1 902 978 0009 <tel:+1 902 978 0009> <hosted-engine--deploy-logs.zip> Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users < http://lists.ovirt.org/mailman/listinfo/users> -- Wee _______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

This is a multi-part message in MIME format. --------------050607040809020004050901 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Hi Gervais, Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are installing on) and put them somewhere to look at? Also, I found that starting VDSM from the command line is useful as it sometimes spits out error messages that don't show up in the logs. I think the command I used was: su -s /bin/bash vdsm python /usr/share/vdsm/vdsm My problem was that I customized the logging settings in /etc/vdsm/*conf to try and tone down the debugging stuff and had a syntax error. On 16-05-12 10:24 PM, Gervais de Montbrun wrote:
Hi Charles,
Thanks for the suggestion.
I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup).
I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error: [ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding
I disabled and stopped NetworkManager and tried again. Same result. :(
Any more guesses?
Cheers, Gervais
On May 12, 2016, at 9:08 PM, Charles Tassell <ctassell@gmail.com <mailto:ctassell@gmail.com>> wrote:
Hey Gervais,
Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on.
On 16-05-12 04:49 PM, users-request@ovirt.org <mailto:users-request@ovirt.org> wrote:
Message: 1 Date: Thu, 12 May 2016 14:22:12 -0300 From: Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> To: Wee Sritippho <wee.s@forest.go.th <mailto:wee.s@forest.go.th>> Cc: users <users@ovirt.org <mailto:users@ovirt.org>> Subject: Re: [ovirt-users] Adding another host to my cluster Message-ID: <28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com <mailto:28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com>> Content-Type: text/plain; charset="utf-8"
Hi Wee (and others)
Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(
I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here (https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.
I did: hosted-engine --clean-metadata --force-cleanup --host-id=3 run the bash script to remove all of the ovirt packages and config files reinstalled ovirt-hosted-engine-setup ran "hosted-engine --deploy"
I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?
Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.
Cheers, Gervais
On May 12, 2016, at 11:50 AM, Wee Sritippho <wee.s@forest.go.th <mailto:wee.s@forest.go.th>> wrote:
Hi,
I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> ) which would be fixed in 3.6.6.
This is what I've done to solve it:
1. In the web UI, set the failed host to maintenance. 2. Remove it. 3. In that host, run a script from https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install> 4. Install ovirt-hosted-engine-setup again. 5. Redeploy again.
Hope that helps
On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: Hi Folks,
I hate to reply to my own message, but I'm really hoping someone can help me with my issue http://lists.ovirt.org/pipermail/users/2016-May/039690.html <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.
Cheers, Gervais
On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com> <mailto:gervais@demontbrun.com <mailto:gervais@demontbrun.com>>> wrote:
Hi All,
I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.
My setup: CentOS 7 clean install yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm> yum install -y ovirt-hosted-engine-setup yum upgrade -y && reboot systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld hosted-engine --deploy
hosted-engine --deploy always throws an error: [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs. [ ERROR ] Unable to add Cultivar2 to the manager and then echo's [ INFO ] Waiting for VDSM hardware info ... [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds [ INFO ] Stage: Clean up [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf' [ INFO ] Stage: Pre-termination [ INFO ] Stage: Termination [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
Full output of hosted-engine --deploy included in the attached zip file. I've also included vdsm.log (There is more than one tries worth of tries in there). You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.
This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed" <PastedGraphic-1.png>
Trying to reinstall from within the interface just fails again.
The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.
OS: [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (Core)
[root@cultivar2 ovirt-hosted-engine-setup]# uname -a Linux cultivar2.grove.silverorange.com <http://cultivar2.grove.silverorange.com> <http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
Versions: [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt libgovirt-0.3.3-1.el7_2.1.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch ovirt-host-deploy-1.4.1-1.el7.centos.noarch ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch ovirt-release36-007-1.noarch ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64 virt-viewer-2.0-6.el7.x86_64 libgovirt-0.3.3-1.el7_2.1.x86_64 libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch fence-virt-0.3.2-2.el7.x86_64 virt-what-1.13-6.el7.x86_64 libvirt-python-1.2.17-2.el7.x86_64 libvirt-daemon-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64 libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64 ovirt-host-deploy-1.4.1-1.el7.centos.noarch virt-v2v-1.28.1-1.55.el7.centos.2.x86_64 ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch libvirt-client-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64 ovirt-release36-007-1.noarch libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64 ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this: <PastedGraphic-2.png>
I'd appreciate any help that can be offered.
Cheers, Gervais
Gervais de Montbrun Systems Administrator / silverorange Inc.
Phone +1 902 367 4532 ext. 104 <tel:%2B1%20902%20367%204532%20ext.%20104> <tel:+1 902 367 4532 ext. 104 <tel:%2B1%20902%20367%204532%20ext.%20104>> Mobile +1 902 978 0009 <tel:%2B1%20902%20978%200009> <tel:+1 902 978 0009 <tel:%2B1%20902%20978%200009>>
<hosted-engine--deploy-logs.zip>
Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
-- Wee
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
--------------050607040809020004050901 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html> <head> <meta content="text/html; charset=utf-8" http-equiv="Content-Type"> </head> <body text="#000000" bgcolor="#FFFFFF"> <div class="moz-cite-prefix">Hi Gervais,<br> <br> Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are installing on) and put them somewhere to look at? Also, I found that starting VDSM from the command line is useful as it sometimes spits out error messages that don't show up in the logs. I think the command I used was:<br> su -s /bin/bash vdsm<br> python /usr/share/vdsm/vdsm<br> <br> My problem was that I customized the logging settings in /etc/vdsm/*conf to try and tone down the debugging stuff and had a syntax error.<br> <br> On 16-05-12 10:24 PM, Gervais de Montbrun wrote:<br> </div> <blockquote cite="mid:CAESCRhP_6k3m45726-QFHsxv-be+LNWQNeQ+_XPc2sLnLrZ8PA@mail.gmail.com" type="cite"> <div dir="ltr"> <div dir="auto" style="word-wrap:break-word">Hi Charles,<br> <br> Thanks for the suggestion.<br> <br> I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup).<br> <br> I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error:<br> <span style="white-space:pre-wrap"> </span>[ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding <br> <br> I disabled and stopped NetworkManager and tried again. Same result. :(<br> <br> Any more guesses?<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 12, 2016, at 9:08 PM, Charles Tassell <<a moz-do-not-send="true" href="mailto:ctassell@gmail.com" target="_blank">ctassell@gmail.com</a>> wrote:<br> <br> Hey Gervais,<br> <br> Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on.<br> <br> On 16-05-12 04:49 PM, <a moz-do-not-send="true" href="mailto:users-request@ovirt.org" target="_blank">users-request@ovirt.org</a> wrote:<br> <blockquote type="cite">Message: 1<br> Date: Thu, 12 May 2016 14:22:12 -0300<br> From: Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a>><br> To: Wee Sritippho <<a moz-do-not-send="true" href="mailto:wee.s@forest.go.th" target="_blank">wee.s@forest.go.th</a>><br> Cc: users <<a moz-do-not-send="true" href="mailto:users@ovirt.org" target="_blank">users@ovirt.org</a>><br> Subject: Re: [ovirt-users] Adding another host to my cluster<br> Message-ID: <<a moz-do-not-send="true" href="mailto:28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com" target="_blank">28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com</a>><br> Content-Type: text/plain; charset="utf-8"<br> <br> Hi Wee<br> (and others)<br> <br> Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(<br> <br> I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here (<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank"><a class="moz-txt-link-freetext" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a></a> <<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a>>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.<br> <br> I did:<br> hosted-engine --clean-metadata --force-cleanup --host-id=3<br> run the bash script to remove all of the ovirt packages and config files<br> reinstalled ovirt-hosted-engine-setup<br> ran "hosted-engine --deploy"<br> <br> I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?<br> <br> Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 12, 2016, at 11:50 AM, Wee Sritippho <<a moz-do-not-send="true" href="mailto:wee.s@forest.go.th" target="_blank">wee.s@forest.go.th</a>> wrote:<br> <br> Hi,<br> <br> I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( <a moz-do-not-send="true" href="https://bugzilla.redhat.com/1323465" target="_blank"><a class="moz-txt-link-freetext" href="https://bugzilla.redhat.com/1323465">https://bugzilla.redhat.com/1323465</a></a> <<a moz-do-not-send="true" href="https://bugzilla.redhat.com/1323465" target="_blank">https://bugzilla.redhat.com/1323465</a>> ) which would be fixed in 3.6.6.<br> <br> This is what I've done to solve it:<br> <br> 1. In the web UI, set the failed host to maintenance.<br> 2. Remove it.<br> 3. In that host, run a script from <a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank"><a class="moz-txt-link-freetext" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a></a> <<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a>><br> 4. Install ovirt-hosted-engine-setup again.<br> 5. Redeploy again.<br> <br> Hope that helps<br> <br> On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:gervais@demontbrun.com">gervais@demontbrun.com</a></a>> wrote:<br> Hi Folks,<br> <br> I hate to reply to my own message, but I'm really hoping someone can help me with my issue<br> <a moz-do-not-send="true" href="http://lists.ovirt.org/pipermail/users/2016-May/039690.html" target="_blank">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a> <<a moz-do-not-send="true" href="http://lists.ovirt.org/pipermail/users/2016-May/039690.html" target="_blank">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a>><br> <br> Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 9, 2016, at 1:42 PM, Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a> <mailto:<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a>>> wrote:<br> <br> Hi All,<br> <br> I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.<br> <br> My setup:<br> CentOS 7 clean install<br> yum install -y <a moz-do-not-send="true" href="http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm" target="_blank">http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm</a> <<a moz-do-not-send="true" href="http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm" target="_blank">http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm</a>><br> yum install -y ovirt-hosted-engine-setup<br> yum upgrade -y && reboot<br> systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld<br> hosted-engine --deploy<br> <br> hosted-engine --deploy always throws an error:<br> [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs.<br> [ ERROR ] Unable to add Cultivar2 to the manager<br> and then echo's<br> [ INFO ] Waiting for VDSM hardware info<br> ...<br> [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds<br> [ INFO ] Stage: Clean up<br> [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf'<br> [ INFO ] Stage: Pre-termination<br> [ INFO ] Stage: Termination<br> [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy<br> Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log<br> <br> Full output of hosted-engine --deploy included in the attached zip file.<br> I've also included vdsm.log (There is more than one tries worth of tries in there).<br> You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.<br> <br> This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed"<br> <PastedGraphic-1.png><br> <br> Trying to reinstall from within the interface just fails again.<br> <br> The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.<br> <br> OS:<br> [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release<br> CentOS Linux release 7.2.1511 (Core)<br> <br> [root@cultivar2 ovirt-hosted-engine-setup]# uname -a<br> Linux <a moz-do-not-send="true" href="http://cultivar2.grove.silverorange.com" target="_blank">cultivar2.grove.silverorange.com</a> <<a moz-do-not-send="true" href="http://cultivar2.grove.silverorange.com/" target="_blank">http://cultivar2.grove.silverorange.com/</a>> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux<br> <br> Versions:<br> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt<br> libgovirt-0.3.3-1.el7_2.1.x86_64<br> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch<br> ovirt-host-deploy-1.4.1-1.el7.centos.noarch<br> ovirt-vmconsole-1.0.0-1.el7.centos.noarch<br> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch<br> ovirt-release36-007-1.noarch<br> ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch<br> ovirt-setup-lib-1.0.1-1.el7.centos.noarch<br> ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt<br> libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64<br> virt-viewer-2.0-6.el7.x86_64<br> libgovirt-0.3.3-1.el7_2.1.x86_64<br> libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64<br> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch<br> fence-virt-0.3.2-2.el7.x86_64<br> virt-what-1.13-6.el7.x86_64<br> libvirt-python-1.2.17-2.el7.x86_64<br> libvirt-daemon-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64<br> libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64<br> ovirt-host-deploy-1.4.1-1.el7.centos.noarch<br> virt-v2v-1.28.1-1.55.el7.centos.2.x86_64<br> ovirt-vmconsole-1.0.0-1.el7.centos.noarch<br> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch<br> libvirt-client-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64<br> ovirt-release36-007-1.noarch<br> libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64<br> ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch<br> ovirt-setup-lib-1.0.1-1.el7.centos.noarch<br> ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch<br> <br> I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this:<br> <PastedGraphic-2.png><br> <br> I'd appreciate any help that can be offered.<br> <br> Cheers,<br> Gervais<br> <br> <br> Gervais de Montbrun<br> Systems Administrator / silverorange Inc.<br> <br> Phone <span style="white-space:pre-wrap"> </span><a moz-do-not-send="true" href="tel:%2B1%20902%20367%204532%20ext.%20104" value="+19023674532" target="_blank">+1 902 367 4532 ext. 104</a> <tel:<a moz-do-not-send="true" href="tel:%2B1%20902%20367%204532%20ext.%20104" value="+19023674532" target="_blank">+1 902 367 4532 ext. 104</a>><br> Mobile <span style="white-space:pre-wrap"> </span><a moz-do-not-send="true" href="tel:%2B1%20902%20978%200009" value="+19029780009" target="_blank">+1 902 978 0009</a> <tel:<a moz-do-not-send="true" href="tel:%2B1%20902%20978%200009" value="+19029780009" target="_blank">+1 902 978 0009</a>><br> <br> <hosted-engine--deploy-logs.zip><br> </blockquote> <br> <br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a> <<a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a>><br> <br> -- <br> Wee<br> </blockquote> <br> </blockquote> <br> _______________________________________________<br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a><br> </blockquote> <br> </div> </div> </blockquote> <br> </body> </html> --------------050607040809020004050901--

Hi Charles, I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup. I put the logs here: https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0 hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it failed vdsm.tar.gz # /var/log/vdsm Output from running vdsm from the command line: [root@cultivar2 log]# su -s /bin/bash vdsm [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> = ''; <rc> = 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=60, tries=3) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect conn = Client(self._address, authkey=self._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client c = SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory Killed Thanks for the help. It's really appreciated. Cheers, Gervais On Fri, May 13, 2016 at 12:55 AM, Charles Tassell <ctassell@gmail.com> wrote:
Hi Gervais,
Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are installing on) and put them somewhere to look at? Also, I found that starting VDSM from the command line is useful as it sometimes spits out error messages that don't show up in the logs. I think the command I used was: su -s /bin/bash vdsm python /usr/share/vdsm/vdsm
My problem was that I customized the logging settings in /etc/vdsm/*conf to try and tone down the debugging stuff and had a syntax error.
On 16-05-12 10:24 PM, Gervais de Montbrun wrote:
Hi Charles,
Thanks for the suggestion.
I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup).
I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error: [ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding
I disabled and stopped NetworkManager and tried again. Same result. :(
Any more guesses?
Cheers, Gervais
On May 12, 2016, at 9:08 PM, Charles Tassell <ctassell@gmail.com> wrote:
Hey Gervais,
Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on.
On 16-05-12 04:49 PM, users-request@ovirt.org wrote:
Message: 1 Date: Thu, 12 May 2016 14:22:12 -0300 From: Gervais de Montbrun <gervais@demontbrun.com> To: Wee Sritippho <wee.s@forest.go.th> Cc: users <users@ovirt.org> Subject: Re: [ovirt-users] Adding another host to my cluster Message-ID: <28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com> Content-Type: text/plain; charset="utf-8"
Hi Wee (and others)
Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(
I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here ( <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install> https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... < https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.
I did: hosted-engine --clean-metadata --force-cleanup --host-id=3 run the bash script to remove all of the ovirt packages and config files reinstalled ovirt-hosted-engine-setup ran "hosted-engine --deploy"
I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?
Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.
Cheers, Gervais
On May 12, 2016, at 11:50 AM, Wee Sritippho <wee.s@forest.go.th> wrote:
Hi,
I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( <https://bugzilla.redhat.com/1323465>https://bugzilla.redhat.com/1323465 < https://bugzilla.redhat.com/1323465> ) which would be fixed in 3.6.6.
This is what I've done to solve it:
1. In the web UI, set the failed host to maintenance. 2. Remove it. 3. In that host, run a script from <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install> https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... < https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail...
4. Install ovirt-hosted-engine-setup again. 5. Redeploy again.
Hope that helps
On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun < <gervais@demontbrun.com>gervais@demontbrun.com> wrote: Hi Folks,
I hate to reply to my own message, but I'm really hoping someone can help me with my issue http://lists.ovirt.org/pipermail/users/2016-May/039690.html < http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.
Cheers, Gervais
On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote:
Hi All,
I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.
My setup: CentOS 7 clean install yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm> yum install -y ovirt-hosted-engine-setup yum upgrade -y && reboot systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld hosted-engine --deploy
hosted-engine --deploy always throws an error: [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs. [ ERROR ] Unable to add Cultivar2 to the manager and then echo's [ INFO ] Waiting for VDSM hardware info ... [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds [ INFO ] Stage: Clean up [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf' [ INFO ] Stage: Pre-termination [ INFO ] Stage: Termination [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
Full output of hosted-engine --deploy included in the attached zip file. I've also included vdsm.log (There is more than one tries worth of tries in there). You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.
This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed" <PastedGraphic-1.png>
Trying to reinstall from within the interface just fails again.
The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.
OS: [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (Core)
[root@cultivar2 ovirt-hosted-engine-setup]# uname -a Linux cultivar2.grove.silverorange.com < http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
Versions: [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt libgovirt-0.3.3-1.el7_2.1.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch ovirt-host-deploy-1.4.1-1.el7.centos.noarch ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch ovirt-release36-007-1.noarch ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64 virt-viewer-2.0-6.el7.x86_64 libgovirt-0.3.3-1.el7_2.1.x86_64 libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch fence-virt-0.3.2-2.el7.x86_64 virt-what-1.13-6.el7.x86_64 libvirt-python-1.2.17-2.el7.x86_64 libvirt-daemon-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64 libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64 ovirt-host-deploy-1.4.1-1.el7.centos.noarch virt-v2v-1.28.1-1.55.el7.centos.2.x86_64 ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch libvirt-client-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64 ovirt-release36-007-1.noarch libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64 ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this: <PastedGraphic-2.png>
I'd appreciate any help that can be offered.
Cheers, Gervais
Gervais de Montbrun Systems Administrator / silverorange Inc.
Phone +1 902 367 4532 ext. 104 <%2B1%20902%20367%204532%20ext.%20104> <tel:+1 902 367 4532 ext. 104 <%2B1%20902%20367%204532%20ext.%20104>> Mobile +1 902 978 0009 <tel:+1 902 978 0009>
<hosted-engine--deploy-logs.zip>
Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users < http://lists.ovirt.org/mailman/listinfo/users>
-- Wee
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

This is a multi-part message in MIME format. --------------030000020408000408030009 Content-Type: text/plain; charset=utf-8; format=flowed Content-Transfer-Encoding: 7bit Hi Gervais, Okay, I see two problems: there are some leftover direcyories causing issues and for some reason VDSM seems to be trying to bind to a port something is already running on (probably an older version of VDSM.) Try removing the duplicate dirs (rmdir /var/run/vdsm/storage/248f46f0-d793-4581-9810-c9d965e2f286 and /rhev/data-center/mnt - if they aren't empty don't rm -rf them because they might be mounted from your production servers. Just mv -i them to /root or somewhere.) Next shutdown the vdsm service with "service vdsm stop" (I think, might be service stop vdsm, I don't use CentOS much) and kill any running vdsm processes (ps ax |grep vdsm) The error that I saw was: MainThread::ERROR::2016-05-13 08:58:38,262::clientIF::128::vds::(__init__) failed to init clientIF, shutting down storage dispatcher MainThread::ERROR::2016-05-13 08:58:38,289::vdsm::171::vds::(run) Exception raised Traceback (most recent call last): File "/usr/share/vdsm/vdsm", line 169, in run serve_clients(log) File "/usr/share/vdsm/vdsm", line 102, in serve_clients cif = clientIF.getInstance(irs, log, scheduler) File "/usr/share/vdsm/clientIF.py", line 193, in getInstance cls._instance = clientIF(irs, log, scheduler) File "/usr/share/vdsm/clientIF.py", line 123, in __init__ self._createAcceptor(host, port) File "/usr/share/vdsm/clientIF.py", line 201, in _createAcceptor port, sslctx) File "/usr/share/vdsm/protocoldetector.py", line 170, in __init__ sock = _create_socket(host, port) File "/usr/share/vdsm/protocoldetector.py", line 40, in _create_socket server_socket.bind(addr[0][4]) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 98] Address already in use If you get the same error, do a netstat -lnp and compare it to the same from a working box to see if something else is running on the VDSM port. On 2016-05-13 09:37 AM, Gervais de Montbrun wrote:
Hi Charles,
I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup.
I put the logs here: https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0
hosted-engine--deploy-logs.zip# Logs from when I tried to deploy and it failed vdsm.tar.gz# /var/log/vdsm
Output from running vdsm from the command line:
[root@cultivar2 log]# su -s /bin/bash vdsm [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com <http://cultivar2.grove.silverorange.com/> (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> = ''; <rc> = 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 <http://0.0.0.0:54321> Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=60, tries=3) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect conn = Client(self._address, authkey=self._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client c = SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory Killed
Thanks for the help. It's really appreciated.
Cheers, Gervais
On Fri, May 13, 2016 at 12:55 AM, Charles Tassell <ctassell@gmail.com <mailto:ctassell@gmail.com>> wrote:
Hi Gervais,
Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are installing on) and put them somewhere to look at? Also, I found that starting VDSM from the command line is useful as it sometimes spits out error messages that don't show up in the logs. I think the command I used was: su -s /bin/bash vdsm python /usr/share/vdsm/vdsm
My problem was that I customized the logging settings in /etc/vdsm/*conf to try and tone down the debugging stuff and had a syntax error.
On 16-05-12 10:24 PM, Gervais de Montbrun wrote:
Hi Charles,
Thanks for the suggestion.
I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup).
I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error: [ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding
I disabled and stopped NetworkManager and tried again. Same result. :(
Any more guesses?
Cheers, Gervais
On May 12, 2016, at 9:08 PM, Charles Tassell <ctassell@gmail.com <mailto:ctassell@gmail.com>> wrote:
Hey Gervais,
Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on.
On 16-05-12 04:49 PM, users-request@ovirt.org <mailto:users-request@ovirt.org> wrote:
Message: 1 Date: Thu, 12 May 2016 14:22:12 -0300 From: Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> To: Wee Sritippho <wee.s@forest.go.th <mailto:wee.s@forest.go.th>> Cc: users <users@ovirt.org <mailto:users@ovirt.org>> Subject: Re: [ovirt-users] Adding another host to my cluster Message-ID: <28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com <mailto:28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com>> Content-Type: text/plain; charset="utf-8"
Hi Wee (and others)
Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(
I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here (https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.
I did: hosted-engine --clean-metadata --force-cleanup --host-id=3 run the bash script to remove all of the ovirt packages and config files reinstalled ovirt-hosted-engine-setup ran "hosted-engine --deploy"
I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?
Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.
Cheers, Gervais
On May 12, 2016, at 11:50 AM, Wee Sritippho <wee.s@forest.go.th <mailto:wee.s@forest.go.th>> wrote:
Hi,
I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( https://bugzilla.redhat.com/1323465 <https://bugzilla.redhat.com/1323465> ) which would be fixed in 3.6.6.
This is what I've done to solve it:
1. In the web UI, set the failed host to maintenance. 2. Remove it. 3. In that host, run a script from https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail... <https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install> 4. Install ovirt-hosted-engine-setup again. 5. Redeploy again.
Hope that helps
On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: Hi Folks,
I hate to reply to my own message, but I'm really hoping someone can help me with my issue http://lists.ovirt.org/pipermail/users/2016-May/039690.html <http://lists.ovirt.org/pipermail/users/2016-May/039690.html>
Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.
Cheers, Gervais
On May 9, 2016, at 1:42 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com> <mailto:gervais@demontbrun.com <mailto:gervais@demontbrun.com>>> wrote:
Hi All,
I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.
My setup: CentOS 7 clean install yum install -y http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm <http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm> yum install -y ovirt-hosted-engine-setup yum upgrade -y && reboot systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld hosted-engine --deploy
hosted-engine --deploy always throws an error: [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs. [ ERROR ] Unable to add Cultivar2 to the manager and then echo's [ INFO ] Waiting for VDSM hardware info ... [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds [ INFO ] Stage: Clean up [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf' [ INFO ] Stage: Pre-termination [ INFO ] Stage: Termination [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log
Full output of hosted-engine --deploy included in the attached zip file. I've also included vdsm.log (There is more than one tries worth of tries in there). You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.
This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed" <PastedGraphic-1.png>
Trying to reinstall from within the interface just fails again.
The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.
OS: [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release CentOS Linux release 7.2.1511 (Core)
[root@cultivar2 ovirt-hosted-engine-setup]# uname -a Linux cultivar2.grove.silverorange.com <http://cultivar2.grove.silverorange.com> <http://cultivar2.grove.silverorange.com/> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux
Versions: [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt libgovirt-0.3.3-1.el7_2.1.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch ovirt-host-deploy-1.4.1-1.el7.centos.noarch ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch ovirt-release36-007-1.noarch ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64 virt-viewer-2.0-6.el7.x86_64 libgovirt-0.3.3-1.el7_2.1.x86_64 libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64 ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch fence-virt-0.3.2-2.el7.x86_64 virt-what-1.13-6.el7.x86_64 libvirt-python-1.2.17-2.el7.x86_64 libvirt-daemon-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64 libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64 ovirt-host-deploy-1.4.1-1.el7.centos.noarch virt-v2v-1.28.1-1.55.el7.centos.2.x86_64 ovirt-vmconsole-1.0.0-1.el7.centos.noarch ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch libvirt-client-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64 ovirt-release36-007-1.noarch libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64 libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64 ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch ovirt-setup-lib-1.0.1-1.el7.centos.noarch ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch
I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this: <PastedGraphic-2.png>
I'd appreciate any help that can be offered.
Cheers, Gervais
Gervais de Montbrun Systems Administrator / silverorange Inc.
Phone +1 902 367 4532 ext. 104 <tel:%2B1%20902%20367%204532%20ext.%20104> <tel:+1 902 367 4532 ext. 104 <tel:%2B1%20902%20367%204532%20ext.%20104>> Mobile +1 902 978 0009 <tel:%2B1%20902%20978%200009> <tel:+1 902 978 0009 <tel:%2B1%20902%20978%200009>>
<hosted-engine--deploy-logs.zip>
Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users <http://lists.ovirt.org/mailman/listinfo/users>
-- Wee
_______________________________________________ Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users
--------------030000020408000408030009 Content-Type: text/html; charset=utf-8 Content-Transfer-Encoding: 8bit <html> <head> <meta content="text/html; charset=utf-8" http-equiv="Content-Type"> </head> <body bgcolor="#FFFFFF" text="#000000"> <div class="moz-cite-prefix">Hi Gervais,<br> <br> Okay, I see two problems: there are some leftover direcyories causing issues and for some reason VDSM seems to be trying to bind to a port something is already running on (probably an older version of VDSM.) Try removing the duplicate dirs (rmdir /var/run/vdsm/storage/248f46f0-d793-4581-9810-c9d965e2f286 and /rhev/data-center/mnt - if they aren't empty don't rm -rf them because they might be mounted from your production servers. Just mv -i them to /root or somewhere.)<br> <br> Next shutdown the vdsm service with "service vdsm stop" (I think, might be service stop vdsm, I don't use CentOS much) and kill any running vdsm processes (ps ax |grep vdsm) The error that I saw was:<br> <br> MainThread::ERROR::2016-05-13 08:58:38,262::clientIF::128::vds::(__init__) failed to init clientIF, shutting down storage dispatcher<br> MainThread::ERROR::2016-05-13 08:58:38,289::vdsm::171::vds::(run) Exception raised<br> Traceback (most recent call last):<br> File "/usr/share/vdsm/vdsm", line 169, in run<br> serve_clients(log)<br> File "/usr/share/vdsm/vdsm", line 102, in serve_clients<br> cif = clientIF.getInstance(irs, log, scheduler)<br> File "/usr/share/vdsm/clientIF.py", line 193, in getInstance<br> cls._instance = clientIF(irs, log, scheduler)<br> File "/usr/share/vdsm/clientIF.py", line 123, in __init__<br> self._createAcceptor(host, port)<br> File "/usr/share/vdsm/clientIF.py", line 201, in _createAcceptor<br> port, sslctx)<br> File "/usr/share/vdsm/protocoldetector.py", line 170, in __init__<br> sock = _create_socket(host, port)<br> File "/usr/share/vdsm/protocoldetector.py", line 40, in _create_socket<br> server_socket.bind(addr[0][4])<br> File "/usr/lib64/python2.7/socket.py", line 224, in meth<br> return getattr(self._sock,name)(*args)<br> error: [Errno 98] Address already in use<br> <br> If you get the same error, do a netstat -lnp and compare it to the same from a working box to see if something else is running on the VDSM port.<br> <br> <br> On 2016-05-13 09:37 AM, Gervais de Montbrun wrote:<br> </div> <blockquote cite="mid:CAESCRhO-o2my2MqO0-GXzW=y0eyK0YdUpGkJN8Mq_wnXxn_QBA@mail.gmail.com" type="cite"> <div dir="ltr"><span style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">Hi Charles,</span> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px"><br class=""> </div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup.</div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px"><br class=""> </div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">I put the logs here: <a moz-do-not-send="true" href="https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0" class="">https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0</a></div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px"><br class=""> </div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">hosted-engine--deploy-logs.zip<span class="" style="white-space:pre"> </span># Logs from when I tried to deploy and it failed</div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">vdsm.tar.gz<span class="" style="white-space:pre"> </span># /var/log/vdsm</div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px"><br class=""> </div> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px">Output from running vdsm from the command line:</div> <blockquote class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px;margin:0px 0px 0px 40px;border:none;padding:0px"> <div class="">[root@cultivar2 log]# su -s /bin/bash vdsm</div> <div class="">[vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm</div> <div class="">(PID: 6521) I am the actual vdsm 4.17.26-1.el7 <a moz-do-not-send="true" href="http://cultivar2.grove.silverorange.com/" class="">cultivar2.grove.silverorange.com</a> (3.10.0-327.el7.x86_64)</div> <div class="">VDSM will run with cpu affinity: frozenset([1])</div> <div class="">/usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None)</div> <div class="">SUCCESS: <err> = ''; <rc> = 0</div> <div class="">Starting scheduler vdsm.Scheduler</div> <div class="">started</div> <div class="">Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>)</div> <div class="">Run and protect: registerDomainStateChangeCallback, Return response: None</div> <div class="">Trying to connect to Super Vdsm</div> <div class="">Preparing MOM interface</div> <div class="">Using named unix socket /var/run/vdsm/mom-vdsm.sock</div> <div class="">Unregistering all secrests</div> <div class="">trying to connect libvirt</div> <div class="">recovery: started</div> <div class="">Setting channels' timeout to 30 seconds.</div> <div class="">Starting VM channels listener thread.</div> <div class="">Listening at <a moz-do-not-send="true" href="http://0.0.0.0:54321">0.0.0.0:54321</a></div> <div class="">Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0></div> <div class="">recovery: completed in 0s</div> <div class="">Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8></div> <div class="">Starting executor</div> <div class="">Starting worker jsonrpc.Executor/0</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/1</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/2</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/3</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/4</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/5</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/6</div> <div class="">Worker started</div> <div class="">Starting worker jsonrpc.Executor/7</div> <div class="">Worker started</div> <div class="">XMLRPC server running</div> <div class="">Starting executor</div> <div class="">Starting worker periodic/0</div> <div class="">Worker started</div> <div class="">Starting worker periodic/1</div> <div class="">Worker started</div> <div class="">Starting worker periodic/2</div> <div class="">Worker started</div> <div class="">Starting worker periodic/3</div> <div class="">Worker started</div> <div class="">trying to connect libvirt</div> <div class="">Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory</div> <div class="">Traceback (most recent call last):</div> <div class=""> File "/usr/share/vdsm/supervdsm.py", line 78, in _connect</div> <div class=""> utils.retry(self._manager.connect, Exception, timeout=60, tries=3)</div> <div class=""> File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry</div> <div class=""> return func()</div> <div class=""> File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect</div> <div class=""> conn = Client(self._address, authkey=self._authkey)</div> <div class=""> File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client</div> <div class=""> c = SocketClient(address)</div> <div class=""> File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient</div> <div class=""> s.connect(address)</div> <div class=""> File "/usr/lib64/python2.7/socket.py", line 224, in meth</div> <div class=""> return getattr(self._sock,name)(*args)</div> <div class="">error: [Errno 2] No such file or directory</div> <div class="">Killed</div> </blockquote> <div class="" style="color:rgb(0,0,0);font-family:Helvetica;font-size:12px"> <div class=""><br class=""> </div> <div class="">Thanks for the help. It's really appreciated.</div> <div class=""> <div id="signature" class=""><br class=""> Cheers,<br class=""> Gervais</div> </div> </div> </div> <div class="gmail_extra"><br> <div class="gmail_quote">On Fri, May 13, 2016 at 12:55 AM, Charles Tassell <span dir="ltr"><<a moz-do-not-send="true" href="mailto:ctassell@gmail.com" target="_blank">ctassell@gmail.com</a>></span> wrote:<br> <blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"> <div text="#000000" bgcolor="#FFFFFF"> <div>Hi Gervais,<br> <br> Hmm, can you tar up the logfiles (/var/log/vdsm/* on the host you are installing on) and put them somewhere to look at? Also, I found that starting VDSM from the command line is useful as it sometimes spits out error messages that don't show up in the logs. I think the command I used was:<br> su -s /bin/bash vdsm<br> python /usr/share/vdsm/vdsm<br> <br> My problem was that I customized the logging settings in /etc/vdsm/*conf to try and tone down the debugging stuff and had a syntax error. <div> <div class="h5"><br> <br> On 16-05-12 10:24 PM, Gervais de Montbrun wrote:<br> </div> </div> </div> <div> <div class="h5"> <blockquote type="cite"> <div dir="ltr"> <div dir="auto" style="word-wrap:break-word">Hi Charles,<br> <br> Thanks for the suggestion.<br> <br> I cleaned up again using the bash script from the recoving-from-failed-install link below, then reinstalled (yum install ovirt-hosted-engine-setup).<br> <br> I enabled NetworkManager and firewalld as you suggested. The install stops very early on with an error:<br> <span style="white-space:pre-wrap"> </span>[ ERROR ] Failed to execute stage 'Programs detection': hosted-engine cannot be deployed while NetworkManager is running, please stop and disable it before proceeding <br> <br> I disabled and stopped NetworkManager and tried again. Same result. :(<br> <br> Any more guesses?<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 12, 2016, at 9:08 PM, Charles Tassell <<a moz-do-not-send="true" href="mailto:ctassell@gmail.com" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:ctassell@gmail.com">ctassell@gmail.com</a></a>> wrote:<br> <br> Hey Gervais,<br> <br> Try enabling NetworkManager and firewalld before doing the hosted-engine --deploy. I have run into problems with oVirt trying to perform tasks on hosts where firewalld is disabled, so maybe you are running into a similar problem. Also, I think the setup script will disable NetworkManager if it needs to. I know I didn't manually disable it on any of the boxes I installed on.<br> <br> On 16-05-12 04:49 PM, <a moz-do-not-send="true" href="mailto:users-request@ovirt.org" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:users-request@ovirt.org">users-request@ovirt.org</a></a> wrote:<br> <blockquote type="cite">Message: 1<br> Date: Thu, 12 May 2016 14:22:12 -0300<br> From: Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:gervais@demontbrun.com">gervais@demontbrun.com</a></a>><br> To: Wee Sritippho <<a moz-do-not-send="true" href="mailto:wee.s@forest.go.th" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:wee.s@forest.go.th">wee.s@forest.go.th</a></a>><br> Cc: users <<a moz-do-not-send="true" href="mailto:users@ovirt.org" target="_blank">users@ovirt.org</a>><br> Subject: Re: [ovirt-users] Adding another host to my cluster<br> Message-ID: <<a moz-do-not-send="true" href="mailto:28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com" target="_blank">28B7FC74-5C52-4F60-B9F3-39A36621A7CA@demontbrun.com</a>><br> Content-Type: text/plain; charset="utf-8"<br> <br> Hi Wee<br> (and others)<br> <br> Thanks for the reply. I tried what you suggested, but I am in the exact same state. :-(<br> <br> I don't want to completely remove my hosted engine setup as it is working on the two other hosts in my cluster. I did not run the rm -rf stes listed here (<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank"><a class="moz-txt-link-freetext" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a></a> <<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a>>) that would wipe my hosted_engine nfs mount. If you know that this is 100% necessary, please let me know.<br> <br> I did:<br> hosted-engine --clean-metadata --force-cleanup --host-id=3<br> run the bash script to remove all of the ovirt packages and config files<br> reinstalled ovirt-hosted-engine-setup<br> ran "hosted-engine --deploy"<br> <br> I'm back exactly where I started. Is there a way to run just the network configuration part of the deploy?<br> <br> Since the last attempt, I did upgrade my hosted engine and my cluster is now running oVirt 3.6.5.<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 12, 2016, at 11:50 AM, Wee Sritippho <<a moz-do-not-send="true" href="mailto:wee.s@forest.go.th" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:wee.s@forest.go.th">wee.s@forest.go.th</a></a>> wrote:<br> <br> Hi,<br> <br> I used to have a similar problem where one of my host can't be deployed due to the absence of ovirtmgmt bridge. Simone said it's a bug ( <a moz-do-not-send="true" href="https://bugzilla.redhat.com/1323465" target="_blank">https://bugzilla.redhat.com/1323465</a> <<a moz-do-not-send="true" href="https://bugzilla.redhat.com/1323465" target="_blank">https://bugzilla.redhat.com/1323465</a>> ) which would be fixed in 3.6.6.<br> <br> This is what I've done to solve it:<br> <br> 1. In the web UI, set the failed host to maintenance.<br> 2. Remove it.<br> 3. In that host, run a script from <a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank"><a class="moz-txt-link-freetext" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a></a> <<a moz-do-not-send="true" href="https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-fail..." target="_blank">https://www.ovirt.org/documentation/how-to/hosted-engine/#recoving-from-failed-install</a>><br> 4. Install ovirt-hosted-engine-setup again.<br> 5. Redeploy again.<br> <br> Hope that helps<br> <br> On 11 ??????? 2016 22 ?????? 48 ???? 58 ?????? GMT+07:00, Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:gervais@demontbrun.com">gervais@demontbrun.com</a></a>> wrote:<br> Hi Folks,<br> <br> I hate to reply to my own message, but I'm really hoping someone can help me with my issue<br> <a moz-do-not-send="true" href="http://lists.ovirt.org/pipermail/users/2016-May/039690.html" target="_blank">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a> <<a moz-do-not-send="true" href="http://lists.ovirt.org/pipermail/users/2016-May/039690.html" target="_blank">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a>><br> <br> Does anyone have a suggestion for me? If there is any more information that I can provide that would help you to help me, please advise.<br> <br> Cheers,<br> Gervais<br> <br> <br> <br> <blockquote type="cite">On May 9, 2016, at 1:42 PM, Gervais de Montbrun <<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank"><a class="moz-txt-link-abbreviated" href="mailto:gervais@demontbrun.com">gervais@demontbrun.com</a></a> <mailto:<a moz-do-not-send="true" href="mailto:gervais@demontbrun.com" target="_blank">gervais@demontbrun.com</a>>> wrote:<br> <br> Hi All,<br> <br> I'm trying to add a third host into my oVirt cluster. I have hosted engine setup on the first two. It's failing to finish the hosted-engine --deploy on this third host. I wiped the server and did a CentOS 7 minimum install and ran it again to have a clean machine.<br> <br> My setup:<br> CentOS 7 clean install<br> yum install -y <a moz-do-not-send="true" href="http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm" target="_blank"><a class="moz-txt-link-freetext" href="http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm">http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm</a></a> <<a moz-do-not-send="true" href="http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm" target="_blank">http://resources.ovirt.org/pub/yum-repo/ovirt-release36.rpm</a>><br> yum install -y ovirt-hosted-engine-setup<br> yum upgrade -y && reboot<br> systemctl disable NetworkManager ; systemctl stop NetworkManager ; systemctl disable firewalld ; systemctl stop firewalld<br> hosted-engine --deploy<br> <br> hosted-engine --deploy always throws an error:<br> [ ERROR ] The VDSM host was found in a failed state. Please check engine and bootstrap installation logs.<br> [ ERROR ] Unable to add Cultivar2 to the manager<br> and then echo's<br> [ INFO ] Waiting for VDSM hardware info<br> ...<br> [ ERROR ] Failed to execute stage 'Closing up': VDSM did not start within 120 seconds<br> [ INFO ] Stage: Clean up<br> [ INFO ] Generating answer file '/var/lib/ovirt-hosted-engine-setup/answers/answers-20160509131103.conf'<br> [ INFO ] Stage: Pre-termination<br> [ INFO ] Stage: Termination<br> [ ERROR ] Hosted Engine deployment failed: this system is not reliable, please check the issue, fix and redeploy<br> Log file is located at /var/log/ovirt-hosted-engine-setup/ovirt-hosted-engine-setup-20160509130658-qb8ev0.log<br> <br> Full output of hosted-engine --deploy included in the attached zip file.<br> I've also included vdsm.log (There is more than one tries worth of tries in there).<br> You'll also find the ovirt-hosted-engine-setup-20160509130658-qb8ev0.log listed above.<br> <br> This is my "test" setup. Cultivar0 is my first host and my nfs server for storage. I have two hosts in the setup already and everything is working fine. The host does show up in the oVirt admin, but shows "Installed Failed"<br> <PastedGraphic-1.png><br> <br> Trying to reinstall from within the interface just fails again.<br> <br> The ovirt bridge interface is not configured and there are no config files in /etc/sysconfi/network-scripts related to ovirt.<br> <br> OS:<br> [root@cultivar2 ovirt-hosted-engine-setup]# cat /etc/redhat-release<br> CentOS Linux release 7.2.1511 (Core)<br> <br> [root@cultivar2 ovirt-hosted-engine-setup]# uname -a<br> Linux <a moz-do-not-send="true" href="http://cultivar2.grove.silverorange.com" target="_blank">cultivar2.grove.silverorange.com</a> <<a moz-do-not-send="true" href="http://cultivar2.grove.silverorange.com/" target="_blank">http://cultivar2.grove.silverorange.com/</a>> 3.10.0-327.13.1.el7.x86_64 #1 SMP Thu Mar 31 16:04:38 UTC 2016 x86_64 x86_64 x86_64 GNU/Linux<br> <br> Versions:<br> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i ovirt<br> libgovirt-0.3.3-1.el7_2.1.x86_64<br> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch<br> ovirt-host-deploy-1.4.1-1.el7.centos.noarch<br> ovirt-vmconsole-1.0.0-1.el7.centos.noarch<br> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch<br> ovirt-release36-007-1.noarch<br> ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch<br> ovirt-setup-lib-1.0.1-1.el7.centos.noarch<br> ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]#<br> [root@cultivar2 ovirt-hosted-engine-setup]# rpm -qa | grep -i virt<br> libvirt-daemon-driver-secret-1.2.17-13.el7_2.4.x86_64<br> virt-viewer-2.0-6.el7.x86_64<br> libgovirt-0.3.3-1.el7_2.1.x86_64<br> libvirt-daemon-kvm-1.2.17-13.el7_2.4.x86_64<br> ovirt-hosted-engine-setup-1.3.5.0-1.1.el7.noarch<br> fence-virt-0.3.2-2.el7.x86_64<br> virt-what-1.13-6.el7.x86_64<br> libvirt-python-1.2.17-2.el7.x86_64<br> libvirt-daemon-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-config-nwfilter-1.2.17-13.el7_2.4.x86_64<br> libvirt-lock-sanlock-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-nodedev-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-network-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-storage-1.2.17-13.el7_2.4.x86_64<br> ovirt-host-deploy-1.4.1-1.el7.centos.noarch<br> virt-v2v-1.28.1-1.55.el7.centos.2.x86_64<br> ovirt-vmconsole-1.0.0-1.el7.centos.noarch<br> ovirt-vmconsole-host-1.0.0-1.el7.centos.noarch<br> libvirt-client-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-nwfilter-1.2.17-13.el7_2.4.x86_64<br> ovirt-release36-007-1.noarch<br> libvirt-daemon-driver-interface-1.2.17-13.el7_2.4.x86_64<br> libvirt-daemon-driver-qemu-1.2.17-13.el7_2.4.x86_64<br> ovirt-engine-sdk-python-3.6.5.0-1.el7.centos.noarch<br> ovirt-setup-lib-1.0.1-1.el7.centos.noarch<br> ovirt-hosted-engine-ha-1.3.5.3-1.1.el7.noarch<br> <br> I also have a series of stuck tasks that I can't clear related to the host that can't be added... This is a secondary issue and I don't want to get off track, but they look like this:<br> <PastedGraphic-2.png><br> <br> I'd appreciate any help that can be offered.<br> <br> Cheers,<br> Gervais<br> <br> <br> Gervais de Montbrun<br> Systems Administrator / silverorange Inc.<br> <br> Phone <span style="white-space:pre-wrap"> </span><a moz-do-not-send="true" href="tel:%2B1%20902%20367%204532%20ext.%20104" value="+19023674532" target="_blank">+1 902 367 4532 ext. 104</a> <tel:<a moz-do-not-send="true" href="tel:%2B1%20902%20367%204532%20ext.%20104" value="+19023674532" target="_blank">+1 902 367 4532 ext. 104</a>><br> Mobile <span style="white-space:pre-wrap"> </span><a moz-do-not-send="true" href="tel:%2B1%20902%20978%200009" value="+19029780009" target="_blank">+1 902 978 0009</a> <tel:<a moz-do-not-send="true" href="tel:%2B1%20902%20978%200009" value="+19029780009" target="_blank">+1 902 978 0009</a>><br> <br> <hosted-engine--deploy-logs.zip><br> </blockquote> <br> <br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a> <<a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a>><br> <br> -- <br> Wee<br> </blockquote> <br> </blockquote> <br> _______________________________________________<br> Users mailing list<br> <a moz-do-not-send="true" href="mailto:Users@ovirt.org" target="_blank">Users@ovirt.org</a><br> <a moz-do-not-send="true" href="http://lists.ovirt.org/mailman/listinfo/users" target="_blank">http://lists.ovirt.org/mailman/listinfo/users</a><br> </blockquote> <br> </div> </div> </blockquote> <br> </div> </div> </div> </blockquote> </div> <br> </div> </blockquote> <br> </body> </html> --------------030000020408000408030009--

On Fri, May 13, 2016 at 3:53 PM, Charles Tassell <ctassell@gmail.com> wrote:
Hi Gervais,
Okay, I see two problems: there are some leftover direcyories causing issues and for some reason VDSM seems to be trying to bind to a port something is already running on (probably an older version of VDSM.) Try removing the duplicate dirs (rmdir /var/run/vdsm/storage/248f46f0-d793-4581-9810-c9d965e2f286 and /rhev/data-center/mnt - if they aren't empty don't rm -rf them because they might be mounted from your production servers. Just mv -i them to /root or somewhere.)
You should not touch directories under /run/vdsm/storage or /rhev/data-center, they should not have any negative effect on the system.
Next shutdown the vdsm service with "service vdsm stop" (I think, might be service stop vdsm, I don't use CentOS much) and kill any running vdsm processes (ps ax |grep vdsm) The error that I saw was:
MainThread::ERROR::2016-05-13 08:58:38,262::clientIF::128::vds::(__init__) failed to init clientIF, shutting down storage dispatcher MainThread::ERROR::2016-05-13 08:58:38,289::vdsm::171::vds::(run) Exception raised Traceback (most recent call last): File "/usr/share/vdsm/vdsm", line 169, in run serve_clients(log) File "/usr/share/vdsm/vdsm", line 102, in serve_clients cif = clientIF.getInstance(irs, log, scheduler) File "/usr/share/vdsm/clientIF.py", line 193, in getInstance cls._instance = clientIF(irs, log, scheduler) File "/usr/share/vdsm/clientIF.py", line 123, in __init__ self._createAcceptor(host, port) File "/usr/share/vdsm/clientIF.py", line 201, in _createAcceptor port, sslctx) File "/usr/share/vdsm/protocoldetector.py", line 170, in __init__ sock = _create_socket(host, port) File "/usr/share/vdsm/protocoldetector.py", line 40, in _create_socket server_socket.bind(addr[0][4]) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 98] Address already in use
Please open a bug - this should never happen. https://bugzilla.redhat.com/enter_bug.cgi?product=vdsm oVirt team: Infra Severity: High Vdsm must set the socket option socket.SO_REUSEADDR before binding. Ensuring that only one vdsm instance is running must be done elsewhere if needed, for example using a lock file. Nir

On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com> wrote:
Hi Charles,
I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup.
I put the logs here: https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0
hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it failed vdsm.tar.gz # /var/log/vdsm
Output from running vdsm from the command line:
[root@cultivar2 log]# su -s /bin/bash vdsm
This cannot work unless supervdsmd is running...
[vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> = ''; <rc> = 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=60, tries=3) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect conn = Client(self._address, authkey=self._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client c = SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory
Vdsm tries to connect to supervdsmd on startup, and if it is not running it will fail. You can do: systemctl start supervdsmd And they you can run vdsmd from the shell. But why do you need to run vdsm from the shell? Nir

Hi Nir, Thank you for you input. It was suggest that I try starting vdsm from the shell so I could see the output in an effort to solve my issue: http://lists.ovirt.org/pipermail/users/2016-May/039690.html Cheers, Gervais On May 13, 2016, at 12:12 PM, Nir Soffer <nsoffer@redhat.com> wrote: On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com> wrote: Hi Charles, I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup. I put the logs here: https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0 hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it failed vdsm.tar.gz # /var/log/vdsm Output from running vdsm from the command line: [root@cultivar2 log]# su -s /bin/bash vdsm This cannot work unless supervdsmd is running... [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> = ''; <rc> = 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=60, tries=3) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect conn = Client(self._address, authkey=self._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client c = SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory Vdsm tries to connect to supervdsmd on startup, and if it is not running it will fail. You can do: systemctl start supervdsmd And they you can run vdsmd from the shell. But why do you need to run vdsm from the shell? Nir

On May 13, 2016, at 1:36 PM, Gervais de Montbrun = <gervais@demontbrun.com> wrote: =20 Hi Nir, =20 Thank you for you input. =20 It was suggest that I try starting vdsm from the shell so I could see =
=20 Cheers, Gervais =20 =20 =20
On May 13, 2016, at 12:12 PM, Nir Soffer <nsoffer@redhat.com = <mailto:nsoffer@redhat.com>> wrote: =20 On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote:
Hi Charles, =20 I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to = start up properly and the BRIDGE ethernet interface + ovirt routes are not = setup. =20 I put the logs here: = https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0 = <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0> =20 hosted-engine--deploy-logs.zip # Logs from when I tried to deploy = and it failed vdsm.tar.gz # /var/log/vdsm =20 Output from running vdsm from the command line: =20 [root@cultivar2 log]# su -s /bin/bash vdsm =20 This cannot work unless supervdsmd is running... =20 [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com = <http://cultivar2.grove.silverorange.com/> (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> =3D ''; <rc> =3D 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=3D<functools.partial = object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: = None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 <http://0.0.0.0:54321/> Adding detector <rpc.bindingxmlrpc.XmlDetector instance at = 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at = 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file = or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=3D60, =
--Apple-Mail=_B4BC9680-8BD9-4CBD-BC07-D9AE3227F522 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Hey Folks, I'm still stuck here... I believe the issue may be because the Enrolling = certificates stage is failing... Any idea's how to sort this out? I see: Installing Host Cultivar2. Enrolling certificate Failed to install Host Cultivar2. Certificate enrollment failed. Host Cultivar2 installation failed. Certificate enrollement = failed. Any attempt to re-install fails at this step. Cheers, Gervais the output in an effort to solve my issue: = http://lists.ovirt.org/pipermail/users/2016-May/039690.html = <http://lists.ovirt.org/pipermail/users/2016-May/039690.html> tries=3D3)
File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in = retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, = in connect conn =3D Client(self._address, authkey=3Dself._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, = in Client c =3D SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, = in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory =20 Vdsm tries to connect to supervdsmd on startup, and if it is not = running it will fail. =20 You can do: =20 systemctl start supervdsmd =20 And they you can run vdsmd from the shell. =20 But why do you need to run vdsm from the shell? =20 Nir =20
--Apple-Mail=_B4BC9680-8BD9-4CBD-BC07-D9AE3227F522 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=us-ascii <html><head><meta http-equiv=3D"Content-Type" content=3D"text/html = charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; = -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" = class=3D"">Hey Folks,<div class=3D""><br class=3D""></div><div = class=3D"">I'm still stuck here... I believe the issue may be because = the Enrolling certificates stage is failing...</div><div class=3D""><br = class=3D""></div><div class=3D"">Any idea's how to sort this = out?</div><div class=3D""><br class=3D""></div><div class=3D"">I = see:</div><div class=3D""><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> </span>Installing Host Cultivar2. = Enrolling certificate</div><div class=3D""><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> </span>Failed to install Host Cultivar2. = Certificate enrollment failed.</div><div class=3D""><span = class=3D"Apple-tab-span" style=3D"white-space:pre"> </span>Host = Cultivar2 installation failed. Certificate enrollement failed.</div><div = class=3D""><br class=3D""></div><div class=3D"">Any attempt to = re-install fails at this step.</div><div class=3D""><div class=3D""> <div id=3D"signature" class=3D""><br class=3D"">Cheers,<br = class=3D"">Gervais<br class=3D""><br class=3D""><br class=3D""></div> </div> <br class=3D""><div><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 13, 2016, at 1:36 PM, Gervais de Montbrun <<a = href=3D"mailto:gervais@demontbrun.com" = class=3D"">gervais@demontbrun.com</a>> wrote:</div><br = class=3D"Apple-interchange-newline"><div class=3D""><div dir=3D"ltr" = class=3D""><div dir=3D"auto" style=3D"word-wrap:break-word" class=3D"">Hi = Nir,<br class=3D""><br class=3D"">Thank you for you input.<br = class=3D""><br class=3D"">It was suggest that I try starting vdsm from = the shell so I could see the output in an effort to solve my issue: <a = href=3D"http://lists.ovirt.org/pipermail/users/2016-May/039690.html" = target=3D"_blank" = class=3D"">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a>= <br class=3D""><br class=3D"">Cheers,<br class=3D"">Gervais<br = class=3D""><br class=3D""><br class=3D""><br class=3D""><blockquote = type=3D"cite" class=3D"">On May 13, 2016, at 12:12 PM, Nir Soffer <<a = href=3D"mailto:nsoffer@redhat.com" target=3D"_blank" = class=3D"">nsoffer@redhat.com</a>> wrote:<br class=3D""><br = class=3D"">On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun<br = class=3D""><<a href=3D"mailto:gervais@demontbrun.com" target=3D"_blank"= class=3D"">gervais@demontbrun.com</a>> wrote:<br = class=3D""><blockquote type=3D"cite" class=3D"">Hi Charles,<br = class=3D""><br class=3D"">I think the problem I am having is due to the = setup failing and not<br class=3D"">something in vdsm configs as I have = never gotten this server to start up<br class=3D"">properly and the = BRIDGE ethernet interface + ovirt routes are not setup.<br class=3D""><br = class=3D"">I put the logs here:<br class=3D""><a = href=3D"https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5tw= Ha?dl=3D0" target=3D"_blank" = class=3D"">https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW= 5twHa?dl=3D0</a><br class=3D""><br = class=3D"">hosted-engine--deploy-logs.zip # Logs from when I tried to = deploy and it<br class=3D"">failed<br class=3D"">vdsm.tar.gz # = /var/log/vdsm<br class=3D""><br class=3D"">Output from running vdsm from = the command line:<br class=3D""><br class=3D"">[root@cultivar2 log]# su = -s /bin/bash vdsm<br class=3D""></blockquote><br class=3D"">This cannot = work unless supervdsmd is running...<br class=3D""><br = class=3D""><blockquote type=3D"cite" class=3D"">[vdsm@cultivar2 log]$ = python /usr/share/vdsm/vdsm<br class=3D"">(PID: 6521) I am the actual = vdsm 4.17.26-1.el7<br class=3D""><a = href=3D"http://cultivar2.grove.silverorange.com/" target=3D"_blank" = class=3D"">cultivar2.grove.silverorange.com</a> = (3.10.0-327.el7.x86_64)<br class=3D"">VDSM will run with cpu affinity: = frozenset([1])<br class=3D"">/usr/bin/taskset --all-tasks --pid = --cpu-list 1 6521 (cwd None)<br class=3D"">SUCCESS: <err> =3D ''; = <rc> =3D 0<br class=3D"">Starting scheduler vdsm.Scheduler<br = class=3D"">started<br class=3D"">Run and protect:<br = class=3D"">registerDomainStateChangeCallback(callbackFunc=3D<functools.= partial object at<br class=3D"">0x381b158>)<br class=3D"">Run and = protect: registerDomainStateChangeCallback, Return response: None<br = class=3D"">Trying to connect to Super Vdsm<br class=3D"">Preparing MOM = interface<br class=3D"">Using named unix socket = /var/run/vdsm/mom-vdsm.sock<br class=3D"">Unregistering all secrests<br = class=3D"">trying to connect libvirt<br class=3D"">recovery: started<br = class=3D"">Setting channels' timeout to 30 seconds.<br class=3D"">Starting= VM channels listener thread.<br class=3D"">Listening at <a = href=3D"http://0.0.0.0:54321/" target=3D"_blank" = class=3D"">0.0.0.0:54321</a><br class=3D"">Adding detector = <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0><br = class=3D"">recovery: completed in 0s<br class=3D"">Adding detector = <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8><br = class=3D"">Starting executor<br class=3D"">Starting worker = jsonrpc.Executor/0<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/1<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/2<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/3<br = class=3D"">Worker started<br class=3D"">Starting worker = jsonrpc.Executor/4<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/5<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/6<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/7<br = class=3D"">Worker started<br class=3D"">XMLRPC server running<br = class=3D"">Starting executor<br class=3D"">Starting worker periodic/0<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/1<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/2<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/3<br = class=3D"">Worker started<br class=3D"">trying to connect libvirt<br = class=3D"">Panic: Connect to supervdsm service failed: [Errno 2] No such = file or<br class=3D"">directory<br class=3D"">Traceback (most recent = call last):<br class=3D"">File "/usr/share/vdsm/supervdsm.py", line 78, = in _connect<br class=3D""> utils.retry(self._manager.connect, = Exception, timeout=3D60, tries=3D3)<br class=3D"">File = "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry<br = class=3D""> return func()<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in<br = class=3D"">connect<br class=3D""> conn =3D Client(self._address, = authkey=3Dself._authkey)<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in<br = class=3D"">Client<br class=3D""> c =3D SocketClient(address)<br = class=3D"">File "/usr/lib64/python2.7/multiprocessing/connection.py", = line 308, in<br class=3D"">SocketClient<br class=3D""> = s.connect(address)<br class=3D"">File = "/usr/lib64/python2.7/socket.py", line 224, in meth<br class=3D""> = return getattr(self._sock,name)(*args)<br class=3D"">error: [Errno = 2] No such file or directory<br class=3D""></blockquote><br = class=3D"">Vdsm tries to connect to supervdsmd on startup, and if it is = not running<br class=3D"">it will fail.<br class=3D""><br class=3D"">You = can do:<br class=3D""><br class=3D"">systemctl start supervdsmd<br = class=3D""><br class=3D"">And they you can run vdsmd from the shell.<br = class=3D""><br class=3D"">But why do you need to run vdsm from the = shell?<br class=3D""><br class=3D"">Nir<br class=3D""></blockquote><br = class=3D""></div></div> </div></blockquote></div><br class=3D""></div></body></html>= --Apple-Mail=_B4BC9680-8BD9-4CBD-BC07-D9AE3227F522--

On Sun, May 15, 2016 at 1:32 AM, Gervais de Montbrun <gervais@demontbrun.com
wrote:
Hey Folks,
I'm still stuck here... I believe the issue may be because the Enrolling certificates stage is failing...
Any idea's how to sort this out?
I see: Installing Host Cultivar2. Enrolling certificate Failed to install Host Cultivar2. Certificate enrollment failed. Host Cultivar2 installation failed. Certificate enrollement failed.
Any attempt to re-install fails at this step.
Cheers, Gervais
I also see in the logs that the manager (ovirt-engine) rejected the host addition to the setup. Please add the /var/log/ovirt-engine/engine.log from the engine machine. Also, the engine machine has the install log - open one of the Event logs in the UI that says you host installation failed and you will see it retrieved the install log to a path under your engine. Please include this as well.
On May 13, 2016, at 1:36 PM, Gervais de Montbrun <gervais@demontbrun.com> wrote:
Hi Nir,
Thank you for you input.
It was suggest that I try starting vdsm from the shell so I could see the output in an effort to solve my issue: http://lists.ovirt.org/pipermail/users/2016-May/039690.html
Cheers, Gervais
On May 13, 2016, at 12:12 PM, Nir Soffer <nsoffer@redhat.com> wrote:
On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com> wrote:
Hi Charles,
I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to start up properly and the BRIDGE ethernet interface + ovirt routes are not setup.
I put the logs here: https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=0
hosted-engine--deploy-logs.zip # Logs from when I tried to deploy and it failed vdsm.tar.gz # /var/log/vdsm
Output from running vdsm from the command line:
[root@cultivar2 log]# su -s /bin/bash vdsm
This cannot work unless supervdsmd is running...
[vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> = ''; <rc> = 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=<functools.partial object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 Adding detector <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=60, tries=3) File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in connect conn = Client(self._address, authkey=self._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in Client c = SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory
Vdsm tries to connect to supervdsmd on startup, and if it is not running it will fail.
You can do:
systemctl start supervdsmd
And they you can run vdsmd from the shell.
But why do you need to run vdsm from the shell?
Nir
_______________________________________________ Users mailing list Users@ovirt.org http://lists.ovirt.org/mailman/listinfo/users

On May 15, 2016, at 5:10 PM, Roy Golan <rgolan@redhat.com = <mailto:rgolan@redhat.com>> wrote: =20 =20 =20 On Sun, May 15, 2016 at 1:32 AM, Gervais de Montbrun = <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: Hey Folks, =20 I'm still stuck here... I believe the issue may be because the = Enrolling certificates stage is failing... =20 Any idea's how to sort this out? =20 I see: Installing Host Cultivar2. Enrolling certificate Failed to install Host Cultivar2. Certificate enrollment failed. Host Cultivar2 installation failed. Certificate enrollement = failed. =20 Any attempt to re-install fails at this step. =20 Cheers, Gervais =20 =20 =20 I also see in the logs that the manager (ovirt-engine) rejected the = host addition to the setup. Please add the = /var/log/ovirt-engine/engine.log from the engine machine. Also, the = engine machine has the install log - open one of the Event logs in the = UI that says you host installation failed and you will see it retrieved =
=20 =20
On May 13, 2016, at 1:36 PM, Gervais de Montbrun = <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: =20 Hi Nir, =20 Thank you for you input. =20 It was suggest that I try starting vdsm from the shell so I could see =
=20 Cheers, Gervais =20 =20 =20
On May 13, 2016, at 12:12 PM, Nir Soffer <nsoffer@redhat.com = <mailto:nsoffer@redhat.com>> wrote: =20 On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote:
Hi Charles, =20 I think the problem I am having is due to the setup failing and not something in vdsm configs as I have never gotten this server to = start up properly and the BRIDGE ethernet interface + ovirt routes are not = setup. =20 I put the logs here: = https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0 = <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0> =20 hosted-engine--deploy-logs.zip # Logs from when I tried to deploy = and it failed vdsm.tar.gz # /var/log/vdsm =20 Output from running vdsm from the command line: =20 [root@cultivar2 log]# su -s /bin/bash vdsm =20 This cannot work unless supervdsmd is running... =20 [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com = <http://cultivar2.grove.silverorange.com/> (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> =3D ''; <rc> =3D 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=3D<functools.partial = object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return = response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 <http://0.0.0.0:54321/> Adding detector <rpc.bindingxmlrpc.XmlDetector instance at = 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at = 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file = or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=3D60, =
--Apple-Mail=_C96C4A69-29F4-48A1-BD73-0F39D0714F1E Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Hi Roy, I've added /var/log/ovirt-engine/engine.log to my DropBox: = https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?= dl=3D0 = <https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm= ?dl=3D0> When I look in the engine.log, I see the error "unable to load number = from serial.txt" I did a find for serial.txt and the only file that comes up is: = /etc/pki/ovirt-engine/serial.txt=20 (Makes sense that this is the file it wants to read) When I open /etc/pki/ovirt-engine/serial.txt, it is an empty file.=20 [root@cultivar ovirt-engine]# cat = /etc/pki/ovirt-engine/serial.txt [root@cultivar ovirt-engine]#=20 What should be in serial.txt? I don't understand where to find the other log that you mention in your = email below. I don't see any paths listed in the UI when I click on the = failed event for certificate enrollment. Cheers, Gervais the install log to a path under your engine. Please include this as = well. the output in an effort to solve my issue: = http://lists.ovirt.org/pipermail/users/2016-May/039690.html = <http://lists.ovirt.org/pipermail/users/2016-May/039690.html> tries=3D3)
File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in = retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, = in connect conn =3D Client(self._address, authkey=3Dself._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line = 173, in Client c =3D SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line = 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory =20 Vdsm tries to connect to supervdsmd on startup, and if it is not = running it will fail. =20 You can do: =20 systemctl start supervdsmd =20 And they you can run vdsmd from the shell. =20 But why do you need to run vdsm from the shell? =20 Nir =20 =20 =20
Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users = <http://lists.ovirt.org/mailman/listinfo/users> =20 =20
--Apple-Mail=_C96C4A69-29F4-48A1-BD73-0F39D0714F1E Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=us-ascii <html><head><meta http-equiv=3D"Content-Type" content=3D"text/html = charset=3Dus-ascii"><meta http-equiv=3D"Content-Type" content=3D"text/html= charset=3Dus-ascii"><meta http-equiv=3D"Content-Type" = content=3D"text/html charset=3Dus-ascii"></head><body style=3D"word-wrap: = break-word; -webkit-nbsp-mode: space; -webkit-line-break: = after-white-space;" class=3D"">Hi Roy,<div class=3D""><br = class=3D""></div><div class=3D"">I've = added /var/log/ovirt-engine/engine.log to my DropBox: <a = href=3D"https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxH= Ka/vdsm?dl=3D0" = class=3D"">https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dM= QxHKa/vdsm?dl=3D0</a></div><div class=3D""><br class=3D""></div><div = class=3D"">When I look in the engine.log, I see the error "unable to = load number from serial.txt"</div><div class=3D"">I did a find for = serial.txt and the only file that comes up = is: /etc/pki/ovirt-engine/serial.txt </div><div = class=3D"">(Makes sense that this is the file it wants to = read)</div><div class=3D"">When I = open /etc/pki/ovirt-engine/serial.txt, it is an empty = file. </div><div class=3D""><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> = </span>[root@cultivar ovirt-engine]# cat = /etc/pki/ovirt-engine/serial.txt</div><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> = </span>[root@cultivar ovirt-engine]# <div class=3D""><br = class=3D""></div><div class=3D"">What should be in serial.txt?</div><div = class=3D""><br class=3D""></div><div class=3D"">I don't understand where = to find the other log that you mention in your email below. I don't see = any paths listed in the UI when I click on the failed event for = certificate enrollment.<br class=3D""><div class=3D""> <div id=3D"signature" class=3D""><br class=3D"">Cheers,<br = class=3D"">Gervais<br class=3D""><br class=3D""><br class=3D""></div> </div> <br class=3D""><div class=3D""><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 15, 2016, at 5:10 PM, Roy Golan <<a = href=3D"mailto:rgolan@redhat.com" class=3D"">rgolan@redhat.com</a>> = wrote:</div><br class=3D"Apple-interchange-newline"><div class=3D""><div = dir=3D"ltr" class=3D""><br class=3D""><div class=3D"gmail_extra"><br = class=3D""><div class=3D"gmail_quote">On Sun, May 15, 2016 at 1:32 AM, = Gervais de Montbrun <span dir=3D"ltr" class=3D""><<a = href=3D"mailto:gervais@demontbrun.com" target=3D"_blank" = class=3D"">gervais@demontbrun.com</a>></span> wrote:<br = class=3D""><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 = .8ex;border-left:1px #ccc solid;padding-left:1ex"><div = style=3D"word-wrap:break-word" class=3D"">Hey Folks,<div class=3D""><br = class=3D""></div><div class=3D"">I'm still stuck here... I believe the = issue may be because the Enrolling certificates stage is = failing...</div><div class=3D""><br class=3D""></div><div class=3D"">Any = idea's how to sort this out?</div><div class=3D""><br = class=3D""></div><div class=3D"">I see:</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Installing Host = Cultivar2. Enrolling certificate</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Failed to install = Host Cultivar2. Certificate enrollment failed.</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Host Cultivar2 = installation failed. Certificate enrollement failed.</div><div = class=3D""><br class=3D""></div><div class=3D"">Any attempt to = re-install fails at this step.</div><div class=3D""><div class=3D""> <div class=3D""><br class=3D"">Cheers,<br class=3D"">Gervais<br = class=3D""><br class=3D""><br = class=3D""></div></div></div></div></blockquote><div class=3D""><br = class=3D""></div><div class=3D"">I also see in the logs that the manager = (ovirt-engine) rejected the host addition to the setup. Please add the = /var/log/ovirt-engine/engine.log from the engine machine. Also, the = engine machine has the install log - open one of the Event logs in the = UI that says you host installation failed and you will see it retrieved = the install log to a path under your engine. Please include this as = well.</div><div class=3D""> </div><blockquote class=3D"gmail_quote" = style=3D"margin:0 0 0 .8ex;border-left:1px #ccc = solid;padding-left:1ex"><div style=3D"word-wrap:break-word" = class=3D""><div class=3D""><div class=3D""><div class=3D""></div> </div><div class=3D""><div class=3D"h5"> <br class=3D""><div class=3D""><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 13, 2016, at 1:36 PM, Gervais de Montbrun <<a = href=3D"mailto:gervais@demontbrun.com" target=3D"_blank" = class=3D"">gervais@demontbrun.com</a>> wrote:</div><br class=3D""><div = class=3D""><div dir=3D"ltr" class=3D""><div dir=3D"auto" = style=3D"word-wrap:break-word" class=3D"">Hi Nir,<br class=3D""><br = class=3D"">Thank you for you input.<br class=3D""><br class=3D"">It was = suggest that I try starting vdsm from the shell so I could see the = output in an effort to solve my issue: <a = href=3D"http://lists.ovirt.org/pipermail/users/2016-May/039690.html" = target=3D"_blank" = class=3D"">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a>= <br class=3D""><br class=3D"">Cheers,<br class=3D"">Gervais<br = class=3D""><br class=3D""><br class=3D""><br class=3D""><blockquote = type=3D"cite" class=3D"">On May 13, 2016, at 12:12 PM, Nir Soffer <<a = href=3D"mailto:nsoffer@redhat.com" target=3D"_blank" = class=3D"">nsoffer@redhat.com</a>> wrote:<br class=3D""><br = class=3D"">On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun<br = class=3D""><<a href=3D"mailto:gervais@demontbrun.com" target=3D"_blank"= class=3D"">gervais@demontbrun.com</a>> wrote:<br = class=3D""><blockquote type=3D"cite" class=3D"">Hi Charles,<br = class=3D""><br class=3D"">I think the problem I am having is due to the = setup failing and not<br class=3D"">something in vdsm configs as I have = never gotten this server to start up<br class=3D"">properly and the = BRIDGE ethernet interface + ovirt routes are not setup.<br class=3D""><br = class=3D"">I put the logs here:<br class=3D""><a = href=3D"https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5tw= Ha?dl=3D0" target=3D"_blank" = class=3D"">https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW= 5twHa?dl=3D0</a><br class=3D""><br = class=3D"">hosted-engine--deploy-logs.zip # Logs from when I tried to = deploy and it<br class=3D"">failed<br class=3D"">vdsm.tar.gz # = /var/log/vdsm<br class=3D""><br class=3D"">Output from running vdsm from = the command line:<br class=3D""><br class=3D"">[root@cultivar2 log]# su = -s /bin/bash vdsm<br class=3D""></blockquote><br class=3D"">This cannot = work unless supervdsmd is running...<br class=3D""><br = class=3D""><blockquote type=3D"cite" class=3D"">[vdsm@cultivar2 log]$ = python /usr/share/vdsm/vdsm<br class=3D"">(PID: 6521) I am the actual = vdsm 4.17.26-1.el7<br class=3D""><a = href=3D"http://cultivar2.grove.silverorange.com/" target=3D"_blank" = class=3D"">cultivar2.grove.silverorange.com</a> = (3.10.0-327.el7.x86_64)<br class=3D"">VDSM will run with cpu affinity: = frozenset([1])<br class=3D"">/usr/bin/taskset --all-tasks --pid = --cpu-list 1 6521 (cwd None)<br class=3D"">SUCCESS: <err> =3D ''; = <rc> =3D 0<br class=3D"">Starting scheduler vdsm.Scheduler<br = class=3D"">started<br class=3D"">Run and protect:<br = class=3D"">registerDomainStateChangeCallback(callbackFunc=3D<functools.= partial object at<br class=3D"">0x381b158>)<br class=3D"">Run and = protect: registerDomainStateChangeCallback, Return response: None<br = class=3D"">Trying to connect to Super Vdsm<br class=3D"">Preparing MOM = interface<br class=3D"">Using named unix socket = /var/run/vdsm/mom-vdsm.sock<br class=3D"">Unregistering all secrests<br = class=3D"">trying to connect libvirt<br class=3D"">recovery: started<br = class=3D"">Setting channels' timeout to 30 seconds.<br class=3D"">Starting= VM channels listener thread.<br class=3D"">Listening at <a = href=3D"http://0.0.0.0:54321/" target=3D"_blank" = class=3D"">0.0.0.0:54321</a><br class=3D"">Adding detector = <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0><br = class=3D"">recovery: completed in 0s<br class=3D"">Adding detector = <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8><br = class=3D"">Starting executor<br class=3D"">Starting worker = jsonrpc.Executor/0<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/1<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/2<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/3<br = class=3D"">Worker started<br class=3D"">Starting worker = jsonrpc.Executor/4<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/5<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/6<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/7<br = class=3D"">Worker started<br class=3D"">XMLRPC server running<br = class=3D"">Starting executor<br class=3D"">Starting worker periodic/0<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/1<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/2<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/3<br = class=3D"">Worker started<br class=3D"">trying to connect libvirt<br = class=3D"">Panic: Connect to supervdsm service failed: [Errno 2] No such = file or<br class=3D"">directory<br class=3D"">Traceback (most recent = call last):<br class=3D"">File "/usr/share/vdsm/supervdsm.py", line 78, = in _connect<br class=3D""> utils.retry(self._manager.connect, = Exception, timeout=3D60, tries=3D3)<br class=3D"">File = "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry<br = class=3D""> return func()<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in<br = class=3D"">connect<br class=3D""> conn =3D Client(self._address, = authkey=3Dself._authkey)<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in<br = class=3D"">Client<br class=3D""> c =3D SocketClient(address)<br = class=3D"">File "/usr/lib64/python2.7/multiprocessing/connection.py", = line 308, in<br class=3D"">SocketClient<br class=3D""> = s.connect(address)<br class=3D"">File = "/usr/lib64/python2.7/socket.py", line 224, in meth<br class=3D""> = return getattr(self._sock,name)(*args)<br class=3D"">error: [Errno = 2] No such file or directory<br class=3D""></blockquote><br = class=3D"">Vdsm tries to connect to supervdsmd on startup, and if it is = not running<br class=3D"">it will fail.<br class=3D""><br class=3D"">You = can do:<br class=3D""><br class=3D"">systemctl start supervdsmd<br = class=3D""><br class=3D"">And they you can run vdsmd from the shell.<br = class=3D""><br class=3D"">But why do you need to run vdsm from the = shell?<br class=3D""><br class=3D"">Nir<br class=3D""></blockquote><br = class=3D""></div></div> </div></blockquote></div><br class=3D""></div></div></div></div><br = class=3D"">_______________________________________________<br class=3D""> Users mailing list<br class=3D""> <a href=3D"mailto:Users@ovirt.org" class=3D"">Users@ovirt.org</a><br = class=3D""> <a href=3D"http://lists.ovirt.org/mailman/listinfo/users" = rel=3D"noreferrer" target=3D"_blank" = class=3D"">http://lists.ovirt.org/mailman/listinfo/users</a><br = class=3D""> <br class=3D""></blockquote></div><br class=3D""></div></div> </div></blockquote></div><br class=3D""></div></body></html>= --Apple-Mail=_C96C4A69-29F4-48A1-BD73-0F39D0714F1E--

On May 16, 2016, at 2:04 PM, Gervais de Montbrun = <gervais@demontbrun.com> wrote: =20 Hi Roy, =20 I've added /var/log/ovirt-engine/engine.log to my DropBox: = https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm?=
=20 When I look in the engine.log, I see the error "unable to load number = from serial.txt" I did a find for serial.txt and the only file that comes up is: = /etc/pki/ovirt-engine/serial.txt=20 (Makes sense that this is the file it wants to read) When I open /etc/pki/ovirt-engine/serial.txt, it is an empty file.=20 [root@cultivar ovirt-engine]# cat = /etc/pki/ovirt-engine/serial.txt [root@cultivar ovirt-engine]#=20 =20 What should be in serial.txt? =20 I don't understand where to find the other log that you mention in = your email below. I don't see any paths listed in the UI when I click on =
=20 Cheers, Gervais =20 =20 =20
On May 15, 2016, at 5:10 PM, Roy Golan <rgolan@redhat.com = <mailto:rgolan@redhat.com>> wrote: =20 =20 =20 On Sun, May 15, 2016 at 1:32 AM, Gervais de Montbrun = <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: Hey Folks, =20 I'm still stuck here... I believe the issue may be because the = Enrolling certificates stage is failing... =20 Any idea's how to sort this out? =20 I see: Installing Host Cultivar2. Enrolling certificate Failed to install Host Cultivar2. Certificate enrollment failed. Host Cultivar2 installation failed. Certificate enrollement = failed. =20 Any attempt to re-install fails at this step. =20 Cheers, Gervais =20 =20 =20 I also see in the logs that the manager (ovirt-engine) rejected the = host addition to the setup. Please add the = /var/log/ovirt-engine/engine.log from the engine machine. Also, the = engine machine has the install log - open one of the Event logs in the = UI that says you host installation failed and you will see it retrieved =
=20 =20
On May 13, 2016, at 1:36 PM, Gervais de Montbrun = <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote: =20 Hi Nir, =20 Thank you for you input. =20 It was suggest that I try starting vdsm from the shell so I could = see the output in an effort to solve my issue: = http://lists.ovirt.org/pipermail/users/2016-May/039690.html = <http://lists.ovirt.org/pipermail/users/2016-May/039690.html> =20 Cheers, Gervais =20 =20 =20
On May 13, 2016, at 12:12 PM, Nir Soffer <nsoffer@redhat.com = <mailto:nsoffer@redhat.com>> wrote: =20 On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun <gervais@demontbrun.com <mailto:gervais@demontbrun.com>> wrote:
Hi Charles, =20 I think the problem I am having is due to the setup failing and = not something in vdsm configs as I have never gotten this server to = start up properly and the BRIDGE ethernet interface + ovirt routes are not = setup. =20 I put the logs here: = https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0 = <https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5twHa?dl=3D= 0> =20 hosted-engine--deploy-logs.zip # Logs from when I tried to deploy = and it failed vdsm.tar.gz # /var/log/vdsm =20 Output from running vdsm from the command line: =20 [root@cultivar2 log]# su -s /bin/bash vdsm =20 This cannot work unless supervdsmd is running... =20 [vdsm@cultivar2 log]$ python /usr/share/vdsm/vdsm (PID: 6521) I am the actual vdsm 4.17.26-1.el7 cultivar2.grove.silverorange.com = <http://cultivar2.grove.silverorange.com/> (3.10.0-327.el7.x86_64) VDSM will run with cpu affinity: frozenset([1]) /usr/bin/taskset --all-tasks --pid --cpu-list 1 6521 (cwd None) SUCCESS: <err> =3D ''; <rc> =3D 0 Starting scheduler vdsm.Scheduler started Run and protect: registerDomainStateChangeCallback(callbackFunc=3D<functools.partial = object at 0x381b158>) Run and protect: registerDomainStateChangeCallback, Return = response: None Trying to connect to Super Vdsm Preparing MOM interface Using named unix socket /var/run/vdsm/mom-vdsm.sock Unregistering all secrests trying to connect libvirt recovery: started Setting channels' timeout to 30 seconds. Starting VM channels listener thread. Listening at 0.0.0.0:54321 <http://0.0.0.0:54321/> Adding detector <rpc.bindingxmlrpc.XmlDetector instance at = 0x3b4ecb0> recovery: completed in 0s Adding detector <yajsonrpc.stompreactor.StompDetector instance at = 0x382e5a8> Starting executor Starting worker jsonrpc.Executor/0 Worker started Starting worker jsonrpc.Executor/1 Worker started Starting worker jsonrpc.Executor/2 Worker started Starting worker jsonrpc.Executor/3 Worker started Starting worker jsonrpc.Executor/4 Worker started Starting worker jsonrpc.Executor/5 Worker started Starting worker jsonrpc.Executor/6 Worker started Starting worker jsonrpc.Executor/7 Worker started XMLRPC server running Starting executor Starting worker periodic/0 Worker started Starting worker periodic/1 Worker started Starting worker periodic/2 Worker started Starting worker periodic/3 Worker started trying to connect libvirt Panic: Connect to supervdsm service failed: [Errno 2] No such file = or directory Traceback (most recent call last): File "/usr/share/vdsm/supervdsm.py", line 78, in _connect utils.retry(self._manager.connect, Exception, timeout=3D60, =
--Apple-Mail=_A414C3C1-7D9D-40A8-893C-E947527ADD60 Content-Transfer-Encoding: quoted-printable Content-Type: text/plain; charset=us-ascii Roy, Thank you so much for asking for the logs on the engine. The serial.txt = bein blank was my issue. I found a solution in the following thread = online here: = http://lists.ovirt.org/pipermail/users/2014-November/028870.html I did the exact same thing that Dinuwan did to solve his issue back in = 2014 and was able to enrol my new server and install the hosted-engine = just fine on it. I am not certain what caused serial.txt to be empty and am will to share = logs to anyone from the oVirt team who might want to see them. For my = purposes, I'm considering this issue solved. Thanks to everyone who offered advice and suggestions. Cheers, Gervais dl=3D0 = <https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxHKa/vdsm= ?dl=3D0> the failed event for certificate enrollment. the install log to a path under your engine. Please include this as = well. tries=3D3)
File "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, = in retry return func() File "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, = in connect conn =3D Client(self._address, authkey=3Dself._authkey) File "/usr/lib64/python2.7/multiprocessing/connection.py", line = 173, in Client c =3D SocketClient(address) File "/usr/lib64/python2.7/multiprocessing/connection.py", line = 308, in SocketClient s.connect(address) File "/usr/lib64/python2.7/socket.py", line 224, in meth return getattr(self._sock,name)(*args) error: [Errno 2] No such file or directory =20 Vdsm tries to connect to supervdsmd on startup, and if it is not = running it will fail. =20 You can do: =20 systemctl start supervdsmd =20 And they you can run vdsmd from the shell. =20 But why do you need to run vdsm from the shell? =20 Nir =20 =20 =20
Users mailing list Users@ovirt.org <mailto:Users@ovirt.org> http://lists.ovirt.org/mailman/listinfo/users = <http://lists.ovirt.org/mailman/listinfo/users> =20 =20 =20
--Apple-Mail=_A414C3C1-7D9D-40A8-893C-E947527ADD60 Content-Transfer-Encoding: quoted-printable Content-Type: text/html; charset=us-ascii <html><head><meta http-equiv=3D"Content-Type" content=3D"text/html = charset=3Dus-ascii"></head><body style=3D"word-wrap: break-word; = -webkit-nbsp-mode: space; -webkit-line-break: after-white-space;" = class=3D"">Roy,<div class=3D""><br class=3D""></div><div class=3D"">Thank = you so much for asking for the logs on the engine. The serial.txt bein = blank was my issue. I found a solution in the following thread online = here: <a = href=3D"http://lists.ovirt.org/pipermail/users/2014-November/028870.html" = class=3D"">http://lists.ovirt.org/pipermail/users/2014-November/028870.htm= l</a></div><div class=3D"">I did the exact same thing that Dinuwan did = to solve his issue back in 2014 and was able to enrol my new server and = install the hosted-engine just fine on it.</div><div class=3D""><div = class=3D""><br class=3D"webkit-block-placeholder"></div><div class=3D"">I = am not certain what caused serial.txt to be empty and am will to share = logs to anyone from the oVirt team who might want to see them. For my = purposes, I'm considering this issue solved.</div><div class=3D""><br = class=3D""></div><div class=3D"">Thanks to everyone who offered advice = and suggestions.</div><div class=3D""> <div id=3D"signature" class=3D""><br class=3D"">Cheers,<br = class=3D"">Gervais<br class=3D""><br class=3D""><br class=3D""></div> </div> <br class=3D""><div><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 16, 2016, at 2:04 PM, Gervais de Montbrun <<a = href=3D"mailto:gervais@demontbrun.com" = class=3D"">gervais@demontbrun.com</a>> wrote:</div><br = class=3D"Apple-interchange-newline"><div class=3D""><meta = http-equiv=3D"Content-Type" content=3D"text/html charset=3Dus-ascii" = class=3D""><meta http-equiv=3D"Content-Type" content=3D"text/html = charset=3Dus-ascii" class=3D""><meta http-equiv=3D"Content-Type" = content=3D"text/html charset=3Dus-ascii" class=3D""><div = style=3D"word-wrap: break-word; -webkit-nbsp-mode: space; = -webkit-line-break: after-white-space;" class=3D"">Hi Roy,<div = class=3D""><br class=3D""></div><div class=3D"">I've = added /var/log/ovirt-engine/engine.log to my DropBox: <a = href=3D"https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dMQxH= Ka/vdsm?dl=3D0" = class=3D"">https://www.dropbox.com/sh/21lkl6ao3hv2r85/AADDMCK4Dp7UYgs5P6dM= QxHKa/vdsm?dl=3D0</a></div><div class=3D""><br class=3D""></div><div = class=3D"">When I look in the engine.log, I see the error "unable to = load number from serial.txt"</div><div class=3D"">I did a find for = serial.txt and the only file that comes up = is: /etc/pki/ovirt-engine/serial.txt </div><div = class=3D"">(Makes sense that this is the file it wants to = read)</div><div class=3D"">When I = open /etc/pki/ovirt-engine/serial.txt, it is an empty = file. </div><div class=3D""><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> = </span>[root@cultivar ovirt-engine]# cat = /etc/pki/ovirt-engine/serial.txt</div><span class=3D"Apple-tab-span" = style=3D"white-space:pre"> = </span>[root@cultivar ovirt-engine]# <div class=3D""><br = class=3D""></div><div class=3D"">What should be in serial.txt?</div><div = class=3D""><br class=3D""></div><div class=3D"">I don't understand where = to find the other log that you mention in your email below. I don't see = any paths listed in the UI when I click on the failed event for = certificate enrollment.<br class=3D""><div class=3D""> <div id=3D"signature" class=3D""><br class=3D"">Cheers,<br = class=3D"">Gervais<br class=3D""><br class=3D""><br class=3D""></div> </div> <br class=3D""><div class=3D""><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 15, 2016, at 5:10 PM, Roy Golan <<a = href=3D"mailto:rgolan@redhat.com" class=3D"">rgolan@redhat.com</a>> = wrote:</div><br class=3D"Apple-interchange-newline"><div class=3D""><div = dir=3D"ltr" class=3D""><br class=3D""><div class=3D"gmail_extra"><br = class=3D""><div class=3D"gmail_quote">On Sun, May 15, 2016 at 1:32 AM, = Gervais de Montbrun <span dir=3D"ltr" class=3D""><<a = href=3D"mailto:gervais@demontbrun.com" target=3D"_blank" = class=3D"">gervais@demontbrun.com</a>></span> wrote:<br = class=3D""><blockquote class=3D"gmail_quote" style=3D"margin:0 0 0 = .8ex;border-left:1px #ccc solid;padding-left:1ex"><div = style=3D"word-wrap:break-word" class=3D"">Hey Folks,<div class=3D""><br = class=3D""></div><div class=3D"">I'm still stuck here... I believe the = issue may be because the Enrolling certificates stage is = failing...</div><div class=3D""><br class=3D""></div><div class=3D"">Any = idea's how to sort this out?</div><div class=3D""><br = class=3D""></div><div class=3D"">I see:</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Installing Host = Cultivar2. Enrolling certificate</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Failed to install = Host Cultivar2. Certificate enrollment failed.</div><div class=3D""><span = style=3D"white-space:pre-wrap" class=3D""> </span>Host Cultivar2 = installation failed. Certificate enrollement failed.</div><div = class=3D""><br class=3D""></div><div class=3D"">Any attempt to = re-install fails at this step.</div><div class=3D""><div class=3D""> <div class=3D""><br class=3D"">Cheers,<br class=3D"">Gervais<br = class=3D""><br class=3D""><br = class=3D""></div></div></div></div></blockquote><div class=3D""><br = class=3D""></div><div class=3D"">I also see in the logs that the manager = (ovirt-engine) rejected the host addition to the setup. Please add the = /var/log/ovirt-engine/engine.log from the engine machine. Also, the = engine machine has the install log - open one of the Event logs in the = UI that says you host installation failed and you will see it retrieved = the install log to a path under your engine. Please include this as = well.</div><div class=3D""> </div><blockquote class=3D"gmail_quote" = style=3D"margin:0 0 0 .8ex;border-left:1px #ccc = solid;padding-left:1ex"><div style=3D"word-wrap:break-word" = class=3D""><div class=3D""><div class=3D""><div class=3D""></div> </div><div class=3D""><div class=3D"h5"> <br class=3D""><div class=3D""><blockquote type=3D"cite" class=3D""><div = class=3D"">On May 13, 2016, at 1:36 PM, Gervais de Montbrun <<a = href=3D"mailto:gervais@demontbrun.com" target=3D"_blank" = class=3D"">gervais@demontbrun.com</a>> wrote:</div><br class=3D""><div = class=3D""><div dir=3D"ltr" class=3D""><div dir=3D"auto" = style=3D"word-wrap:break-word" class=3D"">Hi Nir,<br class=3D""><br = class=3D"">Thank you for you input.<br class=3D""><br class=3D"">It was = suggest that I try starting vdsm from the shell so I could see the = output in an effort to solve my issue: <a = href=3D"http://lists.ovirt.org/pipermail/users/2016-May/039690.html" = target=3D"_blank" = class=3D"">http://lists.ovirt.org/pipermail/users/2016-May/039690.html</a>= <br class=3D""><br class=3D"">Cheers,<br class=3D"">Gervais<br = class=3D""><br class=3D""><br class=3D""><br class=3D""><blockquote = type=3D"cite" class=3D"">On May 13, 2016, at 12:12 PM, Nir Soffer <<a = href=3D"mailto:nsoffer@redhat.com" target=3D"_blank" = class=3D"">nsoffer@redhat.com</a>> wrote:<br class=3D""><br = class=3D"">On Fri, May 13, 2016 at 3:37 PM, Gervais de Montbrun<br = class=3D""><<a href=3D"mailto:gervais@demontbrun.com" target=3D"_blank"= class=3D"">gervais@demontbrun.com</a>> wrote:<br = class=3D""><blockquote type=3D"cite" class=3D"">Hi Charles,<br = class=3D""><br class=3D"">I think the problem I am having is due to the = setup failing and not<br class=3D"">something in vdsm configs as I have = never gotten this server to start up<br class=3D"">properly and the = BRIDGE ethernet interface + ovirt routes are not setup.<br class=3D""><br = class=3D"">I put the logs here:<br class=3D""><a = href=3D"https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW5tw= Ha?dl=3D0" target=3D"_blank" = class=3D"">https://www.dropbox.com/sh/5ugyykqh1lgru9l/AACXxRYWr3tgd0WbBVFW= 5twHa?dl=3D0</a><br class=3D""><br = class=3D"">hosted-engine--deploy-logs.zip # Logs from when I tried to = deploy and it<br class=3D"">failed<br class=3D"">vdsm.tar.gz # = /var/log/vdsm<br class=3D""><br class=3D"">Output from running vdsm from = the command line:<br class=3D""><br class=3D"">[root@cultivar2 log]# su = -s /bin/bash vdsm<br class=3D""></blockquote><br class=3D"">This cannot = work unless supervdsmd is running...<br class=3D""><br = class=3D""><blockquote type=3D"cite" class=3D"">[vdsm@cultivar2 log]$ = python /usr/share/vdsm/vdsm<br class=3D"">(PID: 6521) I am the actual = vdsm 4.17.26-1.el7<br class=3D""><a = href=3D"http://cultivar2.grove.silverorange.com/" target=3D"_blank" = class=3D"">cultivar2.grove.silverorange.com</a> = (3.10.0-327.el7.x86_64)<br class=3D"">VDSM will run with cpu affinity: = frozenset([1])<br class=3D"">/usr/bin/taskset --all-tasks --pid = --cpu-list 1 6521 (cwd None)<br class=3D"">SUCCESS: <err> =3D ''; = <rc> =3D 0<br class=3D"">Starting scheduler vdsm.Scheduler<br = class=3D"">started<br class=3D"">Run and protect:<br = class=3D"">registerDomainStateChangeCallback(callbackFunc=3D<functools.= partial object at<br class=3D"">0x381b158>)<br class=3D"">Run and = protect: registerDomainStateChangeCallback, Return response: None<br = class=3D"">Trying to connect to Super Vdsm<br class=3D"">Preparing MOM = interface<br class=3D"">Using named unix socket = /var/run/vdsm/mom-vdsm.sock<br class=3D"">Unregistering all secrests<br = class=3D"">trying to connect libvirt<br class=3D"">recovery: started<br = class=3D"">Setting channels' timeout to 30 seconds.<br class=3D"">Starting= VM channels listener thread.<br class=3D"">Listening at <a = href=3D"http://0.0.0.0:54321/" target=3D"_blank" = class=3D"">0.0.0.0:54321</a><br class=3D"">Adding detector = <rpc.bindingxmlrpc.XmlDetector instance at 0x3b4ecb0><br = class=3D"">recovery: completed in 0s<br class=3D"">Adding detector = <yajsonrpc.stompreactor.StompDetector instance at 0x382e5a8><br = class=3D"">Starting executor<br class=3D"">Starting worker = jsonrpc.Executor/0<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/1<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/2<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/3<br = class=3D"">Worker started<br class=3D"">Starting worker = jsonrpc.Executor/4<br class=3D"">Worker started<br class=3D"">Starting = worker jsonrpc.Executor/5<br class=3D"">Worker started<br = class=3D"">Starting worker jsonrpc.Executor/6<br class=3D"">Worker = started<br class=3D"">Starting worker jsonrpc.Executor/7<br = class=3D"">Worker started<br class=3D"">XMLRPC server running<br = class=3D"">Starting executor<br class=3D"">Starting worker periodic/0<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/1<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/2<br = class=3D"">Worker started<br class=3D"">Starting worker periodic/3<br = class=3D"">Worker started<br class=3D"">trying to connect libvirt<br = class=3D"">Panic: Connect to supervdsm service failed: [Errno 2] No such = file or<br class=3D"">directory<br class=3D"">Traceback (most recent = call last):<br class=3D"">File "/usr/share/vdsm/supervdsm.py", line 78, = in _connect<br class=3D""> utils.retry(self._manager.connect, = Exception, timeout=3D60, tries=3D3)<br class=3D"">File = "/usr/lib/python2.7/site-packages/vdsm/utils.py", line 959, in retry<br = class=3D""> return func()<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/managers.py", line 500, in<br = class=3D"">connect<br class=3D""> conn =3D Client(self._address, = authkey=3Dself._authkey)<br class=3D"">File = "/usr/lib64/python2.7/multiprocessing/connection.py", line 173, in<br = class=3D"">Client<br class=3D""> c =3D SocketClient(address)<br = class=3D"">File "/usr/lib64/python2.7/multiprocessing/connection.py", = line 308, in<br class=3D"">SocketClient<br class=3D""> = s.connect(address)<br class=3D"">File = "/usr/lib64/python2.7/socket.py", line 224, in meth<br class=3D""> = return getattr(self._sock,name)(*args)<br class=3D"">error: [Errno = 2] No such file or directory<br class=3D""></blockquote><br = class=3D"">Vdsm tries to connect to supervdsmd on startup, and if it is = not running<br class=3D"">it will fail.<br class=3D""><br class=3D"">You = can do:<br class=3D""><br class=3D"">systemctl start supervdsmd<br = class=3D""><br class=3D"">And they you can run vdsmd from the shell.<br = class=3D""><br class=3D"">But why do you need to run vdsm from the = shell?<br class=3D""><br class=3D"">Nir<br class=3D""></blockquote><br = class=3D""></div></div> </div></blockquote></div><br class=3D""></div></div></div></div><br = class=3D"">_______________________________________________<br class=3D""> Users mailing list<br class=3D""> <a href=3D"mailto:Users@ovirt.org" class=3D"">Users@ovirt.org</a><br = class=3D""> <a href=3D"http://lists.ovirt.org/mailman/listinfo/users" = rel=3D"noreferrer" target=3D"_blank" = class=3D"">http://lists.ovirt.org/mailman/listinfo/users</a><br = class=3D""> <br class=3D""></blockquote></div><br class=3D""></div></div> </div></blockquote></div><br = class=3D""></div></div></div></blockquote></div><br = class=3D""></div></body></html>= --Apple-Mail=_A414C3C1-7D9D-40A8-893C-E947527ADD60--
participants (4)
-
Charles Tassell
-
Gervais de Montbrun
-
Nir Soffer
-
Roy Golan