Fwd: vGPU VM not starting
by Ales Musil
On Thu, May 17, 2018 at 12:01 AM, Callum Smith <callum(a)well.ox.ac.uk> wrote:
> Dear All,
>
> Our vGPU installation is progressing, though the VM is failing to start.
>
> 2018-05-16 22:57:34,328+0100 ERROR (vm/1bc9dae8) [virt.vm]
> (vmId='1bc9dae8-a0ea-44b3-9103-5805100648d0') The vm start process failed
> (vm:943)
> Traceback (most recent call last):
> File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 872, in
> _startUnderlyingVm
> self._run()
> File "/usr/lib/python2.7/site-packages/vdsm/virt/vm.py", line 2872, in
> _run
> dom.createWithFlags(flags)
> File "/usr/lib/python2.7/site-packages/vdsm/common/libvirtconnection.py",
> line 130, in wrapper
> ret = f(*args, **kwargs)
> File "/usr/lib/python2.7/site-packages/vdsm/common/function.py", line
> 92, in wrapper
> return func(inst, *args, **kwargs)
> File "/usr/lib64/python2.7/site-packages/libvirt.py", line 1099, in
> createWithFlags
> if ret == -1: raise libvirtError ('virDomainCreateWithFlags() failed',
> dom=self)
> libvirtError: Cannot get interface MTU on '': No such device
>
> That's the specific error, some other information. It seems the GPU
> 'allocation' of uuid against the nvidia-xx mdev type is proceeding
> correctly, and the device is being created by the VM instantiation but the
> VM does not succeed in going up with this error. Any other logs or
> information relevant to help diagnose?
>
> Regards,
> Callum
>
> --
>
> Callum Smith
> Research Computing Core
> Wellcome Trust Centre for Human Genetics
> University of Oxford
> e. callum(a)well.ox.ac.uk
>
>
> _______________________________________________
> Users mailing list -- users(a)ovirt.org
> To unsubscribe send an email to users-leave(a)ovirt.org
>
>
Hi Callum,
can you share your version of the setup?
Also do you use OVS switch type in the cluster?
Regards,
Ales.
--
ALES MUSIL
INTERN - rhv network
Red Hat EMEA <https://www.redhat.com/>
amusil(a)redhat.com IM: amusil
<https://red.ht/sig>
6 years, 5 months
disk performance findings
by william.dossett@gmail.com
I have deployed a HCI environment several times now as I wanted to get some
idea of disk performance with Gluster/
On a Dell R720 3 node cluster with H710 PERC controller and 8 x 2TB 7200 RPM
SATA drives.
My first test was with the 8 drives configured as H/W RAID 6 and I
configured Gluster as RAID 6 - quite a lot of redundancy but that was just
my first deployment
Running IOMeter for 24 hours using all in one access specification I got 240
IOPs. Pretty good for SATA drives.
I then broke the RAID and configured 8 virtual disks, one per physical and
then deployed Gluster as JBOD - I am not sure how resilient that is, but I
assume in a 3 node cluster failures to tolerate would be one.
This gave me 267 IOPs.
I don't know that much about the internals of Gluster, but when I first
asked about this there didn't seem to be much knowledge of what
configuration would be best for HCI. I plan to do more research and tests
on this, but for what its worth for now, I am going down the JBOD route with
no H/W RAID.
Regards
Bill
6 years, 5 months
Restore and rename engine
by Staniforth, Paul
Hello,
I have been trying to test our system by restoring from backup to a new machine and running ovirt-engine-rename, then engine-setup.
I get to sent back to the main oVirt Web page from any of the portals, which has an Warning triangle saying the engine is initializing. and in the URL it has
ovirt-engine/?error_description=server_error%3A+%2Fetc%2Fpki%2Fjava%2Fcacerts+(No+such+file+or+directory)&error=server_error
The original machine had a certificate signed by an external authority.
I tried putting back the link from apache-ca.pem to ca.pem, the original apache.key.nopass, websocket-proxy.key.nopass as well as hte key files.
I upgraded from 4.1.9 to 4.2.3 and it says it is resigning the certificates but still fails.
Regards,
Paul S.
To view the terms under which this email is distributed, please go to:-
http://disclaimer.leedsbeckett.ac.uk/disclaimer/disclaimer.html
6 years, 5 months
quick way to start glusterfs managed engine over?
by william.dossett@gmail.com
I have successfully got glusterfs playbook to finish once and then my
networking was wrong on the managed engine and on the hosts, so I started
over.
Now I can't seem to even get that far again. I completely reloaded the
hosts, but that takes a while. Last time glusterfs failed right at the end
with
"host is not in 'Peer in Cluster' state"
Redeploying fails as partitions already exist.
I am deploying to /dev/sdb
Is there a quick way to get /dev/sdb back into a state to redeploy to? I
think I got there once by doing pvremove and gdisk zap gpt, but not sure if
that worked or caused problem on the redeploy.
Its very time consuming to start over from scratch, so would appreciate if
there is any shortcut instead of reloading the OS.
Thans
Bill
6 years, 5 months
Export as OVA failed
by du_hongyu@yeah.net
Hi
I want to export my vm, I shutdown my vm, then from the three-dot dropdown select "Export as OVA"
but /var/log/ovirt-engine/engine.log has Exception fail.
Regards
Hongyu Du
6 years, 5 months
OVirt SHE deployment failed - 4.2.2 / 4.2.3
by jeanbaptiste.coupiac@nfrance.com
Hello List 😊
I’m facing an issue regarding Self Hosted Engine appliance deployment on OVirt 4.2( tested on 4.2.2 and 4.2.3).
During setup tool, I’m unable to mount a remote storage NFS / iSCSI.
When I try to mount an NFS share:
[ INFO ] TASK [Remove host-deploy configuration file]
[ INFO ] changed: [localhost]
Please specify the storage you would like to use (glusterfs, iscsi, fc, nfs)[nfs]: nfs
Please specify the nfs version you would like to use (auto, v3, v4, v4_1)[auto]: xxxxxx:/YYYYYY
[ ERROR ] Invalid value
Please specify the nfs version you would like to use (auto, v3, v4, v4_1)[auto]:
Please specify the full shared storage connection path to use (example: host:/path): xxxxxx:/YYYYYY
If needed, specify additional mount options for the connection to the hosted-engine storagedomain []:
[ INFO ] Creating Storage Domain
[ INFO ] TASK [Gathering Facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Check local VM dir stat]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Enforce local VM dir existence]
[ INFO ] skipping: [localhost]
[ INFO ] TASK [include_tasks]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Obtain SSO token using username/password credentials]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch host facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch cluster ID]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch cluster facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter ID]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter name]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Add NFS storage domain]
[ ERROR ] Verify permission settings on the specified storage path.]". HTTP response code is 400.
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "Fault reason is \"Operation Failed\". Fault detail is \"[Permission settings on the specified path do not allow access to the storage.\nVerify permission settings on the specified storage path.]\". HTTP response code is 400."}
When I try to mount an iSCSI LUN:
Please specify the storage you would like to use (glusterfs, iscsi, fc, nfs)[nfs]: iscsi
Please specify the iSCSI portal IP address: XXXXXX
Please specify the iSCSI portal port [3260]:
Please specify the iSCSI discover user:
Please specify the iSCSI discover password:
Please specify the iSCSI portal login user:
Please specify the iSCSI portal login password:
[ INFO ] Discovering iSCSI targets
[ INFO ] TASK [Gathering Facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [include_tasks]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Obtain SSO token using username/password credentials]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Prepare iSCSI parameters]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch host facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [iSCSI discover with REST API]
[ INFO ] ok: [localhost]
The following targets have been found:
[1] iqn.1992-04.com.emc:cx.aaaaaaaaaaaaaaaaa
TPGT: 7, portals:
XXXXXXX:3260
[2] iqn.1992-04.com.emc:cx. aaaaaaaaaaaaaaaab
TPGT: 6, portals:
XXXXXXX:3260
[3] iqn.1992-04.com.emc:cx. aaaaaaaaaaaaaaaac
TPGT: 8, portals:
XXXXXX:3260
[4] iqn.1992-04.com.emc:cx. aaaaaaaaaaaaaaaad
TPGT: 5, portals:
XXXXXXX:3260
Please select a target (1, 2, 3, 4) [1]: 4
[ INFO ] Getting iSCSI LUNs list
[ INFO ] TASK [Gathering Facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [include_tasks]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Obtain SSO token using username/password credentials]
[ INFO ] ok: [localhost]
[ INFO ] TASK [iSCSI login]
[ INFO ] TASK [Get iSCSI LUNs]
[ INFO ] ok: [localhost]
The following luns have been found on the requested target:
[1] 36006016045004300897d2b5bdfea8478 200GiB DGC VRAID
status: free, paths: 1 active
Please select the destination LUN (1) [1]: 1
[ INFO ] iSCSI discard after delete is disabled
[ INFO ] Creating Storage Domain
[ INFO ] TASK [Gathering Facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Check local VM dir stat]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Enforce local VM dir existence]
[ INFO ] skipping: [localhost]
[ INFO ] TASK [include_tasks]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Obtain SSO token using username/password credentials]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch host facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch cluster ID]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch cluster facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter facts]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter ID]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Fetch Datacenter name]
[ INFO ] ok: [localhost]
[ INFO ] TASK [Add NFS storage domain]
[ INFO ] skipping: [localhost]
[ INFO ] TASK [Add glusterfs storage domain]
[ INFO ] skipping: [localhost]
[ INFO ] TASK [Add iSCSI storage domain]
[ ERROR ] Error: Fault reason is "Operation Failed". Fault detail is "[Network error during communication with the Host.]". HTTP response code is 400.
[ ERROR ] fatal: [localhost]: FAILED! => {"changed": false, "msg": "Fault reason is \"Operation Failed\". Fault detail is \"[Network error during communication with the Host.]\". HTTP response code is 400."}
Some unnecessary informations like IP or IQN were hidden
I’m pretty disturbed since iSCSI issue seem resoled since 4.2.2 release: https://bugzilla.redhat.com/show_bug.cgi?id=1529226
Did you have same issue to deploy SHE since 4.2* ? ovirt-hosted-engine-setup seems not working has expected
Regards,
Jean-Baptiste,
6 years, 5 months
oVirt export VM as OVA
by du_hongyu@yeah.net
Hi
I want to export my vm by "Eports as OVA", but i don't know what to write in Directory?
Regards
Hongyu Du
6 years, 5 months
Ignore extra parameters in oVirt API
by Hari Prasanth Loganathan
Hi Team,
I want to attach the disk using the oVIrt rest API, I use the version* 4.2*
and completed my script.
But when I downgrade my oVirt to lower version *4.1*, I get the following
error.
detail: 'For correct usage, see:
https://X.X.99.84/ovirt-engine/api/v4/model#services/disk-attachments/met...',\n
reason: 'Request syntactically incorrect.',\n error: 'For correct usage,
see:
https://X.X.99.84/ovirt-engine/api/v4/model#services/disk-attachments/met...
',\n
*Reason*: I added an extra parameter called 'isSharable' which is not
expected in this API.
*So Is there a way to Ignore the extra parameters sent for oVirt API?*
*Example :*
*Expected :*
*{*
* "a" : "1"*
*}*
*I sent :*
*{*
* "a" : "1",*
* "b" : "2"*
*}*
*My expectation is, Ignore the "b" and the API should work, Is there a flag
in oVirt API which ignores the extra parameters? *
Thanks,
Hari
6 years, 5 months
Failed to execute stage 'Misc configuration': Command '/sbin/service' failed to execute
by patrickstar12358@hotmail.com
Hi man,I wanna to install ovirt 3.5 via allinone deploy,but I encoutered some issues.
stage:
#yum update
\\ I set some enviroment parameters like as: hostname,static ip ...etc
#yum install http://resources.ovirt.org/pub/yum-repo/ovirt-release35.rpm
\\Skipped "yum install -y ovirt-engine"
#yum install -y ovirt-engine-setup-plugin-allinone
#engine-setup
[ INFO ] Stage: Setup validation
[WARNING] Less than 16384MB of memory is available
--== CONFIGURATION PREVIEW ==--
Application mode : both
Update Firewall : False
Host FQDN : node.luccitech.com
Engine database name : engine
Engine database secured connection : False
Engine database host : localhost
Engine database user name : engine
Engine database host name validation : False
Engine database port : 5432
Engine installation : True
PKI organization : luccitech.com
Configure VDSM on this host : True
Local storage domain directory : /var/lib/images
Configure local Engine database : True
Set application as default page : True
Configure Apache SSL : True
Reports installation : False
Configure local Reports database : False
Engine Host FQDN : node.luccitech.com
Configure WebSocket Proxy : True
Please confirm installation settings (OK, Cancel) [OK]:
[ INFO ] Stage: Transaction setup
[ INFO ] Stopping reports service
[ INFO ] Stopping engine service
[ INFO ] Stopping ovirt-fence-kdump-listener service
[ INFO ] Stopping websocket-proxy service
[ INFO ] Stage: Misc configuration
[ INFO ] Stage: Package installation
[ INFO ] Stage: Misc configuration
[ INFO ] Creating PostgreSQL 'engine' database
[ ERROR ] Failed to execute stage 'Misc configuration': Command '/sbin/service' failed to execute
[ INFO ] Yum Performing yum transaction rollback
[ INFO ] Stage: Clean up
Log file is located at /var/log/ovirt-engine/setup/ovirt-engine-setup-20180621172148-ojd6fg.log
[ INFO ] Generating answer file '/var/lib/ovirt-engine/setup/answers/20180621172411-setup.conf'
[ INFO ] Stage: Pre-termination
[ INFO ] Stage: Termination
[ ERROR ] Execution of setup failed
I have no idea to deal it...
Best regard.
6 years, 5 months