Moving datacenters & IP addresses with minimal downtime
by David White
So it looks like I'm going to move to a new datacenter. I went into somewhere cheap on a month-to-month contract earlier this year, and they've been a pain to deal with. At the same time, I've grown a lot faster than expected, so I've decided to move into a better, more reputable datacenter sooner rather than later.
I currently have HCI with 3 servers. Within the cluster, I can tolerate a host "failure", so I can run on 2 servers. I have a 4th server that I haven't really been doing anything yet, but I think will be a good asset to me as I move, to keep things to a minimum. One of my production nodes, as well as my 4th server don't have the right storage, so if/when I move things, I'll need to configure Gluster to run on a single node for a time.
I'd like to ask about best practices here.
Before I move any hardware, I'm thinking about removing 1 of my 3 production servers from the existing HCI cluster, and installing a NEW hyperconverged single node onto this production server with Gluster storage. Then, add my 4th server to that new oVirt environment. Once I did that, I'm wondering if I could clone 100% of the VMs from the 1 environment over to the other.
Once that was done, I could move one of my oVirt clusters to the new datacenter, update DNS for the virtual servers, run a final rsync, and I'm done -- then I would just move the other two servers, do a fresh install of oVirt, and add them to the new cluster.
Is this possible?
Is there a better way to do this move?
Thanks,David
Sent with ProtonMail Secure Email.
3 years, 6 months
Unable to change Mail Recipient for Event Notifications
by Rodrigo G. López
Hello all,
There seems to be a problem with oVirt 4.4.7.7 when modifying the Mail
Recipient field in the Add Event Notification modal with the `admin'
user. It defaults to `root@localhost' and I can't change it no matter
what I try. It happens whether I check some Events for Notification or
not, and the ovirt-engine-notifier service is also running healthily.
Is this known behavior? What am I missing? I'm following §21 of the
Administration Guide.
Best,
-rodri
3 years, 6 months
I can't download console.rdp on VM Portal with Firefox
by Dominique D
I am testing the lastest version of ovirt-engine 4.4.7.7-1 before updating.
when I log into the VM portal and want to download the Remote Destop file (console.rdp) from a Windows VM, the file I receive is console.vv instead of console.rdp. So I have to rename the file to console.rdp.
This problem is only with firefox. Edge and google chrome don't have this problem.
Do you know why ?
3 years, 6 months
Faster local disk for VM
by Shantur Rathore
Hi Users,
Does oVirt support node local disk cache or scratch / transient disk for
faster disk access?
The idea is similar to options we get on cloud where one can attach fast
node local scratch disk/ssd to the VM which gets deleted when VM dies.
Thanks in advance.
Kind Regards,
Shantur
3 years, 6 months
oVirt keeps crashing
by Staniforth, Paul
Hello,
today we have a server that is repeatedly crashing after a few minutes after startup.
the output of systemctl status ovirt-engine.service
● ovirt-engine.service - oVirt Engine
Loaded: loaded (/usr/lib/systemd/system/ovirt-engine.service; enabled; vendor preset: disabled)
Active: failed (Result: exit-code) since Fri 2021-08-20 11:52:47 BST; 30s ago
Process: 6606 ExecStart=/usr/share/ovirt-engine/services/ovirt-engine/ovirt-engine.py --redirect-output --systemd=notify $EXTRA_ARGS start (code=exited, status=1/FAILURE)
Main PID: 6606 (code=exited, status=1/FAILURE)
Aug 20 11:47:12 hostname systemd[1]: Starting oVirt Engine...
Aug 20 11:47:12 hostname systemd[1]: Started oVirt Engine.
Aug 20 11:47:13 hostname ovirt-engine.py[6606]: 2021-08-20 11:47:13,760+0100 ovirt-engine: INFO _detectJBossVersion:186 Detecting JBoss version. Running: /usr/lib/jvm/jre-11-openjdk-11.0.11.0.>
Aug 20 11:47:14 hostname ovirt-engine.py[6606]: 2021-08-20 11:47:14,676+0100 ovirt-engine: INFO _detectJBossVersion:206 Return code: 0, | stdout: '['WildFly Full 23.0.2.Final (WildFly Core 15>
Aug 20 11:52:47 histname ovirt-engine.py[6606]: 2021-08-20 11:52:47,434+0100 ovirt-engine: ERROR run:542 Error: process terminated with status code 1
Aug 20 11:52:47 hostname systemd[1]: ovirt-engine.service: Main process exited, code=exited, status=1/FAILURE
Aug 20 11:52:47 hostname systemd[1]: ovirt-engine.service: Failed with result 'exit-code'.
Thanks for any help,
Regards,
Paul S.
To view the terms under which this email is distributed, please go to:-
https://leedsbeckett.ac.uk/disclaimer/email
3 years, 6 months
Ovirt-4.4.7 Adding host to existing cluster will attempt to run ovirt-hosted-engine-setup
by Paul-Erik Törrönen
I have an existing oVirt-installation to which I want to add a new host
(from the UI).
However the adding fails because oVirt tries to then install the
ovirt-hosted-engine-setup despite the fact that in the New host-dialog I
left the Choose hosted engine deployment action as None.
As a result the installation then aborts:
2021-08-15 13:17:37 EEST - TASK [ovirt-host-deploy-vdsm : Install
ovirt-hosted-engine-setup package] ******
2021-08-15 13:17:37 EEST -
2021-08-15 13:17:37 EEST - fatal: [new.host]: FAILED! => {"changed":
false, "failures": ["No package ovirt-hosted-engine-setup available."],
"msg": "Failed to install some of the specified packages", "rc": 1,
"results": []}
2021-08-15 13:17:37 EEST - {
"status" : "OK",
"msg" : "",
"data" : {
"uuid" : "98a244fb-15da-4192-a8fc-b23a5e5b7594",
"counter" : 55,
"stdout" : "fatal: [new.host]: FAILED! => {\"changed\": false,
\"failures\": [\"No package ovirt-hosted-engine-setup available.\"],
\"msg\": \"Failed to install some of the specified packages\", \"rc\":
1, \"results\": []}",
"start_line" : 47,
"end_line" : 48,
"runner_ident" : "fe710372-fdb1-11eb-81f0-ecf4bb63099f",
"event" : "runner_on_failed",
"pid" : 24931,
"created" : "2021-08-15T10:17:35.963320",
"parent_uuid" : "ecf4bb63-099f-2448-1b83-0000000001a7",
"event_data" : {
"playbook" : "ovirt-host-deploy.yml",
"playbook_uuid" : "8f70f945-4008-4159-b469-7ff04c9c242b",
"play" : "all",
"play_uuid" : "ecf4bb63-099f-2448-1b83-000000000006",
"play_pattern" : "all",
"task" : "Install ovirt-hosted-engine-setup package",
"task_uuid" : "ecf4bb63-099f-2448-1b83-0000000001a7",
"task_action" : "yum",
"task_args" : "",
"task_path" :
"/usr/share/ovirt-engine/ansible-runner-service-project/project/roles/ovirt-host-deploy-vdsm/tasks/packages.yml:6",
"role" : "ovirt-host-deploy-vdsm",
"host" : "new.host",
"remote_addr" : "new.host",
"res" : {
"msg" : "Failed to install some of the specified packages",
"failures" : [ "No package ovirt-hosted-engine-setup available." ],
"results" : [ ],
"rc" : 1,
"invocation" : {
"module_args" : {
"name" : [ "ovirt-hosted-engine-setup" ],
"state" : "present",
"allow_downgrade" : false,
"autoremove" : false,
"bugfix" : false,
"disable_gpg_check" : false,
"disable_plugin" : [ ],
"disablerepo" : [ ],
"download_only" : false,
"enable_plugin" : [ ],
"enablerepo" : [ ],
...
Poltsi
3 years, 7 months
about the clonetype of the vm disk when vm is cloned from template
by Tommy Sway
There are some options for clone disk for other platform like oVirt.
--clonetype [thin|full]
A thin clone uses copy-on-write (COW) reflinked
clone meaning it is dependent on the base template, but enables rapid
cloning.
Full clone will make a full physical copy hence not
be dependent on the base template.
Thin clone uses less space and will not be as performant as full clone,
especially under heavy write workloads.
By default a thin clone is created, great for
testing; should be avoided in production environments.
I want to know which type is oVirt using when we clone vm from template ?
3 years, 7 months
Hosted engine on HCI cluster is not running
by David White
Hello,
It appears that my Manager / hosted-engine isn't working, and I'm unable to get it to start.
I have a 3-node HCI cluster, but right now, Gluster is only running on 1 host (so no replication).
I was hoping to upgrade / replace the storage on my 2nd host today, but aborted that maintenance when I found that I couldn't even get into the Manager.
The storage is mounted, but here's what I see:
> [root@cha2-storage dwhite]# hosted-engine --vm-statusThe hosted engine configuration has not been retrieved from shared storage. Please ensure that ovirt-ha-agent is running and the storage server is reachable.
>
> [root@cha2-storage dwhite]# systemctl status ovirt-ha-agent● ovirt-ha-agent.service - oVirt Hosted Engine High Availability Monitoring Agent
> Loaded: loaded (/usr/lib/systemd/system/ovirt-ha-agent.service; enabled; vendor preset: disabled)
> Active: active (running) since Fri 2021-08-13 11:10:51 EDT; 2h 44min ago
> Main PID: 3591872 (ovirt-ha-agent)
> Tasks: 1 (limit: 409676)
> Memory: 21.5M
> CGroup: /system.slice/ovirt-ha-agent.service
> └─3591872 /usr/libexec/platform-python /usr/share/ovirt-hosted-engine-ha/ovirt-ha-agent
>
> Aug 13 11:10:51 cha2-storage.mgt.barredowlweb.com systemd[1]: Started oVirt Hosted Engine High Availability Monitoring Agent.
Any time I try to do anything like connect the engine storage, disconnect the engine storage, or connect to the console, it just sits there, and doesn't do anything, and I eventually have to ctl-c out of it.
Maybe I have to be patient? When I ctl-c, I get a trackback error:
> [root@cha2-storage dwhite]# hosted-engine --console^CTraceback (most recent call last):
> File "/usr/lib64/python3.6/runpy.py", line 193, in _run_module_as_main
>
> "__main__", mod_spec)
> File "/usr/lib64/python3.6/runpy.py", line 85, in _run_code
> exec(code, run_globals)
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py", line 214, in <module>
> [root@cha2-storage dwhite]# args.command(args)
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py", line 42, in func
> f(*args, **kwargs)
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_setup/vdsm_helper.py", line 91, in checkVmStatus
> cli = ohautil.connect_vdsm_json_rpc()
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line 472, in connect_vdsm_json_rpc
> __vdsm_json_rpc_connect(logger, timeout)
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/util.py", line 395, in __vdsm_json_rpc_connect
> timeout=timeout)
> File "/usr/lib/python3.6/site-packages/vdsm/client.py", line 154, in connect
> outgoing_heartbeat=outgoing_heartbeat, nr_retries=nr_retries)
> File "/usr/lib/python3.6/site-packages/yajsonrpc/stompclient.py", line 426, in SimpleClient
> nr_retries, reconnect_interval)
> File "/usr/lib/python3.6/site-packages/yajsonrpc/stompclient.py", line 448, in StandAloneRpcClient
> client = StompClient(utils.create_connected_socket(host, port, sslctx),
> File "/usr/lib/python3.6/site-packages/vdsm/utils.py", line 379, in create_connected_socket
> sock.connect((host, port))
> File "/usr/lib64/python3.6/ssl.py", line 1068, in connect
> self._real_connect(addr, False)
> File "/usr/lib64/python3.6/ssl.py", line 1059, in _real_connect
> self.do_handshake()
> File "/usr/lib64/python3.6/ssl.py", line 1036, in do_handshake
> self._sslobj.do_handshake()
> File "/usr/lib64/python3.6/ssl.py", line 648, in do_handshake
> self._sslobj.do_handshake()
This is what I see in /var/log/ovirt-hosted-engine-ha/broker.log:
> MainThread::WARNING::2021-08-11 10:24:41,596::storage_broker::100::ovirt_hosted_engine_ha.broker.storage_broker.StorageBroker::(__init__) Can't connect vdsm storage: Connection to storage server failed
> MainThread::ERROR::2021-08-11 10:24:41,596::broker::69::ovirt_hosted_engine_ha.broker.broker.Broker::(run) Failed initializing the broker: Connection to storage server failed
> MainThread::ERROR::2021-08-11 10:24:41,598::broker::71::ovirt_hosted_engine_ha.broker.broker.Broker::(run) Traceback (most recent call last):
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/broker/broker.py", line 64, in run
> self._storage_broker_instance = self._get_storage_broker()
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/broker/broker.py", line 143, in _get_storage_broker
> return storage_broker.StorageBroker()
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/broker/storage_broker.py", line 97, in __init__
> self._backend.connect()
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/storage_backends.py", line 375, in connect
> sserver.connect_storage_server()
> File "/usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/lib/storage_server.py", line 451, in connect_storage_server
> 'Connection to storage server failed'
> RuntimeError: Connection to storage server failed
>
> MainThread::ERROR::2021-08-11 10:24:41,599::broker::72::ovirt_hosted_engine_ha.broker.broker.Broker::(run) Trying to restart the broker
> MainThread::INFO::2021-08-11 10:24:42,439::broker::47::ovirt_hosted_engine_ha.broker.broker.Broker::(run) ovirt-hosted-engine-ha broker 2.4.7 started
> MainThread::INFO::2021-08-11 10:24:44,442::monitor::45::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Searching for submonitors in /usr/lib/python3.6/site-packages/ovirt_hosted_engine_ha/broker/submonitors
> MainThread::INFO::2021-08-11 10:24:44,443::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor cpu-load
> MainThread::INFO::2021-08-11 10:24:44,449::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor cpu-load-no-engine
> MainThread::INFO::2021-08-11 10:24:44,450::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor engine-health
> MainThread::INFO::2021-08-11 10:24:44,451::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mem-free
> MainThread::INFO::2021-08-11 10:24:44,451::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor mgmt-bridge
> MainThread::INFO::2021-08-11 10:24:44,452::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor network
> MainThread::INFO::2021-08-11 10:24:44,452::monitor::62::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Loaded submonitor storage-domain
> MainThread::INFO::2021-08-11 10:24:44,452::monitor::63::ovirt_hosted_engine_ha.broker.monitor.Monitor::(_discover_submonitors) Finished loading submonitors
And I see this in /var/log/vdsm/vdsm.log:
> 2021-08-13 14:08:10,844-0400 ERROR (Reactor thread) [ProtocolDetector.AcceptorImpl] Unhandled exception in acceptor (protocoldetector:76)
> Traceback (most recent call last):
> File "/usr/lib64/python3.6/asyncore.py", line 108, in readwrite
> File "/usr/lib64/python3.6/asyncore.py", line 417, in handle_read_event
> File "/usr/lib/python3.6/site-packages/yajsonrpc/betterAsyncore.py", line 57, in handle_accept
> File "/usr/lib/python3.6/site-packages/yajsonrpc/betterAsyncore.py", line 173, in _delegate_call
> File "/usr/lib/python3.6/site-packages/vdsm/protocoldetector.py", line 53, in handle_accept
> File "/usr/lib64/python3.6/asyncore.py", line 348, in accept
> File "/usr/lib64/python3.6/socket.py", line 205, in accept
> OSError: [Errno 24] Too many open files
Can anyone help?
Sent with ProtonMail Secure Email.
3 years, 7 months
Unable to connect to the Graphic server
by Gangi Reddy
We are receiving below error while attempt to connect SPICE console. No block in network firewall.
Version: Software Version:4.4.6.7-1.el8
Error:
Unable to connect to the Graphic server .vv
3 years, 7 months