Backup: how to download only used extents from imageio backend
by Michael Ablassmeier
hi,
im currently looking at the new incremental backup api that has been
part of the 4.4 and RHV 4.4-beta release. So far i was able to create
full/incremental backups and restore without any problem.
Now, using the backup_vm.py example from the ovirt-engine-sdk i get
the following is happening during a full backup:
1) imageio client api requests transfer
2) starts qemu-img to create a local qemu image with same size
3) starts qemu-nbd to serve this image
4) reads used extents from provided imageio source, passes data to
qemu-nbd process
5) resulting file is a thin provisioned qcow image with the actual
data of the VM's used space.
while this works great, it has one downside: if i backup a virtual
machine with lots of used extents, or multiple virtual machines at the
same time, i may run out of space, if my primary backup target is
not a regular disk.
Imagine i want to stream the FULL backup to tape directly like
backup_vm.py full [..] <vm_uuid> /dev/nst0
thats currently not possible, because qemu-img is not able to open
a tape device directly, given its nature of the qcow2 format.
So what iam basically looking for, is a way to download only the extents
from the imageio server that are really in use, not depending on qemu-*
tools, to be able to pipe the data somehwere else.
Standard tools, like for example curl, will allways download the full
provisioned image from the imageio backend (of course).
I noticed is that it is possible to query the extents via:
https://tranfer_node:54322/images/d471c659-889f-4e7f-b55a-a475649c48a6/ex...
As i failed to find them, are there any existing functions/api calls
that could be used to download only the used extents to a file/fifo
pipe?
So far, i played around with the _internal.io.copy function, beeing able
to at least read the data into a in memory BytesIO stream, but thats not
the solution to my "problem" :)
bye,
- michael
4 years, 4 months
Re: [ovirt-users] Re: ETL service aggregation error
by Shirly Radco
Hi,
I was unable to view your log file. But if its indeed an the issue please
run:
systemctl stop ovirt-engine-dwhd
ovirt_engine_history=# UPDATE history_configuration set var_datetime =
date_trunc('hour', now())- interval '2 hour' WHERE var_name =
'lastHourAggr';
ovirt_engine_history=# UPDATE history_configuration set var_datetime =
date_trunc('hour', now())- interval '1 day' WHERE var_name = 'lastDayAggr';
systemctl restart ovirt-engine-dwhd
If you would like me to review the errors please attach the dwh log file.
Best regards,
--
Shirly Radco
BI Principal Software Engineer
Red Hat <https://www.redhat.com/>
<https://www.redhat.com/>
On Fri, Jun 26, 2020 at 10:28 PM Staniforth, Paul <
P.Staniforth(a)leedsbeckett.ac.uk> wrote:
> Hello Ayansh,
> It looks like the lastHourAgg is wrong is should
> be 2 hours before the runTime but is 1:30 hours before runtime.
>
> Try
>
> systemctl stop ovirt-engine-dwhd
>
> in ovirt_engine_history database
>
> UPDATE history_configuration set var_datetime = var_datetime - interval '24 hour' WHERE var_name = 'lastHourAggr';
>
> UPDATE history_configuration set var_datetime = var_datetime - interval '1 day' WHERE var_name = 'lastDayAggr';
>
> systemctl stop ovirt-engine-dwhd
>
>
> see https://www.mail-archive.com/users@ovirt.org/msg53055.html
>
>
> Regards,
>
> Paul Staniforth
>
> School of Built Environment Engineering and Computing.
>
> Leeds Beckett University
>
> Networked Systems Analyst, Research and Engineering Support.
> Technical Lead Architect IMS and oVirt Virtualization
>
> tel: +44 (0)113 28123754
> email: p.staniforth(a)leedsbeckett.ac.uk
>
> <https://www.gofundme.com/f/bytes-for-heroes>
>
> ------------------------------
> *From:* Ayansh Rocks <shashank123rastogi(a)gmail.com>
> *Sent:* 26 June 2020 17:10
> *To:* Staniforth, Paul <P.Staniforth(a)leedsbeckett.ac.uk>; users <
> users(a)ovirt.org>; devel(a)ovirt.org <devel(a)ovirt.org>
> *Subject:* Re: [ovirt-users] Re: ETL service aggregation error
>
>
> *Caution External Mail:* Do not click any links or open any attachments
> unless you trust the sender and know that the content is safe.
> Can we fix this any how guys.
>
> On Thu, Jun 25, 2020 at 5:53 PM Ayansh Rocks <shashank123rastogi(a)gmail.com>
> wrote:
>
> Is it a bug ?
>
>
> On Thu, Jun 25, 2020 at 4:11 PM Ayansh Rocks <shashank123rastogi(a)gmail.com>
> wrote:
>
> See below as well if it helps -
>
> [root@iondelvm149 ~]# date
> Thu Jun 25 16:09:45 IST 2020
> [root@iondelvm149 ~]# psql -h localhost -U ovirt_engine_history -d
> ovirt_engine_history
> Password for user ovirt_engine_history:
> psql (9.2.24, server 10.6)
> WARNING: psql version 9.2, server version 10.0.
> Some psql features might not work.
> Type "help" for help.
>
> ovirt_engine_history=> select * from history_configuration;
> var_name | var_value | var_datetime
> -------------------+-----------+------------------------
> default_language | en_US |
> firstSync | false | 2018-03-23 12:27:00-04
> lastHourAggr | | 2020-06-25 05:00:00-04
> HourlyAggFailed | false |
> lastDayAggr | | 2020-06-24 01:00:00-04
> MinimalETLVersion | 4.3.0 |
> (6 rows)
>
> ovirt_engine_history=>
>
> On Thu, Jun 25, 2020 at 4:06 PM Staniforth, Paul <
> P.Staniforth(a)leedsbeckett.ac.uk> wrote:
>
> Hello Shashank,
> is it on the hour? it sounds like it got a
> problem aggregating the samples to create the hourly samples.
>
> I had to update the java memory usage as it was running out of heap memory.
>
> The defaults are
>
> DWH_HEAP_MIN=1g
> DWH_HEAP_MAX=1g
>
> see
>
> /usr/share/ovirt-engine-dwh/services/ovirt-engine-dwhd/ovirt-engine-dwhd.conf
>
> for settings.
>
> Regards,
> Paul S.
>
>
> ------------------------------
> *From:* Ayansh Rocks <shashank123rastogi(a)gmail.com>
> *Sent:* 25 June 2020 10:40
> *To:* Staniforth, Paul <P.Staniforth(a)leedsbeckett.ac.uk>; users <
> users(a)ovirt.org>
> *Subject:* Re: [ovirt-users] Re: ETL service aggregation error
>
>
> *Caution External Mail:* Do not click any links or open any attachments
> unless you trust the sender and know that the content is safe.
> Hi Paul,
>
> Yes i am able to connect to the database from ovirt engine machine.
>
> port is already open. what could be issue, i am getting this error in
> every hour.
>
> Thanks
>
> On Wed, Jun 24, 2020 at 6:30 PM Staniforth, Paul <
> P.Staniforth(a)leedsbeckett.ac.uk> wrote:
>
> Hello Shashank,
> it looks like it's had a problem for over 2
> years, is the Data Warehouse database local or remote? is there a firewall
> port open?
>
> Can you connect to the database from the engine machine?
>
> The credentials should be in
> /etc/ovirt-engine-dwh/ovirt-engine-dwhd.conf.d/10-setup-database.conf
>
> as your using 4.3 the version of postgresql is 10 and using scl
>
> scl enable rh-postgresql10 "psql -h localhost -U ovirt_engine_history
> -d ovirt_engine_history"
>
> if it's using the standard user, database and is local.
>
>
> Regards,
> Paul S.
> .
>
> ------------------------------
> *From:* Ayansh Rocks <shashank123rastogi(a)gmail.com>
> *Sent:* 24 June 2020 12:47
> *To:* Staniforth, Paul <P.Staniforth(a)leedsbeckett.ac.uk>; users <
> users(a)ovirt.org>
> *Subject:* Re: [ovirt-users] Re: ETL service aggregation error
>
>
> *Caution External Mail:* Do not click any links or open any attachments
> unless you trust the sender and know that the content is safe.
> Please find the attached error logs.
>
> On Thu, Jun 4, 2020 at 8:17 PM Staniforth, Paul <
> P.Staniforth(a)leedsbeckett.ac.uk> wrote:
>
> Hello Shashank,
> I can't see any of your images and also it
> would be better to have the log file as text.
>
> Regards,
> Paul S.
>
>
>
> ------------------------------
> *From:* Ayansh Rocks <shashank123rastogi(a)gmail.com>
> *Sent:* 04 June 2020 15:16
> *To:* users <users(a)ovirt.org>
> *Subject:* [ovirt-users] Re: ETL service aggregation error
>
>
> *Caution External Mail:* Do not click any links or open any attachments
> unless you trust the sender and know that the content is safe.
> Any update on this ?
>
> On Tue, May 26, 2020 at 1:41 PM Ayansh Rocks <shashank123rastogi(a)gmail.com>
> wrote:
>
> Hi,
>
> I am using 4.3.7 self hosted engine. From Few days i am getting regular
> below error messages :-
> [image: image.png]
>
> Logs in /var/log/ovirt-engine-dwh/ovirt-engine-dwhd.log
> [image: image.png]
>
> What could be the reason for this?
>
> Thanks
> Shashank
>
> To view the terms under which this email is distributed, please go to:-
> http://leedsbeckett.ac.uk/disclaimer/email/
> <https://eur02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fleedsbec...>
>
> To view the terms under which this email is distributed, please go to:-
> http://leedsbeckett.ac.uk/disclaimer/email/
> <https://eur02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fleedsbec...>
>
> To view the terms under which this email is distributed, please go to:-
> http://leedsbeckett.ac.uk/disclaimer/email/
> <https://eur02.safelinks.protection.outlook.com/?url=http%3A%2F%2Fleedsbec...>
>
> To view the terms under which this email is distributed, please go to:-
> http://leedsbeckett.ac.uk/disclaimer/email/
>
> _______________________________________________
> Users mailing list -- users(a)ovirt.org
> To unsubscribe send an email to users-leave(a)ovirt.org
> Privacy Statement: https://www.ovirt.org/privacy-policy.html
> oVirt Code of Conduct:
> https://www.ovirt.org/community/about/community-guidelines/
> List Archives:
> https://lists.ovirt.org/archives/list/users@ovirt.org/message/BCU7ZTSVHZR...
>
4 years, 4 months
VM rebooted during OST test_hotplug_cpu
by Yedidyah Bar David
Hi all,
I am trying to verify fixes for ovirt-engine-rename, specifically for
OVN. Engine top patch is [1], OST patch [2]. Ran the manual job on
these [3].
In previous patches, OST failed in earlier tests. Now, it passed these
tests, so I hope that my patches are enough for what I am trying to
do. However, [3] did fail later, during test_hotplug_cpu - it set the
number of CPUs, then tried to connect to the VM, and timed out.
The logs imply that right after it changed the number of CPUs, the VM
was rebooted, apparently by libvirtd. Relevant log snippets:
vdsm [4]:
2020-06-29 10:21:10,889-0400 DEBUG (jsonrpc/1) [virt.vm]
(vmId='7474280d-4501-4355-9425-63898757682b') Setting number of cpus
to : 2 (vm:3089)
2020-06-29 10:21:10,952-0400 INFO (jsonrpc/1) [api.virt] FINISH
setNumberOfCpus return={'status': {'code': 0, 'message': 'Done'},
'vmList': {}} from=::ffff:192.168.201.4,54576, flow_id=7f9503ed,
vmId=7474280d-4501-4355-9425-63898757682b (api:54)
2020-06-29 10:21:11,111-0400 DEBUG (periodic/0)
[virt.sampling.VMBulkstatsMonitor] sampled timestamp 2925.602824355
elapsed 0.160 acquired True domains all (sampling:451)
2020-06-29 10:21:11,430-0400 DEBUG (jsonrpc/1) [jsonrpc.JsonRpcServer]
Return 'VM.setNumberOfCpus' in bridge with {} (__init__:356)
2020-06-29 10:21:11,432-0400 INFO (jsonrpc/1) [jsonrpc.JsonRpcServer]
RPC call VM.setNumberOfCpus succeeded in 0.56 seconds (__init__:312)
2020-06-29 10:21:12,228-0400 INFO (libvirt/events) [virt.vm]
(vmId='7474280d-4501-4355-9425-63898757682b') reboot event (vm:1033)
qemu [5]:
2020-06-29T14:21:12.260303Z qemu-kvm: terminating on signal 15 from
pid 42224 (<unknown process>)
2020-06-29 14:21:12.462+0000: shutting down, reason=destroyed
libvirtd [6] itself does not log anything relevant AFAICT, but at
least it shows that the above unknown process is itself:
2020-06-29 14:18:16.212+0000: 42224: error : qemuMonitorIO:620 :
internal error: End of file from qemu monitor
(Note that above line is from 3 minutes before the reboot, and the
only place in the log with '42224'. No other log there has 42224,
other than these and audit.log).
Any idea? Is this a bug in libvirt? vdsm? I'd at least expect
something in the log for such a severe step.
[1] https://gerrit.ovirt.org/109961
[2] https://gerrit.ovirt.org/109734
[3] https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
[4] https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
[5] https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
[6] https://jenkins.ovirt.org/view/oVirt%20system%20tests/job/ovirt-system-te...
--
Didi
4 years, 4 months