On February 6, 2020 6:06:18 PM GMT+02:00, Christian Reiss <email(a)christian-reiss.de> wrote:
Hey folks,
Running a 3-way HCI (again (sigh)) on Gluster. Now, the _inside_ of the
VMs is backed up separately using Bareos on an hourly basis, so files
are present with a worst case of 59 minutes of data loss.
Now, on the outside I thought of doing Gluster snapshots and then
syncing those .snap dirs away to a remote 10gig-connected machine on a
weekly-or-so basis. As the contents of the snaps are the oVirt images
(the entire DC), I could re-set up Gluster, copy those files back in,
and be done with it (see the sketch below).
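Roughly what I have in mind, as a sketch (the volume name 'data', the
mount point /mnt/data, and the host 'backuphost' are placeholders; note
that Gluster snapshots require thin-LVM bricks):

  SNAP="weekly-$(date +%F)"

  # one-time: expose snapshots under <mountpoint>/.snaps/
  # (User Serviceable Snapshots)
  gluster volume set data features.uss enable

  # take a snapshot of the volume and make its contents readable
  gluster snapshot create "$SNAP" data no-timestamp
  gluster snapshot activate "$SNAP"

  # sync the snapshot contents to the remote 10gig box
  rsync -aHAX --numeric-ids "/mnt/data/.snaps/$SNAP/" backuphost:/backups/data/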
Now some questions, if I may:
- If the hosts remain intact but Gluster dies, I simply set up Gluster,
stop the oVirt engine (separate standalone hardware), copy everything
back, and start the oVirt engine again. All disks are accessible again
(tested). The bricks are marked as down (new bricks, same name). There
is a "reset brick" button that made the bricks come back online again.
What _exactly_ does it do? Does it reset the brick info in oVirt, or
copy all the data over from another node and really, really reset the
brick?
- If the hosts remain intact, but the engine dies: can I re-attach the
engine to the running cluster?
- If the hosts and engine die and everything needs to be re-set up:
would it be possible to run the setup wizard(s) again up to a running
state, then copy the disk images into the new gluster-dc-data-dir?
Would oVirt rescan the dir for newly found VMs?
- If _one_ host dies, but two hosts and the engine remain online:
what's the oVirt way of setting the failed one back up? Reinstalling
the node, and then what? Of all the cases above, this is the most
likely one.
Having had to reinstall the entire cluster three times already scares
me. It was always Gluster-related.
Again thank you community for your great efforts!
Gluster's reset-brick actually wipes the brick and starts a heal from
another brick.
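From the CLI, the equivalent is roughly this (the volume name 'data'
and the host/brick path are just examples):

  # take the brick offline
  gluster volume reset-brick data node1:/gluster_bricks/data/brick start

  # re-add the same brick; 'commit force' reuses the (empty) brick and
  # heals its contents from the other replicas
  gluster volume reset-brick data node1:/gluster_bricks/data/brick \
      node1:/gluster_bricks/data/brick commit force

  # watch the heal progress
  gluster volume heal data info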
If your node dies, oVirt won't allow you to remove it until you restore
the 'replica 3' status of Gluster.
I think the fastest way to restore a node is:
1. Reinstall the node with the same hostname and network settings
2. Restore the Gluster config directory /var/lib/glusterd/ from backup
3. Restart the node and initiate a reset-brick (command sketch below)
4. Go to the UI and remove the node that was defective
5. Add the node again
Voila.
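Steps 2 and 3 in commands, roughly (this assumes you kept a tarball of
/var/lib/glusterd/; volume and host names are the same examples as
above):

  # stop glusterd before restoring its config
  systemctl stop glusterd

  # restore peer info and volume definitions (assumes the backup was
  # made with: tar -czf glusterd-backup.tar.gz -C /var/lib glusterd)
  tar -xzf glusterd-backup.tar.gz -C /var/lib/

  # start glusterd so the node rejoins the trusted pool
  systemctl start glusterd
  gluster peer status

Once the peers show 'Connected', run the reset-brick sequence from
above and let the heal finish before putting load back on the node.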
About the Gluster issues: you are not testing your upgrades enough, and
if you use the cluster in production, that will be quite disruptive.
For example, the ACL issue I ran into (and actually you did, too) was
discussed on the mailing list for 2 weeks before I managed to resolve
it.
I'm using the latest oVirt with Gluster v7 - but this is my lab and I
can afford a week (or even more) of downtime. The more tested an
oVirt/Gluster release is, the more reliable it will be.
Best Regards,
Strahil Nikolov