Amanda versus Bacula and (modern as of 2017) large backup jobs

amanda backup bacula

I am deploying my first tape backup solution (a Quantum SuperLoader 3 with LTO-7, on CentOS 6.9), and I am surprised and stymied by what seem like daunting configuration requirements in both Bacula and Amanda.

I am attracted by Amanda's philosophy of using existing common tools and formats. This is how I've been doing my disk-based backups for years. The problem is that when I look for hints online, all I can find are posts from the early 2000s (and precious few from around 2010) about backing up only a few GB to tapes of only a few dozen GB. Further, the default parameters in /etc/amanda/DailySet1/amanda.conf after a fresh yum install amanda-server seem aligned with these kinds of numbers (e.g. bumpsize is 20 MB!). This seems totally out of touch with modern reality. I am dealing with multiple highly compressible 40+ TB file systems and 6 TB tapes on a 16-tape autoloader.

Of course, the anachronism wouldn't matter by itself, but I am at a loss regarding how to configure parameters (e.g. dumpcycle, runspercycle, tapecycle, bumpsize, etimeout) in the absence of any collective wisdom on the web. The defaults seem abysmal. Also, as a matter of course, rather than rotating many small backups across many large tapes, I have a mix of many small backups that fit on a tape necessarily coexisting with large backups that will have to span several tapes, and I need to be sure tapes aren't being overwritten. The tapes in the autoloader definitely have sufficient capacity for multiple level 0s and several level 1s, but I am not sure how to configure it.

So my questions:

  1. Is Amanda used for jobs of this magnitude? Is it superseded at this point by Bacula? Is there a compelling reason to use one or the other?
  2. It almost seems simpler and less risky to just write my own tape-changing logic and tar/dump/dd/rsync invocation script than to rely on massive configuration files with dozens of parameters. Foolhardy or advisable long-term? Do people do this?
  3. I want to be able to rotate a subset of tapes out of the autoloader for offsite backup. Will this totally flummox Amanda or Bacula or cause administration headaches?
  4. Can these systems take advantage of the bar codes I have on the tapes given that the hardware reads the bar codes?

I apologize if these questions seem uninformed. I've been reading literature for days now, but this is opaque to me. Any help or guides getting started with dozens of TB backups on modern autoloaders with modern tapes on a modern OS would be very much appreciated!

Best Answer

I have been using Bacula for about 13 years, and a Quantum SuperLoader LTO-6 tape library for about 3-4 years.

  1. Is Amanda used for jobs of this magnitude? Is it superseded at this point by Bacula? Is there a compelling reason to use one or the other?

Our backup volume is one 25 TB storage system plus a few TB in other, smaller folders. Changes are not big, so incrementals are fast. The biggest problem I have is that one full job of 25 TB takes a few days to complete and blocks the other jobs in the meantime. That means you need to split the job into more manageable chunks. LTO-7 is faster, but your amount of data is also bigger.
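A back-of-the-envelope estimate helps when deciding how to split jobs. The sketch below assumes the native (uncompressed) LTO figures, 160 MB/s and 2.5 TB per tape for LTO-6, 300 MB/s and 6 TB per tape for LTO-7, and a source that can keep the drive streaming, which is the best case. The directory names and sizes are made up for illustration, and the greedy grouping is just one simple way to split a big job.

```python
import math

# Native (uncompressed) figures for each LTO generation.
LTO = {
    "LTO-6": {"capacity_tb": 2.5, "speed_mb_s": 160},
    "LTO-7": {"capacity_tb": 6.0, "speed_mb_s": 300},
}


def full_backup_estimate(data_tb, generation):
    """Return (hours, tapes) for one full backup at native speed, no compression."""
    spec = LTO[generation]
    seconds = data_tb * 1_000_000 / spec["speed_mb_s"]  # 1 TB = 1,000,000 MB
    tapes = math.ceil(data_tb / spec["capacity_tb"])
    return seconds / 3600, tapes


def split_into_jobs(dir_sizes_tb, max_job_tb):
    """Greedily group directories into jobs of at most max_job_tb each.

    Largest directories are placed first; a directory larger than
    max_job_tb simply ends up in a job of its own.
    """
    jobs = []
    ordered = sorted(dir_sizes_tb.items(), key=lambda kv: kv[1], reverse=True)
    for name, size in ordered:
        for job in jobs:
            if job["total_tb"] + size <= max_job_tb:
                job["dirs"].append(name)
                job["total_tb"] += size
                break
        else:
            jobs.append({"dirs": [name], "total_tb": size})
    return jobs


for data_tb, generation in [(25, "LTO-6"), (40, "LTO-7")]:
    hours, tapes = full_backup_estimate(data_tb, generation)
    print(f"{data_tb} TB on {generation}: about {hours:.1f} h, {tapes} tapes")

# Hypothetical directory sizes in TB, grouped into jobs of at most 13 TB.
example = {"projects": 12, "home": 6, "scratch": 5, "archive": 2}
for job in split_into_jobs(example, 13):
    print(job["total_tb"], "TB:", ", ".join(job["dirs"]))
```

Real throughput is often lower than the native rate when the source disks or network cannot keep the drive streaming, so treat these numbers as a lower bound on the backup window.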

We already had Bacula working, and integrating the tape library was not that difficult, so we stuck with it. I am quite happy with its stability and features, and restores are not difficult once you get the hang of them. I have no experience with Amanda, though.

  2. It almost seems simpler and less risky to just write my own tape-changing logic and tar/dump/dd/rsync invocation script than to rely on massive configuration files with dozens of parameters. Foolhardy or advisable long-term? Do people do this?

Sure, you can do it, but Amanda and Bacula have mostly worked out the kinks. Some people write a list of files and their checksums to the first blocks of the tape, followed by the tar archive. GNU tar can create multi-volume archives out of the box (checked with GNU tar 1.29).
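To give an idea of what the "checksum list first, archive after" approach involves, here is a minimal sketch using only the Python standard library. It is a variation on the idea, not anybody's production script: the manifest is stored as the first member inside the tar stream rather than in raw leading blocks, and the names MANIFEST.sha256 and data are arbitrary choices. It does not handle tape spanning, changer control, or unreadable files, which is exactly the part the existing tools have already solved.

```python
import hashlib
import io
import os
import tarfile


def build_manifest(root):
    """Return manifest text: one 'sha256  relative/path' line per file under root."""
    lines = []
    for dirpath, dirnames, filenames in os.walk(root):
        dirnames.sort()  # walk subdirectories in a stable order
        for name in sorted(filenames):
            path = os.path.join(dirpath, name)
            digest = hashlib.sha256()
            with open(path, "rb") as fh:
                # Read in 1 MiB chunks so large files do not fill memory.
                for chunk in iter(lambda: fh.read(1 << 20), b""):
                    digest.update(chunk)
            rel = os.path.relpath(path, root)
            lines.append(f"{digest.hexdigest()}  {rel}")
    return "\n".join(lines) + "\n"


def write_backup(root, out):
    """Write the manifest as the first tar member, then the files themselves."""
    manifest = build_manifest(root).encode()
    # "w|" is tarfile's non-seeking stream mode, suitable for sequential devices.
    with tarfile.open(fileobj=out, mode="w|") as tar:
        info = tarfile.TarInfo("MANIFEST.sha256")
        info.size = len(manifest)
        tar.addfile(info, io.BytesIO(manifest))
        tar.add(root, arcname="data")
```

In use, `out` would be a binary file object opened on the archive destination, and a restore test would extract MANIFEST.sha256 first and compare it against the extracted files.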

  3. I want to be able to rotate a subset of tapes out of the autoloader for offsite backup. Will this totally flummox Amanda or Bacula or cause administration headaches?

Yes, that certainly works. Bacula has an "update slots" command that refreshes its record of which tapes are in the library, and at backup time it will tell you which tape to load if the one it needs is missing. We do that for a small backup job where we change one tape every month out of a pool of 7 tapes. The other 6 are always offsite, which keeps them safe and leaves more space in the library. For a restore we bring back what we need, but luckily we seldom need a restore.
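The monthly swap is simple enough to plan on paper, but a small sketch makes the pattern explicit. It assumes a plain round-robin order, which is my assumption rather than anything Bacula enforces, and the tape labels are hypothetical.

```python
def rotation_schedule(pool, months):
    """Return [(month_number, tape_in_library, tapes_offsite), ...] for a round-robin rotation."""
    schedule = []
    for month in range(months):
        current = pool[month % len(pool)]
        offsite = [tape for tape in pool if tape != current]
        schedule.append((month + 1, current, offsite))
    return schedule


# Seven hypothetical barcode labels for the offsite pool.
pool = [f"OFF{n:03d}L6" for n in range(1, 8)]
for month, current, offsite in rotation_schedule(pool, 8):
    print(f"month {month}: load {current}, {len(offsite)} tapes offsite")
```

After each physical swap you would still run "update slots" in the Bacula console so the catalog matches what is actually in the library.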

  4. Can these systems take advantage of the bar codes I have on the tapes given that the hardware reads the bar codes?

Yes. In the Bacula console, "label barcodes storage=LTO6" labels the tapes according to their barcodes (LTO6 is the storage name in our configuration). There are websites around for printing your own labels.
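If you print your own labels, it helps to generate the label strings consistently first. LTO barcodes are eight characters: a six-character volume serial (uppercase letters and digits) followed by a two-character media identifier such as L6 or L7. The sketch below just produces such strings; the function name and the BK prefix are hypothetical, and it does not render the barcode graphics themselves.

```python
import re


def lto_barcode_labels(prefix, start, count, media_id="L7"):
    """Return 8-character LTO label strings: 6-character serial + 2-character media ID."""
    digits = 6 - len(prefix)
    if digits < 1:
        raise ValueError("prefix must leave room for at least one digit")
    labels = []
    for n in range(start, start + count):
        serial = f"{prefix}{n:0{digits}d}"
        # Reject serials that overflow six characters or use other characters.
        if not re.fullmatch(r"[A-Z0-9]{6}", serial):
            raise ValueError(f"invalid volume serial: {serial!r}")
        labels.append(serial + media_id)
    return labels


print(lto_barcode_labels("BK", 1, 3))
# ['BK0001L7', 'BK0002L7', 'BK0003L7']
```

Keeping a distinct prefix per pool (daily, monthly, offsite) makes it easier to tell tapes apart when they come back from storage.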

What I have never tried is LTFS, i.e. a file system on tape. Combined with rsnapshot, it might be a feasible alternative.

I strongly suggest trying out all the solutions, and do not be afraid to look at old documentation; a lot of it is still valid. And do test restores!