Rsync and off-lined data on tape

backuprsynctape

I have a backup system handling large amounts of data. I use rsync to backup the data to a remote directory. In the remote location they archive their data on tape for security reasons.

The data is static. Once created it does not change.

Right now I am facing the option of off-lining some of the data to tape, in order to gain disk storage space on the remote location. The off-lining is done by keeping the structure of the file system, so existing files can be browsed without calling them back to disk.

I have been looking at how to manage this through rsync. Since the tape storage is not made for fast data retrieval. Will the -W flag achieve this?

Best Answer

Based on your comments it sounds like you've got a hierarchical storage management (HSM) system that is automatically handling restoring files from tape when they're accessed. (You don't specifically state this in your question, but your comment "...I believe it will attempt to call files from tape back to disk.")

The -W argument disables delta copying. This would have no effect on the problem you're trying to avoid.

By default, rsync will use the modification timestamp and file size as a test to decide which files have changed. If your HSM maintains the file size and timestamp on the "stub" files (usually done via sparse files) then rsync shouldn't attempt to perform delta copies of the stubbed files. If, however, the sizes and timestamps don't match then you're going to assume the files have changed and attempt to do a copy.

I don't see any functionality in rsync that would allow for automatic exclusion of files that are stubbed out to your HSM. (I don't know what operating systems are at play, either. Windows, for example, has a file attribute that identifies files stubbed out to HSM.)

If your stub files don't have the proper timestamps and sizes then your best bet would probably be to generate an exclusion list of files that have been stubbed-out and use that to exclude files from rsync.