NetBackup Multiplexing for Oracle RMAN Backups

backup, multiplexing, netbackup, oracle, performance-tuning

My question is: what multiplexing factor in NetBackup is recommended, or do you use, for Oracle RMAN backups over a 1 Gb/s management network to LTO3?

JB

Background:

With enterprise backup tools like NetBackup, there is the concept of multiplexing: combining data from a number of backup clients simultaneously in order to feed modern high-speed tape drives as quickly as possible.

The number of simultaneous client data streams interleaved is determined by the Multiplexing factor. The higher the multiplexing factor, the more data fed to the tape drive, but the slower any restores.
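To make the trade-off concrete, here is a toy model in Python. It is only a sketch: it assumes simple round-robin interleaving of equal-sized blocks, which is not NetBackup's actual on-tape format, and the function names are made up for illustration. It shows that restoring a single client from a multiplexed tape means reading past the other clients' blocks.

```python
from itertools import zip_longest


def multiplex(streams):
    """Interleave blocks from several client streams round-robin onto one 'tape'.

    Each stream is a list of (client, sequence_number) tuples.
    """
    tape = []
    for group in zip_longest(*streams):
        # zip_longest pads shorter streams with None; skip the padding.
        tape.extend(block for block in group if block is not None)
    return tape


def blocks_read_for_restore(tape, client):
    """Count the tape blocks that pass under the head to restore one client."""
    last = max(i for i, (owner, _) in enumerate(tape) if owner == client)
    return last + 1


# Three hypothetical clients with four blocks each (multiplexing factor 3).
streams = [[(name, n) for n in range(4)] for name in ("a", "b", "c")]
tape = multiplex(streams)
print(len(tape))                           # 12 blocks on tape
print(blocks_read_for_restore(tape, "a"))  # 10 blocks read to get 4 back
```

In this model, client "a" owns only 4 blocks but the drive has to read 10 to get them all back, and the gap grows with the multiplexing factor.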

As overall restore time is determined more by the messing about (logging the incident, determining whether tapes are available, recalling them from offsite, loading, inventory, etc.) than by the actual tape restore speed, I feel confident using a high factor for filesystem backups.

Oracle backups with large datasets, which are more often restored all together, pose a different challenge to filesystem backups.

Best Answer

The first thing to check is how much network (TCP) throughput your server can handle. Use netcat or a similar tool. If it is less than around 30 MB/s, multiplexing from the network is of no use to you, and my further advice can be ignored. Work on tuning your network throughput instead. Now, to the point.
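If netcat is not handy, a rough equivalent can be sketched in Python. This is only an illustration: the names run_sink, measure_send_throughput and loopback_demo and the 64 KiB chunk size are made up for this example. The loopback demo only proves the script runs; for a real measurement the sink would have to be adapted to listen on the media server while the sender runs on the Oracle host.

```python
import socket
import threading
import time

CHUNK = 64 * 1024  # bytes per send/recv call (arbitrary choice)


def run_sink(server_sock):
    """Accept one connection and discard everything it sends."""
    conn, _ = server_sock.accept()
    with conn:
        while conn.recv(CHUNK):
            pass


def measure_send_throughput(host, port, total_bytes):
    """Send total_bytes of zeros to host:port and return MB/s (10**6 bytes/s).

    The timer stops once the data is handed to the kernel and the socket is
    closed, so use a large total_bytes to keep buffering effects negligible.
    """
    payload = bytes(CHUNK)
    sent = 0
    start = time.perf_counter()
    with socket.create_connection((host, port)) as sock:
        while sent < total_bytes:
            n = min(CHUNK, total_bytes - sent)
            sock.sendall(payload[:n])
            sent += n
    elapsed = time.perf_counter() - start
    return sent / elapsed / 1e6


def loopback_demo(total_bytes=8 * 1024 * 1024):
    """Run sink and sender on 127.0.0.1 to check that the script works."""
    server = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    server.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    server.listen(1)
    port = server.getsockname()[1]
    sink = threading.Thread(target=run_sink, args=(server,))
    sink.start()
    rate = measure_send_throughput("127.0.0.1", port, total_bytes)
    sink.join()
    server.close()
    return rate


if __name__ == "__main__":
    print(f"{loopback_demo():.1f} MB/s over loopback")
```

Compare the figure you get between the real hosts against the 30 MB/s mark above.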

The LTO3 drive, just like any other linear tape drive, works well only when it gets a stream of data with a certain constant throughput.

The tape is passing under the head at high speed, and you don't want to stop it. At each stop the drive has to perform a lengthy procedure: decelerate to a full stop, accelerate backwards, pass the end-of-data point, decelerate again, then accelerate forwards to reach the end-of-data point. When data is not fed by NetBackup fast enough, the buffer underruns frequently, so the drive has to stop, rewind and start frequently. Performance suffers dramatically. This is called "start-stop" operation or "shoe shining".

The drive can adjust the speed of the tape somewhat, but not by much: it can drop to about 50% of maximum speed.

The whole point of NetBackup multiplexing is to provide better streaming throughput and avoid start-stop operation. Check the throughput of your RMAN backup. If it is 30 MB/s or less, you have classic start-stop operation.
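The check itself is simple arithmetic: bytes written divided by elapsed time. A minimal Python sketch follows; the function names are made up for this example, the 30 MB/s threshold is the rule of thumb from this answer rather than a drive specification, and the backup sizes and durations are hypothetical.

```python
def backup_throughput_mb_s(bytes_written, elapsed_seconds):
    """Average throughput of a backup job in MB/s (10**6 bytes per second)."""
    if elapsed_seconds <= 0:
        raise ValueError("elapsed_seconds must be positive")
    return bytes_written / elapsed_seconds / 1e6


def looks_like_start_stop(bytes_written, elapsed_seconds, threshold_mb_s=30.0):
    """True if the job averaged at or below the rule-of-thumb threshold."""
    return backup_throughput_mb_s(bytes_written, elapsed_seconds) <= threshold_mb_s


# Hypothetical job: 500 GB written in 5 hours.
print(round(backup_throughput_mb_s(500e9, 5 * 3600), 2))  # 27.78
print(looks_like_start_stop(500e9, 5 * 3600))             # True
```

Take the byte count and elapsed time from the job details of the RMAN backup you want to check.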

Now, let me make one thing clear. If you do not have start-stop I would not recommend multiplexing RMAN backups at all. RMAN is complicated enough without multiplexing. I don't want to mess with RMAN, I want my restore to be as fast, easy and seamless as possible.

However, if you find your backup throughput unacceptably low, I would suggest starting with around three multiplexed streams. Increase the number each night until you no longer gain any throughput. And make sure each stream comes from different physical disk spindles. Not from different partitions/tablespaces/filesystems/databases/servers/LUNs/other-virtualization-layers. These matter little, if at all. Physical disk spindles. If you feed many streams from the same spindles you will just cause thrashing, and overall performance will drop even more.
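The "raise it each night until it stops helping" procedure can be written down as a small Python sketch. The function name, the 5% minimum-gain cutoff and the nightly figures are all assumptions made up for this example, not measurements or NetBackup settings.

```python
def pick_multiplexing_factor(nightly_results, min_gain=0.05):
    """Return the last factor that still gave a worthwhile throughput gain.

    nightly_results: list of (factor, mb_per_s) tuples, one per night,
    in the order the factors were tried.
    min_gain: smallest relative improvement considered worthwhile.
    """
    best_factor, best_rate = nightly_results[0]
    for factor, rate in nightly_results[1:]:
        if rate < best_rate * (1 + min_gain):
            # No meaningful gain (or a drop, e.g. from disk thrashing): stop.
            break
        best_factor, best_rate = factor, rate
    return best_factor


# Hypothetical measurements from four consecutive nights.
nights = [(3, 28.0), (4, 41.0), (5, 52.0), (6, 53.0)]
print(pick_multiplexing_factor(nights))  # 5
```

Here the step from 5 to 6 streams adds under 2%, so the sketch settles on 5. A drop in throughput after an increase is the thrashing case, and the sketch stops there as well.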

Note: NetBackup theoretically can also de-multiplex a restore. If I remember correctly, it pauses a little before a restore, to give more restore requests a chance to launch. In that case they will run jointly, just like multiplexed backups. But please verify this against the manual, I am only 90% sure on this one.
