RAID 10 Stripe Size for XenServer

Tags: optimization, raid, raid10, xenserver

Below is our current server configuration. In a few weeks I will be simulating a disaster recovery by installing 5 new disks (1 hot spare) and restoring all VMs from the backups.

Will I gain anything by changing the RAID stripe size to something other than 64KB? The RAID controller has options for 8KB, 16KB, 32KB, 64KB, 128KB, 256KB, 512KB, 1MB.

Any recommendations based on the specification below would be greatly appreciated – thanks.

Hardware:

Dell PowerEdge 2900 III
Dell PERC 6/i
Intel Xeon 2.5GHz (x2)
32GB RAM
Seagate ST32000645SS ES.2 2TB Near-Line SAS 7.2K (x4)

Software:

Citrix XenServer 6.2 SP1
VM - Windows SBS 2008 x64 - Exchange & multiple SQL express instances
VM - Windows Server 2003 R2 x86 - single SQL express instance
VM - CentOS 6.6 x64 (x2) - cPanel & video transcoding and streaming
VM - CentOS 6.3 x86 - Trixbox (VoIP)
VM - PHD Virtual Backup 6.5.3 (running Ubuntu 12.04.1 LTS)

Configuration:

RAID 10, 64 KB stripe size

Best Answer

I am going to try to sum up my comments into an answer. The bottom line is:

You should not tinker with the strip size unless you have good evidence that it will benefit your workload.

Reasoning:

  • For striping, you have to choose some strip size, and 64 KB is the default the manufacturer has chosen. As the manufacturer (LSI in this case, rebranded by Dell) has a vast amount of experience running a huge number of setups with different RAID levels and workloads, you might just trust them to have chosen wisely.
  • 64 KB is likely to roughly match the average request size in a virtualized environment (at least much more so than 256 KB or 1 MB) and thus be a good trade-off between transfer latency and seek time optimization¹.
  • Accurate model-driven predictions about application performance with varying strip sizes are close to impossible due to the highly variable nature of workloads and the complexity of models that would have to account for the different read-ahead and caching algorithms at each layer of the stack.

If you do want to gather that evidence, you can do so by running your typical load, plus some of the atypical load scenarios, against different strip-size configurations, collecting the data (I/O subsystem performance at the XenServer layer, backend server performance and response times at the application layer) and running it through a statistical evaluation. This will be extremely time-consuming, however, and is not likely to produce any groundbreaking result beyond "I might as well have left it at the default values", so I would consider it a waste of resources.
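
If you do go down that road, the evaluation step itself is trivial compared to the benchmarking. Below is a minimal Python sketch of what comparing two strip-size runs could look like; the file names, the "one IOPS sample per line" CSV layout, and the choice of a plain mean/standard-deviation comparison are assumptions for illustration, not part of the original setup.

    # Minimal sketch: compare IOPS samples gathered under two strip-size
    # configurations (e.g. 64 KB vs 256 KB). File names and CSV layout
    # ("one IOPS sample per line") are hypothetical.
    import csv
    import statistics

    def load_samples(path):
        """Read one numeric IOPS sample per line from a CSV/text file."""
        with open(path) as f:
            return [float(row[0]) for row in csv.reader(f) if row]

    def summarize(label, samples):
        mean = statistics.mean(samples)
        stdev = statistics.stdev(samples) if len(samples) > 1 else 0.0
        print(f"{label}: n={len(samples)} mean={mean:.0f} IOPS stdev={stdev:.0f}")
        return mean

    if __name__ == "__main__":
        baseline = load_samples("iops_64k.csv")    # hypothetical 64 KB results
        candidate = load_samples("iops_256k.csv")  # hypothetical 256 KB results
        m_base = summarize("64 KB strip", baseline)
        m_cand = summarize("256 KB strip", candidate)
        delta = (m_cand - m_base) / m_base * 100
        print(f"Relative difference: {delta:+.1f}% "
              "(only meaningful if well outside run-to-run noise)")

The point of the sketch is that any observed difference has to be judged against the run-to-run variation of your workload, which is exactly why the exercise tends not to be worth the effort.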


¹ If you assume a transfer rate of 100 MB/s for a single disk, it is easy to see that one kilobyte takes around 0.01 ms to read, so a 64 KB strip has a read-transfer latency of about 0.64 ms. Considering that the average service time of a random I/O request will typically be in the range of 5-10 ms, that transfer latency is only a small fraction of the total wait time. On the other hand, reading 512 KB takes around 5 ms, which does matter for the "small random read" type of workload and reduces the number of IOPS your array can deliver in that specific case by a factor of roughly 1.5 to 2. A scenario with concurrent large random reads would benefit, as larger block reads induce fewer time-consuming seeks, but you are very unlikely to see that scenario in a virtualized environment.
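
To make that arithmetic concrete, here is a small Python sketch that estimates per-request service time and the resulting single-disk random-read IOPS for a few strip sizes. The 100 MB/s transfer rate matches the footnote; the 7 ms average seek plus rotational delay is an illustrative assumption in the middle of the 5-10 ms range quoted above, not a measured value for these drives.

    # Rough estimate of how strip size affects random-read service time,
    # using the footnote's numbers: ~100 MB/s sequential transfer and an
    # assumed ~7 ms average seek + rotational delay (illustrative only).
    TRANSFER_MB_S = 100.0   # assumed sequential transfer rate per disk
    SEEK_MS = 7.0           # assumed average seek + rotational latency

    def service_time_ms(strip_kb):
        """Seek/rotation plus the time to transfer one strip of data."""
        transfer_ms = strip_kb / 1024.0 / TRANSFER_MB_S * 1000.0
        return SEEK_MS + transfer_ms

    for kb in (64, 128, 256, 512, 1024):
        t = service_time_ms(kb)
        iops = 1000.0 / t
        print(f"{kb:>5} KB strip: ~{t:.2f} ms per random read, ~{iops:.0f} IOPS/disk")

Under these assumptions, going from 64 KB to 512 KB drops a single disk from roughly 130 to roughly 85 random-read IOPS, which is the factor of 1.5 to 2 mentioned in the footnote.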