Capacity management – what metrics/KPI for CPU and RAM

capacity-planningcentral-processing-unitmemory

Context: business applications servers

What are good metrics/KPI for measuring CPU usage and performance? I think that things like %usage are not really relevant (it's not because my PC uses 100% CPU at one point that i need to buy a new one). How can i anticipate the point where i need to add more computing power?

What are good metrics/KPI for measuring RAM usage and performance? How can i anticipate the point where i need to add more RAM?

Are there industry-recognized general ratios for the above metrics? (e.g. should be < xyz unit)

Best Answer

Those metrics are mostly for OS developers: if you want specu=ifically to know about cpu and memory, then you want cpu-seconds actually used vs load, memory vs load, and io to the paging device vs load

The first will rise to your capacity, then plateau The second will rise, start to plateau, and then the third will show a step upwards at that same load.

If you want a good general indicator, look at cpu run-queue length. Think of 0-1 as 0-100% cpu, and numbers above 1 as overload. By about 4 per cpu core, the system never catches up in quiet milliseconds, and you're in trouble (;-))

--dave