Electronic – Does cache access time scale down with frequency in modern CPUs?

I have an application where I suspect the main overhead is L3 cache access. It runs on a modern Intel server-grade microprocessor with a huge L3 cache. There are many such microprocessors available, with various core counts and frequencies, and choosing between them is hard.

That led me to wonder whether L3 cache access (hit) time scales down with frequency. If a 2 GHz CPU accesses L3 cache in, say, 12 ns, does that mean a 3 GHz CPU accesses L3 cache in 8 ns? Of course, this may depend on internal implementation details of the CPU, so let's limit answers to modern i386/AMD64 microprocessors.
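To make the arithmetic behind that example explicit, here is a minimal sketch of the "scales with frequency" hypothesis. It assumes the L3 hit costs a fixed number of core cycles; the function name and the 24-cycle figure are only derived from the 12 ns at 2 GHz example above, not measured on any real CPU.

```python
def latency_ns(cycles, freq_ghz):
    """Convert a latency in clock cycles to nanoseconds (1 GHz = 1 cycle per ns)."""
    return cycles / freq_ghz

# Hypothetical: 12 ns at 2 GHz corresponds to 12 * 2 = 24 cycles.
CYCLES = 24

print(latency_ns(CYCLES, 2.0))  # 12.0
print(latency_ns(CYCLES, 3.0))  # 8.0
```

If the latency were instead fixed in nanoseconds, the cycle count would grow with frequency and the 3 GHz part would gain nothing on this access.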

I know that at least in the case of DRAM the delays are practically constant, so if you do a random DRAM access, higher frequency means only better sequential data rate after the random access penalty has been paid. But is SRAM fundamentally different? It is based on transistors, so I guess it could be clocked at pretty good frequencies.

Sorry if this is the wrong place to ask, but I could not find a better place for detailed computer architecture questions. I considered CS StackExchange and StackOverflow, but I think Electrical Engineering StackExchange is the best fit. There really should be a Computer Architecture StackExchange, but there isn't!

Best Answer

In the old days (the mid-1990s), every CPU block scaled with frequency. With modern processors that is no longer true. The CPU core and the caches have different design topologies, and memories usually do not scale much. The timing alignment between the CPU core clock and the caches is usually done by reconfiguring the pipeline's access to the cache, so it is very likely that the cache line access time doesn't scale with core frequency at all. I recall running into similar issues while working on K6 processors, where the cache access had to be reconfigured for every clock multiplier.
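As a rough illustration of that reconfiguration, here is a minimal sketch assuming the cache needs a fixed time in nanoseconds regardless of the core clock. The function name is hypothetical and the 12 ns figure is borrowed from the question, not from any real part. Under this assumption the core simply waits more cycles at a higher frequency: 24 cycles at 2 GHz and 36 cycles at 3 GHz, with the wall-clock latency unchanged.

```python
import math

def cycles_for_fixed_latency(access_ns, core_ghz):
    """Core cycles the pipeline must wait if the cache needs access_ns
    no matter how fast the core is clocked (an assumed, simplified model)."""
    return math.ceil(access_ns * core_ghz)

ACCESS_NS = 12.0  # illustrative value taken from the question

for ghz in (2.0, 3.0):
    cycles = cycles_for_fixed_latency(ACCESS_NS, ghz)
    print(f"{ghz} GHz core: wait {cycles} cycles, still {ACCESS_NS} ns")
```

Real designs are likely somewhere in between, since part of the access path may run in the core clock domain; this sketch only covers the limiting case where none of it does.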

With modern CPUs and adaptive core frequency, things are much more complicated. CPUs are designed by thousands of design engineers and architected by hundreds of architects, so to get into the details you would need top-clearance access to top-secret documentation, and even then it would be difficult to find the exact answer.
