Diagnosing Hardware Problems in Linux Server with Kernel Panicking

kernel-paniclinuxtroubleshooting

We have a server that has been occasionally kernel panicking for a while now that we believe has a hardware problem. How would you go about troubleshooting hardware that you don't have physical access to? Are there any tools that I can use within the OS itself to diagnose different pieces of the system to try to figure out what's causing all of this panicking?

Best Answer

Barring anything revealing in the system's logs or vendor-supplied test tools (front panel display, Dell Diagnostics, etc.), most diagnostic procedures will require physical access to the system.

My suggestion would be to have memtest86 or memtest86+ run on the system: Most panics/random crashes are caused by bad RAM and this will usually catch it.

Related Topic