HP ProLiant ML350p Gen8 server keeps crashing (full speed fans)

fan hp hp-proliant server-crashes vmware-esxi

I have a problem with an HP ProLiant ML350p Gen8 server. Most of the time it runs fine, but after several weeks of uptime it crashes out of the blue. This has happened about 5 times now. When it crashes, the OS (VMware ESXi 5.5) stops responding and the fans run at full speed. Pressing the power button has no effect at that point; I have to unplug the power cable and plug it back in to get the server to restart.
I've run memtest without any errors, and I didn't find anything in the logs. Do you have any ideas on how to solve this?

Best Answer

There are a couple of reasons this could be happening.

  • Firmware.
  • Updates.
  • Possibly hardware.

Please see: http://meta.serverfault.com/q/6195/13325

If you're running Windows virtual machines configured with Intel e1000 virtual NICs, there is a chance that this is what is crashing your VMware host. That's resolved with updates to ESXi and/or a change in your vNIC configuration (for example, switching the affected VMs to a different adapter type such as VMXNET3).
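To see which VMs use the e1000 adapter, you can look at the `ethernetN.virtualDev` entries in each VM's .vmx file. Below is a minimal Python sketch of that check. The function name `find_e1000_vnics` and the command-line usage are my own illustration, not part of any VMware tool, and it assumes the adapter type is set explicitly in the .vmx file (if the line is absent, the default adapter depends on the guest OS type, and this script will not report it).

```python
import re
import sys

# Matches lines such as: ethernet0.virtualDev = "e1000"
_VNIC_LINE = re.compile(
    r'^\s*(ethernet\d+)\.virtualDev\s*=\s*"([^"]*)"\s*$',
    re.IGNORECASE,
)


def find_e1000_vnics(vmx_text):
    """Return (adapter, device_type) pairs for adapters explicitly set to e1000.

    Hypothetical helper: it only inspects the text of a .vmx file and does
    not talk to the ESXi host.
    """
    hits = []
    for line in vmx_text.splitlines():
        match = _VNIC_LINE.match(line)
        if match and match.group(2).lower() == "e1000":
            hits.append((match.group(1), match.group(2)))
    return hits


if __name__ == "__main__":
    # Usage: python check_vnic.py /path/to/vm1.vmx /path/to/vm2.vmx
    for path in sys.argv[1:]:
        with open(path, encoding="utf-8", errors="replace") as handle:
            for adapter, dev in find_e1000_vnics(handle.read()):
                print(f"{path}: {adapter} uses {dev}")
```

Run it against copies of the .vmx files from your datastore. Any adapter it lists is a candidate for the vNIC change mentioned above.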

If you're running old HP firmware, please update it.

Since you have HP hardware, please look in the iLO and the IML (Integrated Management Log) to get a detailed reason for the crash. That will tell you if you're facing a hardware issue.

Memtest+ is useless on server equipment like this.