Linux – HP ProLiant DL360 G7 watchdog

hardwarehphp-proliantlinuxwatchdog

I have dl360 G7 servers. some times kernel got panic. I understood, that server have hardware watchdog timer.

dmesg | grep wdt
[ 6.344992] hpwdt 0000:02:00.0: PCI INT A -> GSI 16 (level, low) -> IRQ 16
[ 6.345563] hpwdt: New timer passed in is 30 seconds.

Server already is in production. Question is: how i can enable it? Did i need some software to install on system too? On server is running debian.
I understood that i need on system some software that tells watchdog, that system is alive. Or i am wrong? Thanks!

Best Answer

A watchdog timer in this context resets the system or takes some action if it loses a "pulse". HP ProLiant servers have a hardware watchdog that ties into the Automatic Server Recovery (ASR) feature. The ASR reboots the server if it detects a hang lasting a certain period of time (default is 10 minutes).

On your Debian system, the hpwdt == HP Watchdog Timer. This is a kernel module. Instructions for its use are contained within the kernel documentation.