Centos and dell PowerEdge Rxxx Series

centosdell

we lately purchase few dell servers
all of them from Rxxx series
couple of R410
and R710

the OS we used on those servers is: CentOS 5.4

we're getting very weird error messages
and we lost network connectivity couple of times (restarting the network interface was needed to fix it)

the messages we're getting are:
Message from syslogd@ at Wed Nov 18 12:07:08 2009 …
servername kernel: Uhhuh. NMI received for unknown reason 20.
Message from syslogd@ at Wed Nov 18 12:07:08 2009 …
servername kernel: Do you have a strange power saving mode enabled?
Message from syslogd@ at Wed Nov 18 12:07:08 2009 …
servername kernel: Dazed and confused, but trying to continue

we have never seen those messages in the previous series of dell poweredge servers

do someone here using centOS 5.4 on Rxxx series?
did it happen to him too?

maybe you have a suggestion about how to prevent it from happening


Update:

thanks for the info

well, i've already contact dell ofcourse
they even change the motherboard in 2 of our servers

the fact i've seen those weird OS messages in more than 1 server (one R410 and the other R710) makes me think that maybe there is a conflict problem between the OS and the server

it is just doesn't make any sense that it will happen on more than 1 server, and even after motherboard replacement

dell do say they don't support centOS, i did thier DSET diagnostics and sent to them, they didnt see anything there.

all fimrwares are up to date.

Best Answer

Have a look at http://kbase.redhat.com/faq/docs/DOC-16294 for a possible solution.

The solution to hangs on RHEL5.3 running the Xen Kernel and the bnx2 driver is given as editing /etc/modprobe.conf by adding the line

options bnx2 "disable_msi=1"