ESXi :: vmxnet3 vNIC and Linux kernel errors

linux-kernelvmware-esxi

a long shot, but figured I'd give here a try (no solution on VMware community forum).

In a Linux guest (CentOS 5.7 64-bit) with vmxnet3 vNIC we are getting a few hundred kernel errors per day on primary eth0, DMZ NIC, which handles majority of network traffic (eth1 & eth2 perform backups and other non-frequent network activity).

All 3 NICs have vmxnet3 as their adapter type, but the kernel errors only occur on eth0, the only NIC with public exposure (via Cisco ASA NAT'd public IPs).

Sample log entry:

Nov  2 17:49:40 localhost kernel: eth0: tq error 0x80000000
Nov  2 17:49:40 localhost kernel: eth0: resetting
Nov  2 17:49:40 localhost kernel: eth0: intr type 2, mode 0, 1 vectors allocated
Nov  2 17:49:40 localhost kernel: eth0: NIC Link is Up 10000 Mbps

The entries are disconcerting given that eth0 went down yesterday and had to be ifup'd (although new server has been up for 2 weeks without issue otherwise).

Going to downgrade to vmxnet2 in the AM and see if that resolves the issue, but for sake of myself and future sufferers of this issue, I'll leave this out there — every problem at some point has a solution 😉

Best Answer

Just some guesses.

You also might try using the e1000 driver instead of the vmxnet3. It's limitation is 1G MBits but it might be a good backup test.

Just a thought are you on the current level of VMWare Tools on the host system? You might have to re-install the VMWare Tools after a kernel upgrade.

Is there the possibility of an actual ethernet h/w error in the ESX host itself?

Is the OS driver/kernel up-to-date?

 Linux hostname 2.6.18-274.7.1.el5 #1 SMP Thu Oct 20 16:21:01 EDT 2011 x86_64 x86_64 x86_64 GNU/Linux
Related Topic