[PVE-User] Serious error with e1000e driver with Intel Corporation 82574L NIC

Pongrácz István pongracz.istvan at gmail.com
Thu Oct 3 11:10:46 CEST 2013


Hi,

So, the NIC died now again. I logged the NIC with findep, so, I have a log, later I check it.

Here is the report:
Uptime with the new kernel and e1000e driver: 7 days, 20:59

Dead NIC: eth1
eth1 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e3 
 inet6 addr: fe80::203:1dff:fe0b:8ae3/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:342904 errors:10075993274070 dropped:1679332212345 overruns:0 frame:6717328849380
 TX packets:91491 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000 
 RX bytes:175495398 (167.3 MiB) TX bytes:12947825 (12.3 MiB)
 Interrupt:17 Memory:e8900000-e8920000


Here is the working one:
eth0 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e2 
 inet6 addr: fe80::203:1dff:fe0b:8ae2/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:925760 errors:0 dropped:0 overruns:0 frame:0
 TX packets:396204 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000 
 RX bytes:455640232 (434.5 MiB) TX bytes:180212843 (171.8 MiB)
 Interrupt:16 Memory:e8a00000-e8a20000 

Here is the network topology (ethX, vmbrX etc., only OPENVZ containers, no KVM, linked to vmbr1 -> eth1):

eth0 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e2 
 inet6 addr: fe80::203:1dff:fe0b:8ae2/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:927285 errors:0 dropped:0 overruns:0 frame:0
 TX packets:397248 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000 
 RX bytes:456225665 (435.0 MiB) TX bytes:181038520 (172.6 MiB)
 Interrupt:16 Memory:e8a00000-e8a20000 

eth1 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e3 
 inet6 addr: fe80::203:1dff:fe0b:8ae3/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:342904 errors:14070312858420 dropped:2345052143070 overruns:0 frame:9380208572280
 TX packets:91491 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000 
 RX bytes:175495398 (167.3 MiB) TX bytes:12947825 (12.3 MiB)
 Interrupt:17 Memory:e8900000-e8920000 

eth2 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e4 
 UP BROADCAST MULTICAST MTU:1500 Metric:1
 RX packets:0 errors:0 dropped:0 overruns:0 frame:0
 TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000 
 RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
 Interrupt:18 Memory:e8800000-e8820000 


eth5 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e7
 UP BROADCAST MULTICAST MTU:1500 Metric:1
 RX packets:0 errors:0 dropped:0 overruns:0 frame:0
 TX packets:0 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:1000
 RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)
 Interrupt:17 Memory:e8500000-e8520000

lo Link encap:Local Loopback
 inet addr:127.0.0.1 Mask:255.0.0.0
 inet6 addr: ::1/128 Scope:Host
 UP LOOPBACK RUNNING MTU:16436 Metric:1
 RX packets:114157 errors:0 dropped:0 overruns:0 frame:0
 TX packets:114157 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:99029473 (94.4 MiB) TX bytes:99029473 (94.4 MiB)

venet0 Link encap:UNSPEC HWaddr 00-00-00-00-00-00-00-00-00-00-00-00-00-00-00-00
 inet6 addr: fe80::1/128 Scope:Link
 UP BROADCAST POINTOPOINT RUNNING NOARP MTU:1500 Metric:1
 RX packets:0 errors:0 dropped:0 overruns:0 frame:0
 TX packets:0 errors:0 dropped:3 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:0 (0.0 B) TX bytes:0 (0.0 B)

veth2101.0 Link encap:Ethernet HWaddr ae:b6:53:9c:14:77
 inet6 addr: fe80::acb6:53ff:fe9c:1477/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:3714 errors:0 dropped:0 overruns:0 frame:0
 TX packets:199997 errors:0 dropped:673 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:446113 (435.6 KiB) TX bytes:14918707 (14.2 MiB)

veth2102.0 Link encap:Ethernet HWaddr 06:57:76:97:ed:8e
 inet6 addr: fe80::457:76ff:fe97:ed8e/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:2440 errors:0 dropped:0 overruns:0 frame:0
 TX packets:209056 errors:0 dropped:722 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:140470 (137.1 KiB) TX bytes:15556447 (14.8 MiB)

veth2103.0 Link encap:Ethernet HWaddr 9a:bb:6f:e6:20:8a
 inet6 addr: fe80::98bb:6fff:fee6:208a/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:25326 errors:0 dropped:0 overruns:0 frame:0
 TX packets:239265 errors:0 dropped:700 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:3041917 (2.9 MiB) TX bytes:72043541 (68.7 MiB)

veth2104.0 Link encap:Ethernet HWaddr 52:a7:a5:05:be:5c
 inet6 addr: fe80::50a7:a5ff:fe05:be5c/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:33288 errors:0 dropped:0 overruns:0 frame:0
 TX packets:71063 errors:0 dropped:159 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:2550475 (2.4 MiB) TX bytes:85037338 (81.0 MiB)

vmbr0 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e2
 inet addr: XXXXXXXXXXXXX Bcast: XXXXXXXXXXXXX Mask:255.255.255.224
 inet6 addr: fe80::203:1dff:fe0b:8ae2/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:693946 errors:0 dropped:0 overruns:0 frame:0
 TX packets:360419 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:430967708 (411.0 MiB) TX bytes:178169459 (169.9 MiB)

vmbr1 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e3
 inet addr:10.0.2.2 Bcast:10.0.2.255 Mask:255.255.255.0
 inet6 addr: fe80::203:1dff:fe0b:8ae3/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:214893 errors:0 dropped:0 overruns:0 frame:0
 TX packets:14947 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:10425639 (9.9 MiB) TX bytes:5083108 (4.8 MiB)

vmbr5 Link encap:Ethernet HWaddr 00:03:1d:0b:8a:e7
 inet addr:192.168.0.247 Bcast:192.168.0.255 Mask:255.255.255.0
 inet6 addr: fe80::203:1dff:fe0b:8ae7/64 Scope:Link
 UP BROADCAST RUNNING MULTICAST MTU:1500 Metric:1
 RX packets:0 errors:0 dropped:0 overruns:0 frame:0
 TX packets:6 errors:0 dropped:0 overruns:0 carrier:0
 collisions:0 txqueuelen:0
 RX bytes:0 (0.0 B) TX bytes:508 (508.0 B)

pveversion:
proxmox-ve-2.6.32: 3.1-111 (running kernel: 2.6.32-24-pve)
pve-manager: 3.1-14 (running version: 3.1-14/d914b943)
pve-kernel-2.6.32-24-pve: 2.6.32-111
pve-kernel-2.6.32-23-pve: 2.6.32-109
lvm2: 2.02.98-pve4
clvm: 2.02.98-pve4
corosync-pve: 1.4.5-1
openais-pve: 1.1.4-3
libqb0: 0.11.1-2
redhat-cluster-pve: 3.2.0-2
resource-agents-pve: 3.9.2-4
fence-agents-pve: 4.0.0-2
pve-cluster: 3.0-7
qemu-server: 3.1-4
pve-firmware: 1.0-23
libpve-common-perl: 3.0-6
libpve-access-control: 3.0-6
libpve-storage-perl: 3.0-13
pve-libspice-server1: 0.12.4-2
vncterm: 1.1-4
vzctl: 4.0-1pve3
vzprocps: 2.0.11-2
vzquota: 3.1-2
pve-qemu-kvm: 1.4-17
ksm-control-daemon: 1.1-1
glusterfs-client: 3.4.0-2



lspci for NICs:

02:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: bus master, fast devsel, latency 0, IRQ 33
 Memory at e8a00000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 8000 [size=32]
 Memory at e8a20000 (32-bit, non-prefetchable) [size=16K]
 Expansion ROM at dfb00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e2
 Kernel driver in use: e1000e

03:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: fast devsel, IRQ 36
 [virtual] Memory at e8900000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 7000 [size=32]
 [virtual] Memory at e8920000 (32-bit, non-prefetchable) [size=16K]
 [virtual] Expansion ROM at dfc00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable- Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e3
 Kernel driver in use: e1000e

04:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: bus master, fast devsel, latency 0, IRQ 37
 Memory at e8800000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 6000 [size=32]
 Memory at e8820000 (32-bit, non-prefetchable) [size=16K]
 Expansion ROM at dfd00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e4
 Kernel driver in use: e1000e

05:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: bus master, fast devsel, latency 0, IRQ 38
 Memory at e8700000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 5000 [size=32]
 Memory at e8720000 (32-bit, non-prefetchable) [size=16K]
 Expansion ROM at dfe00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e5
 Kernel driver in use: e1000e

06:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: bus master, fast devsel, latency 0, IRQ 39
 Memory at e8600000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 4000 [size=32]
 Memory at e8620000 (32-bit, non-prefetchable) [size=16K]
 Expansion ROM at dff00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e6
 Kernel driver in use: e1000e

07:00.0 Ethernet controller: Intel Corporation 82574L Gigabit Network Connection
 Subsystem: Intel Corporation Device 0000
 Flags: bus master, fast devsel, latency 0, IRQ 40
 Memory at e8500000 (32-bit, non-prefetchable) [size=128K]
 I/O ports at 3000 [size=32]
 Memory at e8520000 (32-bit, non-prefetchable) [size=16K]
 Expansion ROM at e8d00000 [disabled] [size=2K]
 Capabilities: [c8] Power Management version 2
 Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
 Capabilities: [e0] Express Endpoint, MSI 00
 Capabilities: [a0] MSI-X: Enable- Count=1 Masked-
 Capabilities: [100] Advanced Error Reporting
 Capabilities: [140] Device Serial Number 00-03-1d-ff-ff-0b-8a-e7
 Kernel driver in use: e1000e

ETHTOOL
root at hn2 :~# ethtool -t eth1

The test result is FAIL
The test extra info:
Register test (offline) 40
Eeprom test (offline) 2
Interrupt test (offline) 4
Loopback test (offline) 0
Link test (on/offline) 0

ethtool -e eth1
Offset Values
------ ------
0x0000: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0010: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0020: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0030: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0040: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0050: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0060: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 
0x0070: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff 

Just to compare, here is eth0 and eth2:

ethtool -e eth0
Offset Values
------ ------
0x0000: 00 03 1d 0b 8a e2 30 0b 46 f7 11 30 ff ff ff ff
0x0010: ff ff ff ff 6b 02 00 00 86 80 d3 10 86 80 df 80
0x0020: 00 00 00 20 14 7e 00 00 00 00 d8 00 00 00 00 27
0x0030: c9 6c 50 31 2e 07 0b 04 84 09 00 00 00 c0 06 07
0x0040: 08 10 00 00 04 0f ff 7f 01 4d ff ff ff ff ff ff
0x0050: ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff ff
0x0060: 00 01 00 40 1c 12 07 40 ff ff ff ff ff ff ff ff
0x0070: ff ff ff ff ff ff ff ff ff ff ff ff ff ff 86 43

 ethtool -d eth1
MAC Registers
-------------
0x00000: CTRL (Device control register) 0xFFFFFFFF
 Endian mode (buffers): big
 Link reset: reset
 Set link up: 1
 Invert Loss-Of-Signal: yes
 Receive flow control: enabled
 Transmit flow control: enabled
 VLAN mode: enabled
 Auto speed detect: enabled
 Speed select: not used
 Force speed: yes
 Force duplex: yes
0x00008: STATUS (Device status register) 0xFFFFFFFF
 Duplex: full
 Link up: link config
 TBI mode: enabled
 Link speed: not used
 Bus type: PCI-X
 Bus speed: 133MHz
 Bus width: 64-bit
0x00100: RCTL (Receive control register) 0xFFFFFFFF
 Receiver: enabled
 Store bad packets: enabled
 Unicast promiscuous: enabled
 Multicast promiscuous: enabled
 Long packet: enabled
 Descriptor minimum threshold size: reserved
 Broadcast accept mode: accept
 VLAN filter: enabled
 Canonical form indicator: enabled
 Discard pause frames: ignored
 Pass MAC control frames: pass
 Receive buffer size: 4096
0x02808: RDLEN (Receive desc length) 0xFFFFFFFF
0x02810: RDH (Receive desc head) 0xFFFFFFFF
0x02818: RDT (Receive desc tail) 0xFFFFFFFF
0x02820: RDTR (Receive delay timer) 0xFFFFFFFF
0x00400: TCTL (Transmit ctrl register) 0xFFFFFFFF
 Transmitter: enabled
 Pad short packets: enabled
 Software XOFF Transmission: enabled
 Re-transmit on late collision: enabled
0x03808: TDLEN (Transmit desc length) 0xFFFFFFFF
0x03810: TDH (Transmit desc head) 0xFFFFFFFF
0x03818: TDT (Transmit desc tail) 0xFFFFFFFF
0x03820: TIDV (Transmit delay timer) 0xFFFFFFFF
PHY type: unknown


To compare, eth0:
ethtool -d eth0
MAC Registers
-------------
0x00000: CTRL (Device control register) 0x00100248
 Endian mode (buffers): little
 Link reset: reset
 Set link up: 1
 Invert Loss-Of-Signal: no
 Receive flow control: disabled
 Transmit flow control: disabled
 VLAN mode: disabled
 Auto speed detect: disabled
 Speed select: 1000Mb/s
 Force speed: no
 Force duplex: no
0x00008: STATUS (Device status register) 0x80080783
 Duplex: full
 Link up: link config
 TBI mode: disabled
 Link speed: 1000Mb/s
 Bus type: PCI
 Bus speed: 33MHz
 Bus width: 32-bit
0x00100: RCTL (Receive control register) 0x0400801A
 Receiver: enabled
 Store bad packets: disabled
 Unicast promiscuous: enabled
 Multicast promiscuous: enabled
 Long packet: disabled
 Descriptor minimum threshold size: 1/2
 Broadcast accept mode: accept
 VLAN filter: disabled
 Canonical form indicator: disabled
 Discard pause frames: filtered
 Pass MAC control frames: don't pass
 Receive buffer size: 2048
0x02808: RDLEN (Receive desc length) 0x00001000
0x02810: RDH (Receive desc head) 0x00000012
0x02818: RDT (Receive desc tail) 0x00000010
0x02820: RDTR (Receive delay timer) 0x00000020
0x00400: TCTL (Transmit ctrl register) 0x3103F0FA
 Transmitter: enabled
 Pad short packets: enabled
 Software XOFF Transmission: disabled
 Re-transmit on late collision: enabled
0x03808: TDLEN (Transmit desc length) 0x00001000
0x03810: TDH (Transmit desc head) 0x0000006D
0x03818: TDT (Transmit desc tail) 0x0000006D
0x03820: TIDV (Transmit delay timer) 0x00000008
PHY type: unknown




More information about the pve-user mailing list