2^31 packet counter bug?

Jerry Glomph Black black@real.com
Sun Dec 12 05:13:38 1999


2 billion packets and you're off the air...?

I've seen several 2.2.12/2.2.13 machines lose
their network connections after functioning perfectly
a long period.   Tonight a number of our busy machines fell off
the net.  I visited one of the
boxes, it had not crashed at all.   However, it was
not communicating via its
(Intel eepro100) ethernet port.

The evil evidence:

eth0      Link encap:Ethernet  HWaddr 00:90:27:50:A8:DE
          inet addr:172.16.0.20  Bcast:172.16.255.255  Mask:255.255.0.0
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:15 errors:288850 dropped:0 overruns:0 frame:0
          TX packets:2147483647 errors:1 dropped:0 overruns:0 carrier:0
          collisions:0 txqueuelen:100
          Interrupt:10 Base address:0xd000



Check out the TX packets number!  That's 2^31-1.

This is a busy (router) machine.  It takes about a week to hit this many
packets.   And it took a week for this box to fall off the net.

ifconfigging eth0 down-and-up did not clear the
problem.  A reboot was necessary... What is this,
NT or something?  Bleah!

This is absolutely dreadful.  Is this 'feature' a
property of the eepro100
driver, or something in the kernel itself?
Either way, it's a killer.  Gotta get to the bottom of it.

Standard 2.2.12/2.2.13 kernels, with included eepro100 code.