[vortex] 3C905C Hangs

ruben@nutz.nl ruben@nutz.nl
Wed, 10 Oct 2001 14:34:55 +0200


On Tue, Oct 09, 2001 at 11:07:52PM -0400, Karyl F. Stein wrote:

> > Try to update to a more recent kernel like 2.4.9.
> I'd rather save that one as a last resort.

As others allready said, older 2.4-kernels are rather flaky. I'm still using
2.2.19 whenever possible. :)

> Address			HWtype	HWaddress	    Flags Mask		  Iface
> 192.168.2.146          	        (incomplete) 		eth0
> 192.168.2.145          	ether   00:02:44:0C:61:84   C	eth
> 192.168.2.144          	ether   00:02:44:0C:61:85   C	eth0
> Entries: 3	Skipped: 0	Found: 3

You have not only lost the hw-address of your www-1 box, but also lost
connectivity to a working nameserver, hence the raw adresses.

> Kernel Interface table
> Iface   MTU Met    RX-OK RX-ERR RX-DRP RX-OVR    TX-OK TX-ERR TX-DRP TX-OVR
> eth0   1500   0   512915      0      0      0    69151      0      0      0
> lo    16436   0       21      0      0      0       21      0      0      0

On layer 3 everything still looks nice.

> Iface
> 192.168.2.0     0.0.0.0         255.255.255.0   U     0      0        0 eth0
> 127.0.0.0       0.0.0.0         255.0.0.0       U     0      0        0 lo
> 0.0.0.0         192.168.2.144   0.0.0.0         UG    0      0        0 eth0

and your default gateway is gone.

Are you using a routing protocol like RIP2?

I'm thinking along two lines. One is that for some reason you lose the
*route* to the other machine. The other is that somehow the other machine
becomes disconnected, which would point to cabling or damaged connectors.

Does this *only* happen during large transfers? 

Have you played with any kernel-parameters like /proc/sys/net/ipv4/*? 

Could you install arp-watch on 192.168.2.144 or 145 and tell us what if
anything is happening?

Probably unrelated: your nameserver seems to be failing certain queries:

$ host -l -a xenos.net ns-1.xenos.net
[snip]
mail.xenos.net	43200 IN	A	65.104.130.145
mail.xenos.net	43200 IN	HINFO	Intel Pentium 133	RedHat Linux 7.1
mymai\000	0 0	0	???
ListHosts: error receiving zone transfer:
  result: NOERROR, answers = 168, authority = 49152, additional = 705

-- 
Ruben

	Q: How many IBM 370's does it take to execute a job?

	A: Four. Three to hold it down, and one to rip its head off.