Mysterious kernel hangs
hahn at coffee.psychology.mcmaster.ca
Thu Mar 15 13:35:25 PST 2001
> If this is happening on all 16 nodes, it sounds very, very much like a
> kernel deadlock, although problems with the specific motherboard/chipset
> cannot be ruled out. Can't help you with the motherboard if that turns
I see essentially zero reason to blame the kernel, especially since:
1. it's new hardware, of unknown quality and config.
2. he's tried a HUGE variety of kernels (2.2 shares very
little with 2.4.2-ac20!)
3. he's demonstrated it under purely compute loads.
I'd be looking at whether the power is clean, bios/jumper settings,
reducing the number of cards, lm-sensors, checking s-specs, etc.
More information about the Beowulf