[Beowulf] Timeout on eth0 when booting disk less
tegner at renget.se
Sun Mar 12 03:01:40 PDT 2017
I'm booting some nodes disk less, using root file system on NFS. The
first phase (PXE, tftp etc) works without problems. The second phase,
when the system is actually supposed to boot over NFS the machine hangs
about every second time - and it seems to be a result of the relevant
nic (eth0) not negotiating its properties before some kind of timeout
kicks in. Obviously resulting in a failure to boot the NFS file system.
If I reboot the node it eventually works.
This could possibly be a problem with the switch, but one remedy would
seem to be to extend the time before the timeout kicks in (it seems to
be a few seconds at the moment). Any hints on how to achieve this?
More information about the Beowulf