clustermatic installation/loading of boot kernel
lothar at triumf.ca
Thu Dec 12 10:10:49 PST 2002
Erik A. Hendriks wrote:
>On Thu, Dec 12, 2002 at 09:22:43AM -0800, lothar at triumf.ca wrote:
>
>
>>Hi,
>>I am trying to set up a clustermatic/bproc based system. I installed
>>your latest version of the CD-ROM. I made boot images for a floppy and
>>for later loading. /etc/rc.d/init.d/beowulf starts without problems.
>>I have connected a video card and monitor to one of the diskless,
>>floppy-only slaves. The Ethernet cards on slave and master are both
>>3Com 905. When I boot up, the floppy installation seems to run
>>flawlessly until it makes contact with the master. When loading
>>/var/beowulf/boot.img it starts to spill out messages of the nature
>>missed block nnnn (e.g. 1340)
>>for a while; later it switches to some
>>rcv /var/beowulf/boot.img
>>followed occasionally by missed block messages.
>>
>>Being busy with other things, I left it for two days.
>>Amazingly enough, this morning it had booted successfully.
>>
>>What is going on?
>>
>>
>
>Most likely your network switch is dropping a lot of the multicast
>traffic. In my experience pretty much all managed switches can't
>handle even a few megabits per second of multicast traffic.
>
>The solution to this is to switch it to using broadcast instead of
>multicast. Put the following in /etc/beowulf/config:
>
>mcastbcast ethX # switch image service on ethX to broadcast
>
>You can also throttle the boot image transmits like this:
>
>mcastthrottle ethX NN # throttle multicast/broadcast on ethX to NN megabits/sec.
>
>- Erik
>
>
>
I put these commands into /etc/beowulf/config, with throttling set to
1 megabit/s.
I restarted /etc/rc.d/init.d/beowulf.
Unfortunately, the same messages appear.
Lothar
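
For reference, the two directives Erik describes would look roughly like
this in /etc/beowulf/config when throttled to 1 megabit/sec. This is only
a sketch: eth0 is an assumed interface name (the thread only says ethX)
and should be replaced by the master's cluster-facing interface.

```
# /etc/beowulf/config (fragment). eth0 is an assumption, not from the thread.
mcastbcast eth0         # switch image service on eth0 to broadcast
mcastthrottle eth0 1    # throttle multicast/broadcast on eth0 to 1 megabit/sec.
```

As described above, /etc/rc.d/init.d/beowulf was restarted after the
edit. It is also worth checking that the file edited is the one under
/etc/beowulf and that the interface named is the one the slaves boot from.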