[Beowulf] Problem while booting diskless node.

Maxime Kinet mkinet at ulb.ac.be
Wed May 2 08:11:20 PDT 2007


ok, I succeeded.
Thanks to all for helpfull comments.

------------------
Maxime Kinet
Université Libre de Bruxelles
Physique Statistique et Plasmas, CP 231
Campus Plaine - Boulevard du Triomphe,
1050 Bruxelles.

Tel.   : +32-2-650.59.08
e-mail : mkinet at ulb.ac.be


On 01 May 2007, at 12:34, Panagiotis Christopoulos wrote:

> On Monday 30 April 2007 13:04, Maxime Kinet wrote:
>> Hi,
>>
>> I'm trying to set up my first cluster with diskless nodes. To achieve
>> that, I'm using PXElinux on a server, running Fedora Core 6, and a
>> NFS-mounted root partition on the node.
>> Everything works perfectly
>> (getting the IP address, loading the kernel and mounting the
>> filesytem) until the node has to run some binaries located into /sbin
>> during the boot process. Apparently it's unable to execute them
>> because they have been compiled with dynamically linked libraries and
>> not statically. The /sbin directory of the node is a simple copy of
>> the one of the server.
>  I suppose, that you have done something like, creating a /diskless  
> directory
> inside your nfs,tftp,dhcp etc... server, copying files(eg. /usr/* )  
> from the
> server, inside that /diskless dir with the same hierarchy, and this  
> resulted
> in a structure which would home, your nfs exported, root fs of your  
> nodes.
> I'm not an expert, but because you said about dynamic libraries, I  
> cannot
> understand, why this is a problem, you copied /sbin inside / 
> diskless, but you
> didn't copy /lib or /usr/lib? The problem with dynamic libraries,  
> starts
> because your /diskless does not have these libraries.
>> I tried to avoid the problem using the busybox
>> tools, and it worked a bit better but then it couldn't execute bash
>> scripts such as rc.sysinit.
> Have you created an init in your busybox, to chroot(exec  
> switch_root) inside
> your nfs root fs after mounting it?
>> As anybody ever encountered such problems and what should I do to
>> solve it? recompile the kernel of the node or of the server? change
>> the distribution? Are there any other simpler method to proceed than
>> using PXE?
> There are two things you can do. As Douglas Eadline said, it starts by
> thinking if you want to reinvent the wheel or not. If you have time,
> machines, if you know that your teachers won't get annoyed  and you  
> can work
> in a university lab, so you will not pay for the power supply  
> yourself:p
> continue with fedora and all these brainstorming things. You will  
> learn linux
> administration and propably you will do amazing things. If you  
> don't have
> time etc. the guys in warewulf, are doing the same job for about 7  
> years, and
> they provide you all this knowledge they gained, in a simple  
> installation
> process.
>   Back in the technical stuff, from your sayings, I think that  
> something is
> wrong with your /diskless dir(if you have one, of course). I cannot
> understand why you want to use busybox. We use busybox when why  
> want an
> initramfs to do specific jobs(such as unlocking  and mounting  
> encrypted
> partitions,yes, I know this is not the best example I could give)  
> before
> chrooting inside our real root fs and exec init as Mark Hahn said  
> or if we
> are running embedded. Also, I don't think that you have to change your
> distribution, and if you don't like PXE, you can see how the guys  
> in LTSP
> boot( I think they use both PXE and "etherboot" and you can make a  
> choice),
> but for me, syslinux is fine!
>
> This was my point of view, I hope I helped and if you want to ask  
> something,
> feel free to send me a mail, or of course, ask again in the list,
>
> Panagiotis Christopoulos
> System Administrator
> Technological Institute of Athens
> Department of Informatics





More information about the Beowulf mailing list