MPI not running with Scyld

David Leunen leunen.d at fsagx.ac.be
Mon Mar 25 08:29:44 PST 2002


Hello everyone,

I installed Scyld Beowulf 27z-8.
When I boot a slave node, here are the lines that seems errors. I also 
provide the complete log file.

[...]
eth0: 3Com 3C905C Tornado  *** INVALID CHECKSUM 002F ***
[...]
hda: no response (satus = 0xa1), resetting drive
hda: no response (satus = 0xa1)
hdb: no response (satus = 0xa1), resetting drive
hdb: no response (satus = 0xa1)
[...]

It obviously looks for hard drives though we've diskless nodes ...
Here is what the /etc/beowulf/fstab looks like:

/dev/ram3      /         ext2    fs_size=65536      0 0
none           /proc     proc    defaults           0 0
none           /dev/pts  devpts  gid=5,mode=620     0 0
$MASTER:/home  /home     nfs     defaults           0 0


Even displaying these errors, the nodes complete booting sequences and 
become UP. But if I try to run an MPI program (one of the examples)
It says, on master node:

p0_7699: p4_error: net_create_slave: bproc_rfork: -1
     p4_error: latest msg from perror: Operation not permitted
bm_list_7650: p4_error: interrupt SIGINT: 2

and on the slave nodes:

vmadump: mmap failed: /usr/lib/mlibbproc.so.1.0.0

Can anyone tell me what is wrong.

Thank you.


David




---------------------------------------------------------

node_up: Setting system clock.
mke2fs 1.18, 11-Nov-1999 for EXT2 FS 0.5b, 95/08/09
ext2fs_check_if_mountFilesystem label=
OS type: Linux
Block size=1024 (log=0)
Fragment size=1024 (log=0)
128 inodes, 1024 blocks
51 blocks (4.98%) reserved for the super user
First data block=1
1 block group
8192 blocks per group, 8192 fragments per group
128 inodes per group

Writing inode tables: 0/1done
Writing superblocks and filesystem accounting information: : No such 
file or directory while determining whether /dev/ram1 is mounted.
done
node_up: TODO set interface netmask.
node_up: Configuring loopback interface.
/dev/hda: No such device
beoboot: /lib/modules/2.2.16-21.beosmp/modules.dep missing
/usr/lib/beoboot/bin/node_modprobe: 
/lib/modules/2.2.16-21.beosmp/modules.dep: No such file or directory
setup_fs: Checking /dev/ram3 (type=ext2)...
e2fsck 1.18, 11-Nov-1999 for EXT2 FS 0.5b, 95/08/09
ext2fs_check_if_mount: No such file or directory while determining 
whether /dev/ram3 is mounted.
Couldn't find ext2 superblock, trying backup blocks...

The e2fsck: Bad magic number in super-block while trying to open /dev/ram3
setup_fs: FSCK failure.
setup_fs: Creating ext2 on /dev/ram3...
superblock could not be read or does not describe a correct ext2
filesystem.  If the device is valid and it really contains an ext2
filesystem (and not swap or ufs or something else), then the superblock
is corrupt, and you might try running e2fsck with an alternate superblock:
     e2fsck -b 8193 <device>

mke2fs 1.18, 11-Nov-1999 for EXT2 FS 0.5b, 95/08/09
ext2fs_check_if_mount: No such file or directory while determining 
whether /dev/ram3 is mounted.
setup_fs: Mounting /dev/ram3 on /rootfs//... (type=ext2; options=defaults)
setup_fs: Checking 10.0.0.1:/home (type=nfs)...
setup_fs: Mounting 10.0.0.1:/home on /rootfs//home... (type=nfs; 
options=defaults)
beoboot: /lib/modules/2.2.16-21.beosmp/modules.dep missing
/usr/lib/beoboot/bin/node_modprobe: 
/lib/modules/2.2.16-21.beosmp/modules.dep: No such file or directory
node_modprobe: installing kernel module: nfs
/tmp/nfs.o: unresolved symbol rpc_execute_Rsmpf4ada430
/tmp/nfs.o: unresolved symbol xprt_create_proto_Rsmp6093ad98
/tmp/nfs.o: unresolved symbol rpc_create_client_Rsmp7777f9b5
/tmp/nfs.o: unresolved symbol nlmclnt_proc_Rsmp1dcdbeda
/tmp/nfs.o: unresolved symbol rpc_wake_up_next_Rsmpf17f0ee2
/tmp/nfs.o: unresolved symbol rpc_shutdown_client_Rsmp1e425c72
/tmp/nfs.o: unresolved symbol rpc_proc_register_Rsmp969b20f2
/tmp/nfs.o: unresolved symbol xprt_destroy_Rsmpbb395036
/tmp/nfs.o: unresolved symbol rpc_proc_unregister_Rsmp5bd26000
/tmp/nfs.o: unresolved symbol rpcauth_lookupcred_Rsmp692156ca
/tmp/nfs.o: unresolved symbol rpc_proc_init_Rsmpf56e5632
/tmp/nfs.o: unresolved symbol rpc_release_task_Rsmp576c36de
/tmp/nfs.o: unresolved symbol rpc_sleep_on_Rsmp2829779f
/tmp/nfs.o: unresolved symbol rpc_clnt_sigunmask_Rsmpa353229e
/tmp/nfs.o: unresolved symbol rpc_register_sysctl_Rsmpbf9a77c0
/tmp/nfs.o: unresolved symbol rpc_call_setup_Rsmpbcf9dc01
/tmp/nfs.o: unresolved symbol rpc_wake_up_task_Rsmp46a9490b
/tmp/nfs.o: unresolved symbol rpc_allocate_Rsmp0cd1c989
/tmp/nfs.o: unresolved symbol rpc_init_task_Rsmp847aa7dd
/tmp/nfs.o: unresolved symbol rpciod_down_Rsmpbabf0f35
/tmp/nfs.o: unresolved symbol nfs_debug_Rsmpaf5bf6ef
/tmp/nfs.o: unresolved symbol rpciod_up_Rsmp375492a4
/tmp/nfs.o: unresolved symbol rpc_do_call_Rsmpd9e876b4
/tmp/nfs.o: unresolved symbol xdr_encode_string_Rsmpabc0fe0c
/tmp/nfs.o: unresolved symbol rpc_clnt_sigmask_Rsmpa52bba47
/tmp/nfs.o: unresolved symbol rpc_killall_tasks_Rsmpc1303e64
mount: fs type nfs not supported by kernel
Failed to mount 10.0.0.1:/home on /home.





More information about the Beowulf mailing list