[Beowulf] debugging
Matt Funk
mafunk at nmsu.edu
Thu Apr 12 16:42:11 PDT 2007
thanks for all the replies first of all,
i don't know the exact scyld distribution. However, i am running mpich 1.2.5.
When i run my program (stripped down to a mere MPI_INIT(...) call) and test it
with valgrind i get something like :
==21799== Use of uninitialised value of size 8
==21799== at 0x56F0252: vfprintf (in /lib64/libc-2.3.2.so)
==21799== by 0x570C844: vsprintf (in /lib64/libc-2.3.2.so)
==21799== by 0x56F7B69: sprintf (in /lib64/libc-2.3.2.so)
==21799== by 0x4F81755: net_create_slave
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F810B8: create_remote_processes
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F7D37A: p4_startup
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F7D1CC: p4_create_procgroup
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F9383B: MPID_P4_Init
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F9271B: MPID_CH_InitMsgPass
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4F87691: MPID_Init
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4FA32B3: MPIR_Init
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
==21799== by 0x4FA2F68: PMPI_Init
(in /usr/lib64/MPICH/p4/gnu/libmpich-gnu.so.1.0)
which i think is a problem with the mpi distribution. Does anyone have any
experience with building a new mpi library on a scyld machine? Should it be
straightforward?
thanks
mat
More information about the Beowulf
mailing list