[mpich-discuss] errors while running the test suite of mpich2

Rajeev Thakur thakur at mcs.anl.gov
Wed Oct 22 14:35:22 CDT 2008


If you configure with --disable-aio, many of the I/O ones will go away. Some
of the others may be timing out because you are writing to NFS, which is
slow.

Rajeev  

> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of 
> Kamaraju Kusumanchi
> Sent: Wednesday, October 22, 2008 11:39 AM
> To: mpich-discuss at mcs.anl.gov
> Subject: [mpich-discuss] errors while running the test suite of mpich2
> 
> The cluster configuration is something like
> 
> 
>                master
>                  |
>           ETHERNET SWITCH
>            /   /   \   \
>         /    /       \    \
>      /     /           \     \
>   /      /               \      \
> node1  node2 .......... node16 node17
>   \     \                /      /
>      \    \            /     /
>         \   \        /    /
>            \  \    /   /
>         INFINIBAND SWITCH
> 
> 
> All the compilations have to be performed on the master node. All the
> executions have to be performed on the slave nodes only.
> 
> I configured mpich2 1.0.7 (on the master node) using gfortran 4.3.2,
> gcc 4.3.2 with the following command
> 
> ~/software/unZipped/mpich2-1.0.7/configure --enable-f77 --enable-f90
> --prefix=/home6/raju/software/compiledLibs/mpich2_1.0.7_gcc_4.
3.2_gfortran_4.3.2
> 2>&1 | tee configure.log
> 
> then I compiled the source code (on the master node) using the command
> 
> nohup make > make.log 2>&1 &
> 
> Then I installed the software using
> 
> nohup make install > install.log 2>&1 &
> 
> Next I logged into the slave node and performed installcheck using
> 
> mpd&
> make installcheck 2>&1 | tee installcheck.log
> 
> After that, I ran the testsuite (on the slave node) using
> 
> make testing 2>&1 | tee make_testing.log
> 
> At this stage, I am getting errors such as
> 
> Looking in ./f77/pt2pt/testlist
> Processing directory info
> Looking in ./f77/info/testlist
> Processing directory spawn
> Looking in ./f77/spawn/testlist
> Processing directory io
> Looking in ./f77/io/testlist
> Unexpected output in iwriteatf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program iwriteatf exited without No Errors
> Unexpected output in iwritef: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program iwritef exited without No Errors
> Unexpected output in iwriteshf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program iwriteshf exited without No Errors
> Unexpected output in writef: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writef exited without No Errors
> Unexpected output in writeatf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeatf exited without No Errors
> Unexpected output in writeallf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeallf exited without No Errors
> Unexpected output in writeshf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeshf exited without No Errors
> Unexpected output in writeordf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeordf exited without No Errors
> Unexpected output in writeatallf: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeatallf exited without No Errors
> Unexpected output in writeatallbef: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeatallbef exited without No Errors
> Unexpected output in writeallbef: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeallbef exited without No Errors
> Unexpected output in writeordbef: mpiexec_node1.ank.mae.cornell.edu
> (handle_sig_occurred 1122): job ending due to env var
> MPIEXEC_TIMEOUT=180
> Program writeordbef exited without No Errors
> Processing directory rma
> Looking in ./f77/rma/testlist
> Processing directory init
> Looking in ./f77/init/testlist
> 
> 
> .....
> 
> 
> Processing directory attr
> Looking in ./cxx/attr/testlist
> Processing directory pt2pt
> Looking in ./cxx/pt2pt/testlist
> Failed to build bsend1cxx; make[3]: Entering directory
> `/home6/raju/software/compileHere/mpich2-1.0.7/test/mpi/cxx/pt2pt'
> /home6/raju/software/compiledLibs/mpich2_1.0.7_gcc_4.3.2_gfort
ran_4.3.2/bin/mpicxx
> -DHAVE_CONFIG_H -I.
> -I/home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2pt
> -I../../include
> -I/home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2p
> t/../../include
>   -c 
> /home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2pt/
> bsend1cxx.cxx
> /home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2pt/
> bsend1cxx.cxx:
> In function 'int main(int, char**)':
> /home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2pt/
> bsend1cxx.cxx:81:
> error: 'strcmp' was not declared in this scope
> /home6/raju/software/unZipped/mpich2-1.0.7/test/mpi/cxx/pt2pt/
> bsend1cxx.cxx:91:
> error: 'strcmp' was not declared in this scope
> make[3]: *** [bsend1cxx.o] Error 1
> make[3]: Leaving directory
> `/home6/raju/software/compileHere/mpich2-1.0.7/test/mpi/cxx/pt2pt'
> 
> 
> 
> 
> I am attaching all the log files for your reference. I have tried
> mpich2 1.0.8rc1. I am getting similar errors when running the test
> suite. Could someone please tell me how to fix these problems?
> 
> thanks
> raju
> 




More information about the mpich-discuss mailing list