[MPICH] collective abort of all ranks
Rajeev Thakur
thakur at mcs.anl.gov
Sun Jun 10 11:08:26 CDT 2007
I am able to run your program with pgf90 and gcc 3.3.5 without any problem.
Rajeev
> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of
> Kamaraju Kusumanchi
> Sent: Saturday, June 09, 2007 10:12 PM
> To: mpich-discuss at mcs.anl.gov
> Subject: [MPICH] collective abort of all ranks
>
> Hi all,
>
> I would appreciate if someone can help me with the following code. I
> am having trouble in understanding why the parallel version is giving
> errors while the serial version of it is working fine. Also, what does
> "collective abort of all ranks" mean?
>
> Consider the test.f90 attached in this email.
>
> $mpif90 test.f90
>
> compiles fine. However, when I run it gives the following error.
>
> $mpiexec -l -n 4 ./a.out
> rank 3 in job 282 node1.jit.mae.cornell.edu_33436 caused collective
> abort ofall ranks
> exit status of rank 3: killed by signal 11
> rank 1 in job 282 node1.jit.mae.cornell.edu_33436 caused collective
> abort ofall ranks
> exit status of rank 1: killed by signal 11
>
> Do you think this is a memory issue? But the size of the arrays
> involved is pretty small 600x600 double precision real arrays.
> Moreover the code does not give any errors if I comment out all the
> MPI statements and just run it using f90.
>
> $mpif90 -show
> f90
> -I/home/raju/software/compiledLibs/mpich2_1.0.5p4_gcc_4.3.0_ab
soft_8.0/include
> -p/home/raju/software/compiledLibs/mpich2_1.0.5p4_gcc_4.3.0_ab
soft_8.0/include
> -L/home/raju/software/compiledLibs/mpich2_1.0.5p4_gcc_4.3.0_ab
soft_8.0/lib
> -lmpichf90 -lmpichf90 -lmpich -lpthread -lrt
>
> Here f90 is absoft 8.0 fortran 90 compiler. I am using mpich2 1.0.5p4,
> compiled with absoft fortran compiler 8.0, gcc 4.3.0.
>
> $mpdtrace -l
> node1.jit.mae.cornell.edu_33436 (192.168.1.1)
> node3.jit.mae.cornell.edu_55240 (192.168.1.3)
> node2.jit.mae.cornell.edu_54930 (192.168.1.2)
> node4.jit.mae.cornell.edu_46424 (192.168.1.4)
>
> $gcc -v
> Using built-in specs.
> Target: i386-pc-linux-gnu
> Configured with: /home/fxcoudert/gfortran_nightbuild/trunk/configure
> --prefix=/home/fxcoudert/gfortran_nightbuild/irun-20070123
> --enable-languages=c,fortran --host=i386-pc-linux-gnu
> --with-gmp=/home/fxcoudert/gfortran_nightbuild/software
> Thread model: posix
> gcc version 4.3.0 20070123 (experimental)
>
>
> Can someone, please clarify what I am doing wrong? Any
> suggestions/comments to overcome this problem are most welcome.
>
> thanks
> raju
>
More information about the mpich-discuss
mailing list