[mpich-discuss] Problems with mpiexec.hydra in mpich2-1.2

Bryan Putnam bfp at purdue.edu
Sun Oct 18 14:05:00 CDT 2009


I've recently installed mpich2-1.2 on three of our Linux clusters, and 
have found that mpiexec.hydra is no longer working on any of them. One is 
a Chelsio cluster, the other two are Infiniband, and all three are running 
RHEL5.

The first few lines of the errors I'm seeing are appended below.

Note that if I use the same executable built with mpich2-1.2, and then use 
the previous mpich2-1.1.1p1 version of mpiexec.hydra. it still works fine.

Thanks,
Bryan

rossmann-000 1005% mpif90 hellof.f -o hellof
rossmann-000 1006% mpiexec.hydra -f $PBS_NODEFILE -np 4 ./hellof
*** glibc detected *** ./hellof: double free or corruption (fasttop): 
0x000000000bf022d0 ***
======= Backtrace: =========
/lib64/libc.so.6[0x39dc4722ef]
/lib64/libc.so.6(cfree+0x4b)[0x39dc47273b]
/apps/rhel5/mpich2-1.2/64/nemesis-intel-11.1.038/lib/libmpich.so.1.2(MPIDI_Populate_vc_node_ids+0x636)[0x2b41873bfcf6]
/apps/rhel5/mpich2-1.2/64/nemesis-intel-11.1.038/lib/libmpich.so.1.2(MPID_Init+0x114)[0x2b41873b2054]
/apps/rhel5/mpich2-1.2/64/nemesis-intel-11.1.038/lib/libmpich.so.1.2(MPIR_Init_thread+0x2a4)[0x2b41873aca44]
/apps/rhel5/mpich2-1.2/64/nemesis-intel-11.1.038/lib/libmpich.so.1.2(MPI_Init+0x262)[0x2b41873ac562]
/apps/rhel5/mpich2-1.2/64/nemesis-intel-11.1.038/lib/libmpich.so.1.2(mpi_init__+0x20)[0x2b41873ac290]
./hellof[0x400f51]
./hellof[0x400eec]
/lib64/libc.so.6(__libc_start_main+0xf4)[0x39dc41d994]
./hellof[0x400df9]
======= Memory map: ========
00400000-00402000 r-xp 00000000 00:1c 3841958377                         
/autohome/u100/bfp/demos/MPI


More information about the mpich-discuss mailing list