[mpich-discuss] Problem using Total View with mpiexec.hydra

Anatoly G anatolyrishon at gmail.com
Thu May 10 00:23:22 CDT 2012


Dear MPICH2 team,
I have the following short program that never terminates:
1) It calls MPI_Init.
2) It calls MPI_Recv without any matching send, so the program stalls (as expected).

I'm using MPICH2.
I run the program on Red Hat Enterprise Linux Server release 6.0 (Santiago), 64-bit.
My shell is /bin/tcsh.

I want to debug only the first process (rank 0) of the program with
TotalView (version 8.9.0-0).

My command line is:
/space/local/mpich2/bin/mpiexec.hydra -genvall -disable-auto-cleanup -f
machines.txt -launcher=rsh -n 1 /usr/bin/tv mpi_endless : -n 1 mpi_endless

TotalView starts.
I begin stepping through the program, but it fails at the MPI initialization
call. I tried each of the following three variants, one per run, and all of them fail:
*)  // MPI_Init_thread( 0, 0, MPI_THREAD_MULTIPLE, &provided );
*)  // MPI_Init_thread( 0, 0, MPI_THREAD_SINGLE, &provided );
*)   MPI_Init(&argc, &argv);

*The error message that I see in the shell window:*
system msg for write_line failure : Bad file descriptor
[cli_0]: Unable to write to PMI_fd
[cli_0]: write_line error; fd=6 buf=:cmd=get_appnum
:
system msg for write_line failure : Bad file descriptor
Fatal error in MPI_Init: Other MPI error, error stack:
MPIR_Init_thread(411):
MPID_Init(138).......: channel initialization failed
MPID_Init(423).......: PMI_Get_appnum returned -1



Can you please advise me?


Anatoly.
-------------- next part --------------
boogie:100
-------------- next part --------------
A non-text attachment was scrubbed...
Name: mpi_endless.cpp
Type: text/x-c++src
Size: 509 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20120510/11cb1178/attachment.cpp>