[MPICH] Fatal error in MPI_Test
Kevin Van Workum
vanw at tticluster.com
Thu Sep 20 08:52:03 CDT 2007
I have a user getting the following error messages, apparently from MPICH2.
One node gets this message:
[cli_2]: aborting job:
Fatal error in MPI_Isend: Other MPI error, error stack:
MPI_Isend(145).............: MPI_Isend(buf=0x24d9f0, count=939,
MPI_DOUBLE_PRECISION, dest=6, tag=887, MPI_COMM_WORLD,
request=0x8612e58) failed
MPIDI_EagerContigIsend(468): failure occurred while attempting to send
an eager message
MPIDU_Sock_writev(625).....: connection closed by peer
(set=0,sock=5,errno=32:Broken pipe)
All the others get this message:
[cli_0]: aborting job:
Fatal error in MPI_Test: Other MPI error, error stack:
MPI_Test(145).............................:
MPI_Test(request=0x85e5518, flag=0xbf86f0b4, status=0x85e5520) failed
MPIDI_CH3I_Progress(144)..................: handle_sock_op failed
MPIDI_CH3I_Progress_handle_sock_event(175):
MPIDU_Socki_handle_read(607)..............: connection closed by peer
(set=0,sock=4)
I'm using mpich2-1.0.5p4, ssm, on a system running Torque and OSC's
mpiexec. If anyone has a clue as to the cause of these errors, please
let me know.
Kevin
--
Kevin Van Workum, Ph.D.
Vice President
Senior System Administrator
www.clusterondemand.com
ONLINE COMPUTER CLUSTERS
More information about the mpich-discuss
mailing list