[mpich-discuss] mpich2-1.2.1p1 runs for a while and failed

Koh Voon Li kohvoonli at gmail.com
Sun Feb 27 20:47:39 CST 2011


Hi, I am running 2 PC both with Window 7 home premium edition for parallel
calculation by using MPICH2 version mpich2-1.2.1p1, it run for 3D FDS
calculation which runs for a while and then fails with a number of MPI error
messages as below.

Fatal error in MPI_Allreduce: Other MPI error, error stack:
MPI_Allreduce(773)........................:
MPI_Allreduce(sbuf=000000003FC70738,
 rbuf=000000003FC706F8, count=10, MPI_LOGICAL, MPI_LXOR, MPI_COMM_WORLD)
failed
MPIR_Bcast(1031)..........................:
MPIR_Bcast_binomial(157)..................:
MPIC_Recv(83).............................:
MPIC_Wait(513)............................:
MPIDI_CH3i_Progress_wait(215).............: an error occurred while handling
an
event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(420):
MPIDU_Sock_wait(2606).....................: The semaphore timeout period has
exp
ired. (errno 121)
Fatal error in MPI_Allreduce: Other MPI error, error stack:
MPI_Allreduce(773)........................:
MPI_Allreduce(sbuf=000000003FC707B8,
 rbuf=000000003FC70778, count=10, MPI_LOGICAL, MPI_LXOR, MPI_COMM_WORLD)
failed
MPIR_Allreduce(289).......................:
MPIC_Sendrecv(164)........................:
MPIC_Wait(513)............................:
MPIDI_CH3i_Progress_wait(215).............: an error occurred while handling
an
event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(420):
MPIDU_Sock_wait(2606).....................: The semaphore timeout period has
exp
ired. (errno 121)

I tried to ping test on each PC and its failed. It seem like I got no
response from the network adapter.
I disabled the network adapter and enabled it then everything seem to be
normal again.
Both PC are connected by using a crossover cable.
Thanks.
Regards,
Koh
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20110228/2514a5a9/attachment-0001.htm>


More information about the mpich-discuss mailing list