[mpich-discuss] Fatal error in MPI_Barrier

Rajeev Thakur thakur at mcs.anl.gov
Mon Feb 2 10:55:03 CST 2009


Are you really trying to use the wireless network? Looks like that's what is getting used.
 
You can use the mpdcheck utility to diagnose network configuration problems. See Appendix A.2 of the installation guide.
 
Rajeev


  _____  

From: mpich-discuss-bounces at mcs.anl.gov [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Antonio José Gallardo Díaz
Sent: Monday, February 02, 2009 9:49 AM
To: mpich-discuss at mcs.anl.gov
Subject: [mpich-discuss] Fatal error in MPI_Barrier


Hello, this error show me when i try my jobs that use MPI.


Fatal error in MPI_Barrier: Other MPI error, error stack:
MPI_Barrier(406).............................: MPI_Barrier(MPI_COMM_WORLD) failed
MPIR_Barrier(77).............................:
MPIC_Sendrecv(123)...........................:
MPIC_Wait(270)...............................:
MPIDI_CH3i_Progress_wait(215)................: an error occurred while handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(640)...:
MPIDI_CH3_Sockconn_handle_connopen_event(887): unable to find the process group structure with id <��oz�>[cli_1]: aborting job:
Fatal error in MPI_Barrier: Other MPI error, error stack:
MPI_Barrier(406).............................: MPI_Barrier(MPI_COMM_WORLD) failed
MPIR_Barrier(77).............................:
MPIC_Sendrecv(123)...........................:
MPIC_Wait(270)...............................:
MPIDI_CH3i_Progress_wait(215)................: an error occurred while handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(640)...:
MPIDI_CH3_Sockconn_handle_connopen_event(887): unable to find the process group structure with id <��oz�>
rank 1 in job 15  wireless_43226   caused collective abort of all ranks
  exit status of rank 1: killed by signal 9

I have two PC's with linux (kubuntu 8.10). I make a cluster using this machines. When use for example the command "mpiexec -l -n 2 hostname" i can see that it's all right, but when i try to send o receive some thing i have the same error. I don't know why. Please i need one hand. Thanks for all. 


  _____  

El doble de diversión: Con Windows Live Messenger comparte fotos mientras hablas. <http://www.microsoft.com/windows/windowslive/messenger.aspx>  

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20090202/010865fc/attachment.htm>


More information about the mpich-discuss mailing list