[mpich-discuss] Problem sometimes when running on winxp on >=2 processes and MPE_IBCAST

Rajeev Thakur thakur at mcs.anl.gov
Wed May 7 10:31:41 CDT 2008


Can you send us the code? 

MPE_IBCAST is not part of the MPI standard, and MPICH2 has no equivalent
for it. You could spawn a thread that calls MPI_Bcast, though, as long as
you observe the caveats on MPI and threads defined in the standard.

Rajeev 

> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Ben Tay
> Sent: Wednesday, May 07, 2008 10:25 AM
> To: mpich-discuss at mcs.anl.gov
> Subject: [mpich-discuss] Problem sometimes when running on 
> winxp on >=2 processes and MPE_IBCAST
> 
> Hi,
> 
> I tried to run an MPI code copied from an example in 
> the RS 6000 book. It is supposed to broadcast and synchronize 
> all values. When I run it on my school's Linux servers, there 
> is no problem. However, if I run it on my own WinXP machine on >=2 
> processes, sometimes it works and other times I get the error:
> 
> [01:3216].....ERROR:result command received but the wait_list is empty.
> [01:3216]...ERROR:unable to handle the command: "cmd=result src=1 dest=1 tag=7 cmd_tag=3 cmd_orig=dbget ctx_key=1 value="port=1518 description=gotchama-16e5ed ifname=192.168.1.105 " result=DBS_SUCCESS "
> [01:3216].ERROR:error closing the unknown context socket: generic socket failure , error stack:
> MPIDU_Sock_wait(2603): The I/O operation has been aborted because of either a thread exit or an application request. (errno 995)
> [01:3216]..ERROR:sock_op_close returned while unknown context is in state: SMPD_IDLE
> 
> Or
> 
> [01:3308].....ERROR:result command received but the wait_list is empty.
> [01:3308]...ERROR:unable to handle the command: "cmd=result src=1 dest=1 tag=15 cmd_tag=5 cmd_orig=barrier ctx_key=0 result=DBS_SUCCESS "
> [01:3308]..ERROR:sock_op_close returned while unknown context is in state: SMPD_IDLE
> 
> There is no problem if I run on 1 process. If it's >=4, then 
> the error happens all the time. Moreover, it's a rather 
> simple code and so there shouldn't be anything wrong with it. 
> Why is this so?
> 
> Btw, the RS 6000 book also mentions a routine called 
> MPE_IBCAST, which is a non-blocking version of MPI_BCAST. Is 
> there a similar routine in MPICH2?
> 
> Thank you very much
> 
> Regards.