[mpich-discuss] Problem sometimes when running on winxp on >=2 processes and MPE_IBCAST
Rajeev Thakur
thakur at mcs.anl.gov
Wed May 7 10:31:41 CDT 2008
Can you send us the code?
MPE_IBCAST is not a part of the MPI standard. There is no equivalent for it
in MPICH2. You could spawn a thread that calls MPI_Bcast though (after
following all the caveats of MPI and threads as defined in the standard).
Rajeev
> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Ben Tay
> Sent: Wednesday, May 07, 2008 10:25 AM
> To: mpich-discuss at mcs.anl.gov
> Subject: [mpich-discuss] Problem sometimes when running on
> winxp on >=2 processes and MPE_IBCAST
>
> Hi,
>
> I tried to run a mpi code which is copied from an example by
> the RS 6000 book. It is supposed to broadcast and synchronize
> all values. When I ran it on my school's linux servers, there
> is no problem. However, if I run it on my own winxp, on >=2
> processes, sometimes it work, other times I get the error:
>
> [01:3216].....ERROR:result command received but the wait_list
> is empty.
> [01:3216]...ERROR:unable to handle the command: "cmd=result
> src=1 dest=1
> tag=7 c
> md_tag=3 cmd_orig=dbget ctx_key=1 value="port=1518
> description=gotchama-16e5ed i
> fname=192.168.1.105 " result=DBS_SUCCESS "
> [01:3216].ERROR:error closing the unknown context socket:
> generic socket failure , error stack:
> MPIDU_Sock_wait(2603): The I/O operation has been aborted
> because of either a th read exit or an application request.
> (errno 995) [01:3216]..ERROR:sock_op_close returned while
> unknown context is in
> state: SMPD_
> IDLE
>
> Or
>
> [01:3308].....ERROR:result command received but the wait_list
> is empty.
> [01:3308]...ERROR:unable to handle the command: "cmd=result
> src=1 dest=1
> tag=15
> cmd_tag=5 cmd_orig=barrier ctx_key=0 result=DBS_SUCCESS "
> [01:3308]..ERROR:sock_op_close returned while unknown context is in
> state: SMPD_
> IDLE
>
> There is no problem if I run on 1 process. If it's >=4, then
> the error happens all the time. Moreover, it's a rather
> simple code and so there shouldn't be anything wrong with it.
> Why is this so?
>
> Btw, the RS 6000 book also mention a routine called
> MPE_IBCAST, which is a non-blocking version of MPI_BCAST. Is
> there a similar routine in MPICH2?
>
> Thank you very much
>
> Regards.
>
>
>
>
>
More information about the mpich-discuss
mailing list