[mpich-discuss] Problem sometimes when running on winxp on >=2 processes and MPE_IBCAST

Ben Tay zonexo at gmail.com
Wed May 7 10:59:43 CDT 2008


Hi Rajeev,

I've attached the code. Thank you very much.

Regards.

Rajeev Thakur wrote:
> Can you send us the code? 
>
> MPE_IBCAST is not a part of the MPI standard. There is no equivalent for it
> in MPICH2. You could spawn a thread that calls MPI_Bcast though (after
> following all the caveats of MPI and threads as defined in the standard). 
>
> Rajeev 
>
>   
>> -----Original Message-----
>> From: owner-mpich-discuss at mcs.anl.gov 
>> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Ben Tay
>> Sent: Wednesday, May 07, 2008 10:25 AM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: [mpich-discuss] Problem sometimes when running on 
>> winxp on >=2 processes and MPE_IBCAST
>>
>> Hi,
>>
>> I tried to run a mpi code which is copied from an example by 
>> the RS 6000 book. It is supposed to broadcast and synchronize 
>> all values. When I ran it on my school's linux servers, there 
>> is no problem. However, if I run it on my own winxp, on >=2 
>> processes, sometimes it work, other times I get the error:
>>
>> [01:3216].....ERROR:result command received but the wait_list 
>> is empty.
>> [01:3216]...ERROR:unable to handle the command: "cmd=result 
>> src=1 dest=1
>> tag=7 c
>> md_tag=3 cmd_orig=dbget ctx_key=1 value="port=1518 
>> description=gotchama-16e5ed i
>> fname=192.168.1.105 " result=DBS_SUCCESS "
>> [01:3216].ERROR:error closing the unknown context socket: 
>> generic socket failure , error stack:
>> MPIDU_Sock_wait(2603): The I/O operation has been aborted 
>> because of either a th read exit or an application request. 
>> (errno 995) [01:3216]..ERROR:sock_op_close returned while 
>> unknown context is in
>> state: SMPD_
>> IDLE
>>
>> Or
>>
>> [01:3308].....ERROR:result command received but the wait_list 
>> is empty.
>> [01:3308]...ERROR:unable to handle the command: "cmd=result 
>> src=1 dest=1
>> tag=15
>> cmd_tag=5 cmd_orig=barrier ctx_key=0 result=DBS_SUCCESS "
>> [01:3308]..ERROR:sock_op_close returned while unknown context is in
>> state: SMPD_
>> IDLE
>>
>> There is no problem if I run on 1 process. If it's >=4, then 
>> the error happens all the time. Moreover, it's a rather 
>> simple code and so there shouldn't be anything wrong with it. 
>> Why is this so?
>>
>> Btw, the RS 6000 book also mention a routine called 
>> MPE_IBCAST, which is a non-blocking version of MPI_BCAST. Is 
>> there a similar routine in MPICH2?
>>
>> Thank you very much
>>
>> Regards.
>>
>>
>>
>>
>>
>>     
>
>
>   
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: mpi_test14.f90
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20080507/9e9ba11e/attachment.diff>


More information about the mpich-discuss mailing list