[MPICH] MPI_Reduce_scatter

Calin Iaru calin at dolphinics.no
Thu Dec 27 03:24:38 CST 2007


It's not so easy because this is a third party RDMA integration which 
now is expected to be broken.

Rajeev Thakur wrote:
> 1.0.2p1 is a very old version of MPICH2. Some memory leaks have been fixed
> since then. Please try with the latest release, 1.0.6p1.
>
> Rajeev 
>
>   
>> -----Original Message-----
>> From: owner-mpich-discuss at mcs.anl.gov 
>> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Calin Iaru
>> Sent: Friday, December 21, 2007 9:32 AM
>> To: mpich-discuss at mcs.anl.gov
>> Subject: [MPICH] MPI_Reduce_scatter
>>
>> I am using PALLAS to stress MPI_Reduce_scatter. The error 
>> reported after millions of inner loops is:
>>
>> 3: MPI error  875666319 occurred
>> 3: Other MPI error, error stack:
>> 3: MPI_Reduce_scatter(1201): MPI_Reduce_scatter(sbuf=0x2aaaabdfb010,
>> rbuf=0x2aaaac1fc010, rcnts=0x176e1850, MPI_INT, MPI_SUM, 
>> comm=0x84000000) failed
>> 3: MPIR_Reduce_scatter(372):
>> 3: MPIC_Send(48):
>> 3: MPIC_Wait(321):
>> 3: MPIDI_CH3_Progress(115): Unable to make message passing progress
>> 3: handle_read(280):
>> 3: MPIDI_CH3U_Handle_recv_pkt(250): failure occurred while 
>> allocating memory for a request object
>> 3: aborting job:
>> 3: application called MPI_Abort(MPI_COMM_WORLD, 875666319) - process 3
>>
>>
>> The library is 1.0.2p1 and I would like to know if there are 
>> some changes that would fix this issue.
>>
>> Best regards,
>>     Calin
>>
>>
>>     
>
>   




More information about the mpich-discuss mailing list