[mpich-discuss] problem with collective on sub-communicator

Miguel Oliveira m.a.oliveira at coimbra.lip.pt
Thu Nov 10 15:14:40 CST 2011


Hi all,

I wrote a very simple master/slave code in MPI and I'm having problems with MPI_Reduce, or even, MPI_Barrier, inside a subset of the world communicator.
These operations don't seem to be waiting for all the processes in the subgroup.

The code is a straightforward master/slave case where the master generates random numbers when requested and then retrieves a reduction of the sum of these
done on the slaves.

When run on more than three processes sometimes it happens that the message after the reduction, done from one of the slaves to inform the master of the final
result gets to the master before some of the requests for random numbers... This ought to be impossible with a blocking reduction...

Am I missing something?

Code is attached.

Help is appreciated.

Cheers,

MAO

-------------- next part --------------
A non-text attachment was scrubbed...
Name: master_slave.c
Type: application/octet-stream
Size: 1810 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20111110/f4e5be13/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: smime.p7s
Type: application/pkcs7-signature
Size: 1580 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20111110/f4e5be13/attachment.bin>


More information about the mpich-discuss mailing list