[mpich-discuss] Assertion failure from too many MPI_Gets between fences
Dave Goodell
goodell at mcs.anl.gov
Fri Jan 7 16:46:41 CST 2011
On Jan 7, 2011, at 4:03 PM CST, Jeremiah Willcock wrote:
> On Fri, 7 Jan 2011, Dave Goodell wrote:
>
>> On Jan 7, 2011, at 3:44 PM CST, Jeremiah Willcock wrote:
>>
>>> On Fri, 7 Jan 2011, Dave Goodell wrote:
>>>
>>>> On Jan 7, 2011, at 12:56 PM CST, Jeremiah Willcock wrote:
>> Not off the top of my head. It seems like you are using a small-ish test program to check this. Can you send that to me so that I can reproduce this for myself and play with the bug?
>
> I don't really have a small program for this, just part of a larger one that I was putting print statements in and hacking on. The relevant part of the code just creates a window for an array (with different data on different ranks), then does MPI_Gets on random parts of it to implement a large gather operation. A code such as row-distributed sparse matrix-vector multiplication would have the same kind of access pattern.
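(For my own reference, and to make sure we're picturing the same pattern, here is a rough sketch of what I imagine the relevant part looks like. The names, sizes, and random targeting are my own invention, not your code.)

    #include <mpi.h>
    #include <stdlib.h>

    int main(int argc, char **argv)
    {
        int rank, size, i;
        MPI_Init(&argc, &argv);
        MPI_Comm_rank(MPI_COMM_WORLD, &rank);
        MPI_Comm_size(MPI_COMM_WORLD, &size);

        int local_n = 1000;                   /* elements owned by each rank */
        int num_gets = 1000000;               /* many gets in a single epoch */
        double *local  = malloc(local_n * sizeof(double));
        double *result = malloc(num_gets * sizeof(double));
        for (i = 0; i < local_n; ++i)
            local[i] = rank * local_n + i;    /* different data on each rank */

        MPI_Win win;
        MPI_Win_create(local, local_n * sizeof(double), sizeof(double),
                       MPI_INFO_NULL, MPI_COMM_WORLD, &win);

        MPI_Win_fence(0, win);
        for (i = 0; i < num_gets; ++i) {
            int target = rand() % size;       /* random rank */
            MPI_Aint disp = rand() % local_n; /* random element on that rank */
            MPI_Get(&result[i], 1, MPI_DOUBLE, target, disp,
                    1, MPI_DOUBLE, win);
        }
        MPI_Win_fence(0, win);                /* all gets complete here */

        MPI_Win_free(&win);
        free(local);
        free(result);
        MPI_Finalize();
        return 0;
    }

If that is close to the real pattern, it should be easy to turn into a standalone reproducer.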
I won't be able to dig into this today, so I've created a ticket to track it: https://trac.mcs.anl.gov/projects/mpich2/ticket/1156
>>> Inserting fences periodically in my code fixes the problem, but the fence frequency needed is proportional to the number of ranks in the job. I think the MPI implementation should automatically do whatever flow control it needs to avoid running out of memory, no matter how many requests the application feeds in.
>>
>> I agree, the MPI implementation should take care of that, within reason. But forcing an MPI_Get to be nonblocking will require a careful reading of the MPI standard to ensure that's valid behavior. I think I can construct scenarios where an MPI_Get that blocks would cause a deadlock...
>
> I believe that (like everything else in MPI) they can be nonblocking or blocking, at the implementation's choice, but I don't know for sure. Remember that you can't assume a get has completed until the next fence, so you can't depend on seeing the answer before then (which I think is what you were worried about as a source of deadlocks). I treat the MPI_Gets as nonblocking in my application; that's why I start a large number of them and then use a fence to complete them all before using the results.
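(Just so we're talking about the same workaround: I assume the periodic fences look roughly like the sketch below, with the batch size tuned by hand. The function and parameter names are mine, not from your code.)

    #include <mpi.h>

    /* Issue num_gets single-element gets, but fence every `batch` gets so
     * the implementation gets a chance to drain outstanding requests.
     * MPI_Win_fence both completes the current epoch and opens a new one,
     * so the gets before and after each fence remain valid. */
    static void batched_gather(double *result, const int *targets,
                               const MPI_Aint *disps, int num_gets,
                               int batch, MPI_Win win)
    {
        int i;
        MPI_Win_fence(0, win);
        for (i = 0; i < num_gets; ++i) {
            MPI_Get(&result[i], 1, MPI_DOUBLE, targets[i], disps[i],
                    1, MPI_DOUBLE, win);
            if ((i + 1) % batch == 0)
                MPI_Win_fence(0, win);   /* complete this batch */
        }
        MPI_Win_fence(0, win);           /* complete the final partial batch */
    }

The unsatisfying part, as you note, is that a workable batch size ends up depending on the number of ranks, which is exactly the kind of flow-control decision the implementation ought to make for you.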
MPI-2.2, page 339, lines 13-14: "These operations are nonblocking: the call initiates the transfer, but the transfer may continue after the call returns."
This language is weaker than I would like: the clarifying statement after the colon never says that the call cannot block, which implicitly waters down the natural MPI meaning of "nonblocking". But I think the intent is clear, namely that the call should not block the caller waiting on the action of another process. After further thought I can't come up with any realistic example where a blocking-for-flow-control MPI_Get causes a deadlock, but I think that behavior is still intended to be disallowed by the standard.
>> Without knowing exactly why it's failing, other than some sort of request allocation problem, it's hard to say what the right fix is.
>
> Is there a patch I could apply to MPICH to help diagnose the problem? A branch I could switch to for testing?
Eventually, but nothing that I have cooked up yet.
-Dave