The memory overhead is on the application end: I need to make a copy of the iobuffer, because it is potentially reused for each call, and if I do non-blocking calls the data needs to persist.
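
[For illustration, a minimal sketch of that copy under the stated constraint; the helper, buffer names, and sizes are hypothetical, while ncmpi_iput_vara_double and NC_ENOMEM are PnetCDF's actual API:]

    #include <stdlib.h>
    #include <string.h>
    #include <pnetcdf.h>

    /* iobuf is reused by the caller on every call, so its contents are
     * copied before posting the nonblocking put; PnetCDF only reads the
     * buffer later, inside ncmpi_wait_all, so the copy must stay
     * allocated until then (the caller frees it after the wait). */
    static int iput_copy(int ncid, int varid,
                         const MPI_Offset start[], const MPI_Offset count[],
                         const double *iobuf, MPI_Offset nelems,
                         double **copy_out, int *req)
    {
        double *copy = (double *) malloc(nelems * sizeof(double));
        if (copy == NULL) return NC_ENOMEM;
        memcpy(copy, iobuf, nelems * sizeof(double));
        *copy_out = copy;   /* free only after ncmpi_wait_all completes */
        return ncmpi_iput_vara_double(ncid, varid, start, count, copy, req);
    }

On Mon, Feb 20, 2012 at 2:01 PM, Wei-keng Liao <wkliao@ece.northwestern.edu> wrote: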
We have a few yet-to-be-published results (using FLASH and a climate
application) that show a significant improvement from using these
nonblocking (or perhaps we should call them "aggregating") APIs.
Here is a scenario in which these APIs are useful. Say an
application has many small variables defined, and they will all
be written to a file at about the same time, as in checkpointing.
The blocking APIs allow writing/reading only one variable at a
time, so performance can be poor. The nonblocking APIs aggregate
multiple requests into a larger one and hence can deliver better
performance.
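
[A minimal sketch of that checkpoint pattern; NVARS, the variable IDs, starts, counts, and buffers are hypothetical, while the iput/wait_all calls are PnetCDF's actual aggregating API:]

    #include <pnetcdf.h>

    #define NVARS 64   /* number of small checkpoint variables (hypothetical) */

    /* Queue one nonblocking put per variable, then flush them all with a
     * single collective call; PnetCDF combines the queued requests into
     * one larger I/O operation. */
    void checkpoint(int ncid, const int varids[NVARS],
                    MPI_Offset start[NVARS][1], MPI_Offset count[NVARS][1],
                    double *bufs[NVARS])
    {
        int reqs[NVARS], statuses[NVARS];
        for (int i = 0; i < NVARS; i++)
            ncmpi_iput_vara_double(ncid, varids[i], start[i], count[i],
                                   bufs[i], &reqs[i]);
        ncmpi_wait_all(ncid, NVARS, reqs, statuses);  /* one collective I/O */
    }
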
There is not much overhead for the additional memory management,
as the APIs introduce neither an extra memory copy nor any
additional buffer allocation. The essence of these APIs is to
create a new, combined MPI file view for the single collective
I/O at the end.
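
[Conceptually, that amounts to something like the following; this is only a sketch of the idea with hypothetical buffers, lengths, and offsets, not PnetCDF's actual internals:]

    #include <mpi.h>

    /* Merge two queued requests, each with its own user buffer and file
     * region, into one collective write with no intermediate copy: one
     * derived datatype describes the file regions (the combined file
     * view), another describes the user buffers in memory. */
    static void combined_write(MPI_File fh,
                               void *buf0, int len0, MPI_Aint foff0,
                               void *buf1, int len1, MPI_Aint foff1)
    {
        int lens[2] = { len0, len1 };           /* bytes per request      */
        MPI_Aint foffs[2] = { foff0, foff1 };   /* file displacements     */
        MPI_Aint maddrs[2];                     /* absolute buffer addresses */
        MPI_Get_address(buf0, &maddrs[0]);
        MPI_Get_address(buf1, &maddrs[1]);

        MPI_Datatype filetype, memtype;
        MPI_Type_create_hindexed(2, lens, foffs, MPI_BYTE, &filetype);
        MPI_Type_create_hindexed(2, lens, maddrs, MPI_BYTE, &memtype);
        MPI_Type_commit(&filetype);
        MPI_Type_commit(&memtype);

        MPI_File_set_view(fh, 0, MPI_BYTE, filetype, "native", MPI_INFO_NULL);
        MPI_File_write_all(fh, MPI_BOTTOM, 1, memtype, MPI_STATUS_IGNORE);

        MPI_Type_free(&filetype);
        MPI_Type_free(&memtype);
    }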
<span class="HOEnZb"><font color="#888888"><br>
<br>
Wei-keng<br>
</font></span><div class="HOEnZb"><div class="h5"><br>
<br>
On Feb 20, 2012, at 2:23 PM, Jim Edwards wrote:

> Hi Wei-keng,
>
> That's the answer that I thought I would get, and no, I guess there is no point in having one. Is there a demonstrable performance benefit of this non-blocking interface that would make it worth taking on the additional memory management it will require?
>
> Jim
>
> On Mon, Feb 20, 2012 at 1:18 PM, Wei-keng Liao <wkliao@ece.northwestern.edu> wrote:
> Hi Jim,
>
> The "non-blocking" APIs in pnetcdf are not truly asynchronous.
> They actually defer the I/O requests until ncmpi_wait_all.
> So, if there were a corresponding test call and it were called
> between the post of a nonblocking request and the wait, it would
> simply return false, indicating not yet complete.
>
> Given that, would you still like to see a test API available
> in pnetcdf? (It would not be too hard to add one.)
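
[For reference, such a test call might have a prototype like the one below, modeled on MPI_Test; this is hypothetical, not an existing PnetCDF function:]

    /* Hypothetical; not in PnetCDF.  Because requests are deferred
     * until ncmpi_wait_all, this would set *flag = 0 (not complete)
     * for any still-pending request. */
    int ncmpi_test(int ncid, int request, int *flag, int *status);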
>
> Wei-keng
>
> On Feb 20, 2012, at 1:47 PM, Jim Edwards wrote:
>
> > I am working on an async interface using pnetcdf and wondering why there is no analog to MPI_Test in the API.
> >
> > --
> > Jim Edwards
> >
> > CESM Software Engineering Group
> > National Center for Atmospheric Research
> > Boulder, CO
> > 303-497-1842
>
> --
> Jim Edwards

--
Jim Edwards