<html><head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
</head>
<body>
<p>Hi Clément,</p>
<p>What benchmark are you using to generate these numbers?</p>
<p>My first guess would be a difference in polling strategy (how
frequently HG_Progress() is being called, and with what timeout
value), and how well the provider handles that.</p>
<p>One quick and dirty way to test this theory would be to set
margo_init_info -> hg_init_info -> na_init_info -> progress_mode to
NA_NO_BLOCK before initializing Margo with margo_init_ext(). There is
an example that does this here:</p>
<p><a class="moz-txt-link-freetext" href="https://github.com/mochi-hpc-experiments/mochi-tests/blob/main/perf-regression/margo-p2p-latency.c#L105">https://github.com/mochi-hpc-experiments/mochi-tests/blob/main/perf-regression/margo-p2p-latency.c#L105</a></p>
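<p>For what it's worth, the relevant part of that example boils down
to something like the sketch below (untested, assuming a Mercury 2.x /
Margo 0.9.x build; the transport string and error handling are just
placeholders):</p>
<pre>
#include &lt;margo.h&gt;

/* force Mercury's NA layer into busy-polling (NA_NO_BLOCK) before
 * starting Margo */
struct hg_init_info    hii = HG_INIT_INFO_INITIALIZER;
struct margo_init_info mii = {0};
margo_instance_id      mid;

hii.na_init_info.progress_mode = NA_NO_BLOCK; /* never block in the provider */
mii.hg_init_info = &amp;hii;

mid = margo_init_ext("ofi+sockets://", MARGO_SERVER_MODE, &amp;mii);
if (mid == MARGO_INSTANCE_NULL) {
    /* handle initialization failure */
}
</pre>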
<p><span class="pl-smi">That's tedious from an API perspective, but
the reason it might be informative as a quick hack is that it
will force Mercury to busy poll on the underlying transport no
matter what margo or other callers are actually asking it to
do. It effectively short circuits any higher level polling
strategy decisions.<br>
</span></p>
<p><span class="pl-smi">Depending on what that tells us, we can go
from there. I suspect that the sockets provider mechanism for
waiting for events (as opposed to just polling for events) might
be problematic.<br>
</span></p>
<p><span class="pl-smi">thanks,</span></p>
<p><span class="pl-smi">-Phil</span></p>
<div class="moz-cite-prefix">On 9/2/21 8:29 AM, Clement Barthelemy
wrote:<br>
</div>
<blockquote type="cite" cite="mid:1720497516.7518024.1630585783155.JavaMail.zimbra@inria.fr">
<pre class="moz-quote-pre" wrap="">Hello all,
I did some latency measurements to compare Mercury and Margo with different providers; the results are below:

                 Mercury (s/RPC)   Margo (s/RPC)
ofi+psm2         6.21e-5           5.01e-5
ofi+tcp;ofi_rxm  8.20e-5           9.55e-5
ofi+sockets      7.54e-5           2.08e-2 (!)

As you can see, Margo with the sockets provider is 250 times slower than the rest. I first suspected libfabric, but Mercury does not have the problem. Do you know what could be causing this?
I've tested with Margo 0.9.5, Mercury 2.0.1 and libfabric 1.12.1 & 1.13.1.
Thanks,
Clément
</pre>
</blockquote>
</body>
</html>