<html>

  <head>

    <meta content="text/html; charset=ISO-8859-1"

      http-equiv="Content-Type">

  </head>

  <body bgcolor="#FFFFFF" text="#000000">

    <div class="moz-cite-prefix">Hi Karli,<br>

      <br>

      On 12/12/2013 02:50 PM, Karl Rupp wrote:<br>

    </div>

    <blockquote cite="mid:52AA2FA6.8080802@mcs.anl.gov" type="cite"><br>

      Hmm, this does not sound like something I would consider a good

      fit for GPUs. With 16 MPI processes you have additional congestion

      of the one or two GPUs per node, so you would have the rethink the

      solution procedure as a whole.<br>

      <br>

    </blockquote>

    Are you sure about that for Titan? Supposedly the K20X's can deal

    with multiple MPI processes hitting a single GPU pretty well using

    Hyper-Q. Paul has seen pretty good speed up with small GPU kernels

    simply by over-subscribing each GPU with 4 MPI processes.<br>

    <br>

    See here:<br>

    <meta http-equiv="content-type" content="text/html;

      charset=ISO-8859-1">

    <a

href="http://blogs.nvidia.com/blog/2012/08/23/unleash-legacy-mpi-codes-with-keplers-hyper-q/">http://blogs.nvidia.com/blog/2012/08/23/unleash-legacy-mpi-codes-with-keplers-hyper-q/</a><br>

    <br>

    <br>

    Cheers,<br>

    Dominic<br>

    <br>

    <br>

    <pre class="moz-signature" cols="72">-- 

Dominic Meiser

Tech-X Corporation

5621 Arapahoe Avenue

Boulder, CO 80303

USA

Telephone: 303-996-2036

Fax: 303-448-7756

<a class="moz-txt-link-abbreviated" href="http://www.txcorp.com">www.txcorp.com</a></pre>

  </body>

</html>