[petsc-users] GPU local direct solve of penta-diagonal

Dominic Meiser dmeiser at txcorp.com
Thu Dec 12 17:29:12 CST 2013


Hi Karli,

On 12/12/2013 02:50 PM, Karl Rupp wrote:
>
> Hmm, this does not sound like something I would consider a good fit 
> for GPUs. With 16 MPI processes you have additional congestion of the 
> one or two GPUs per node, so you would have the rethink the solution 
> procedure as a whole.
>
Are you sure about that for Titan? Supposedly the K20X's can deal with 
multiple MPI processes hitting a single GPU pretty well using Hyper-Q. 
Paul has seen pretty good speed up with small GPU kernels simply by 
over-subscribing each GPU with 4 MPI processes.

See here:
http://blogs.nvidia.com/blog/2012/08/23/unleash-legacy-mpi-codes-with-keplers-hyper-q/


Cheers,
Dominic


-- 
Dominic Meiser
Tech-X Corporation
5621 Arapahoe Avenue
Boulder, CO 80303
USA
Telephone: 303-996-2036
Fax: 303-448-7756
www.txcorp.com

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20131212/374ced55/attachment.html>


More information about the petsc-users mailing list