[petsc-dev] Proper matrix size to choose when evaluating MatMult?
Junchao Zhang
jczhang at mcs.anl.gov
Fri Feb 21 15:51:28 CST 2020
Hello,
I want to evaluate MatMult on GPU. I took a 2M x 2M matrix and ran with 6
mpi ranks and 6 GPUs. It took about 0.9 seconds. A kernel launch or a
stream synchronization took about 10us. Compared with MatMult, they are
tiny. Does it mean we can ignore them? What is a proper size to evaluate
MatMult? I heard it is a few thousand rows per MPI rank. Why?
Thanks.
--Junchao Zhang
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20200221/37efd58e/attachment.html>
More information about the petsc-dev
mailing list