[petsc-users] GPU speedup in Poisson solvers
Jed Brown
jed at jedbrown.org
Tue Sep 23 00:31:21 CDT 2014
Dominic Meiser <dmeiser at txcorp.com> writes:
> - To get reliable timing you should configure PETSc without debugging
> (i.e. --with-debugging=no)
> - The ILU preconditioning in your GPU benchmark is done on the CPU. The
> host-device data transfers are killing performance. Can you try to run
> with the additional option --pc_factor_mat_solver_packe cusparse? This
> will perform the preconditioning on the GPU.
> - If you're interested in running benchmarks in parallel you will need a
> few patches that are not yet in petsc/master. I can put together a
> branch that has the needed fixes.
And for the CPU version, considering using a configuration that makes
sense there. Like FMG with Gauss-Seidel or Chebyshev smoothers and an
error tolerance proportional to discretization error. You might find
that not enough time is spent on the fine grid to see a significant
speed-up.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 818 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20140922/cddd498c/attachment.pgp>
More information about the petsc-users
mailing list