[petsc-users] Increasing parallel speed-up

Jed Brown jedbrown at mcs.anl.gov
Mon Jul 4 12:24:56 CDT 2011


On Mon, Jul 4, 2011 at 12:09, Haren, S.W. van (Steven) <vanharen at nrg.eu>wrote:

> one of the ksp solvers (Conjugate Gradient method with ILU(0)
> preconditioning) gives poor parallel performance for the
>

We need to identify how much the poor scaling is due to the preconditioner
changing (e.g. block Jacobi with ILU(0)) such that more iterations are
needed versus memory bandwidth. Run with -ksp_monitor or
-ksp_converged_reason to see the iterations. You can try -pc_type asm (or
algebraic multigrid using third-party libraries) to improve the iteration
count.

If you want help seeing what's going on, send -log_summary output for each
case.


> following settings:
>
> - number of unknowns ~ 2 million
> - 1, 2 and 4 processors (quad core CPU)
>

What kind? In particular, what memory bus and how many channels? Sparse
matrix kernels are overwhelmingly limited by memory performance, so extra
cores do very little good unless the memory system is very good (or the
matrix fits in cache).
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20110704/c891512a/attachment.htm>


More information about the petsc-users mailing list