Performance Issues on ccNuma-System
Barry Smith
bsmith at mcs.anl.gov
Mon Oct 13 15:24:21 CDT 2008
The sparse matrix is MUCH to big for the cache so has to stream
through from memory,
thus the huge number of "cache misses". This same performance issue
occurs on
all modern systems.
Barry
On Oct 13, 2008, at 7:12 AM, Christoph Statz wrote:
> Dear PETSc-users,
>
> i'm trying to work with PETSc on a ccNuma-system, where i am
> confronted with severe performance problems.
> Is there anyone using PETSc on e.g. a SGI Altix System?
> Which are the best kernels to use on cache coherent systems?
> The fortran kernels produces many cache misses (in functions like
> fsolve and fmatmul) slowing down a 3GFLOP/s machine to about
> 200MFLOP/s .
> Has anyone any advice to increase speed on ccNuma-system?
>
> Sincerly,
>
> Christoph Statz
>
> --
> Christoph Statz
>
> Institut für Nachrichtentechnik
> Technische Universität Dresden
> 01062 Dresden
>
> Email: christoph.statz at mailbox.tu-dresden.de
> Phone: +49 351 463 32287
>
>
>
More information about the petsc-users
mailing list