Can PETSc detect the number of CPUs on each computer node?

Satish Balay balay at mcs.anl.gov
Tue Jun 16 14:29:47 CDT 2009


On Tue, 16 Jun 2009, Alex Peyser wrote:

> On Tuesday 16 June 2009 02:29:14 pm Matthew Knepley wrote:
> >
> > This is a common misconception. In fact, most time is spent in MatVec or
> > BLAS1, neither of which benefit from MT BLAS.

> Interesting. At least my misconception is common. 
> That makes things tricky with ATLAS, since the number of threads is a 
> compile-time constant. I can't imagine it would be a good idea to have an 8x 
> BLAS running 8xs simultaneously -- unless the mpi jobs were all 
> unsynchronized. It may be only 10-20% of the time, but that's still a large 
> overlap of conflicting threads degrading performance.
> 
> I'll have to do some benchmarks. Is the 10-20% number still true for fairly 
> dense matrices?

Its just a number I pulled out of a hat [for sparse matrix
solves]. -log_summary would be the correct thing for a given
application.

If using MATDENSE - a much higher percentage of time will be in blas.

Satish

> 
> Ah, another layer of administration-code may now be required to properly 
> allocate jobs.
> 
> Alex
> 



More information about the petsc-users mailing list