[petsc-users] Configuring petsc with MPI on ubuntu quad-core
Barry Smith
bsmith at mcs.anl.gov
Thu Feb 3 13:30:31 CST 2011
On Feb 3, 2011, at 1:25 PM, Jed Brown wrote:
> On Thu, Feb 3, 2011 at 16:17, Barry Smith <bsmith at mcs.anl.gov> wrote:
> In src/benchmarks/streams you can run make test and have it generate a report of how the streams benchmark is able to utilize the memory bandwidth. Run that and send us the output (run with just 2 threads).
>
> That test does no software prefetch, is not vectorized (look at the assembly, you want all movapd and addpd/mulpd with memory addresses instead of addsd/mulsd or addpd/mulpd operating only on register operands), and is not NUMA-aware (which depending on the hardware, can cause performance problems). The output is still relevant and indicates what can be done without tuning, but does not accurately represent the peak achievable by the hardware.
Completely true. If you are aware of a "sophisticated" portable streams tester please add it to that directory. I'd love to have it.
It gives an idea of what "code just compiled by the compiler can do" which is what we need in this situation, in particular what happens in going from 1 process to 2 processes.
Barry
More information about the petsc-users
mailing list