[petsc-users] Cray perftools

Jed Brown jed at jedbrown.org
Mon Feb 21 16:45:49 CST 2022


If you can share before/after output from -log_view, it would likely help localize.

Another unintrusive thing (if you're allowed to run Linux perf) is to

$ perf record --call-graph dwarf -F99 ./app
[... runs ...]
$ perf script | stackcollapse-perf | flamegraph > flame.svg

and open flame.svg in a browser (it's interactive). This uses the flamegraph tools (https://github.com/brendangregg/FlameGraph). You can direct `perf script` to a file and share that if you can't/won't install flamegraph. This doesn't require compiling any special way and yet helps understand where time is spent.

Adrian Croucher <a.croucher at auckland.ac.nz> writes:

> hi,
>
> We have our PETSc-based code compiled on a Cray XC-50 machine, and it 
> has just recently started running about 2.5 times slower on there. 
> Neither the code nor PETSc has been recompiled lately.
>
> Turning the PETSc logging on, it appears to be spending more time on I/O 
> than it used to.
>
> The cluster admins have suggested we rebuild with the Cray "perftools" 
> module loaded to get profiling info. It's a slight hassle to rebuild 
> everything, so I wondered, would this actually tell us anything that we 
> don't already know from the PETSc logs?
>
> - Adrian
>
> -- 
> Dr Adrian Croucher
> Senior Research Fellow
> Department of Engineering Science
> University of Auckland, New Zealand
> email: a.croucher at auckland.ac.nz
> tel: +64 (0)9 923 4611


More information about the petsc-users mailing list