[petsc-dev] Kokkos/Crusher perforance

Mark Adams mfadams at lbl.gov
Sat Jan 22 10:20:35 CST 2022


On Sat, Jan 22, 2022 at 10:25 AM Jed Brown <jed at jedbrown.org> wrote:

> Mark Adams <mfadams at lbl.gov> writes:
>
> > On Fri, Jan 21, 2022 at 9:55 PM Barry Smith <bsmith at petsc.dev> wrote:
> >
> >>
> >> Interesting, Is this with all native Kokkos kernels or do some kokkos
> >> kernels use rocm?
> >>
> >
> > Ah, good question. I often run with tpl=0 but I did not specify here on
> > Crusher. In looking at the log files I see
> >
> -I/gpfs/alpine/csc314/scratch/adams/petsc/arch-olcf-crusher/externalpackages/git.kokkos-kernels/src/impl/tpls
> >
> > Here is a run with tpls turned off. These tpl includes are gone.
> >
> > It looks pretty much the same. A little slower but that could be noise.
>
> >
> ************************************************************************************************************************
> > ***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r
> -fCourier9' to print this document            ***
> >
> ************************************************************************************************************************
>
> We gotta say 160 chars because that's what we use now.
>
>
done

as far as streams, does it know to run on the GPU? You don't specify
something like -G 1 here for GPUs. I think you just get them all.


11:14 adams/aijkokkos-gpu-logging=
crusher:/gpfs/alpine/csc314/scratch/adams/petsc$ make
PETSC_DIR=/gpfs/alpine/csc314/scratch/adams/petsc
PETSC_ARCH=arch-olcf-crusher streams
cc -o MPIVersion.o -c -fPIC -Wall -Wwrite-strings -Wno-strict-aliasing
-Wno-unknown-pragmas -fstack-protector -Qunused-arguments
-fvisibility=hidden -g -O3
 -I/gpfs/alpine/csc314/scratch/adams/petsc/include
-I/gpfs/alpine/csc314/scratch/adams/petsc/arch-olcf-crusher/include
-I/opt/rocm-4.5.0/include    `pwd`/MPIVersion.c
Running streams with '/usr/bin/srun -p batch -N 1 -A csc314_crusher -t
00:10:00 ' using 'NPMAX=128'
1  53355.9207   Rate (MB/s)
2  39565.2208   Rate (MB/s) 0.741534
3  34538.3431   Rate (MB/s) 0.64732
4  32469.3375   Rate (MB/s) 0.608543
5  31041.1569   Rate (MB/s) 0.581776
6  30113.3826   Rate (MB/s) 0.564387
7  29562.5285   Rate (MB/s) 0.554063
8  29228.8090   Rate (MB/s) 0.547808
9  31474.3616   Rate (MB/s) 0.589895
10  31306.7647   Rate (MB/s) 0.586754
11  31147.4674   Rate (MB/s) 0.583768
12  31006.5008   Rate (MB/s) 0.581126
13  30859.4559   Rate (MB/s) 0.57837
14  30796.0587   Rate (MB/s) 0.577182
15  30604.4849   Rate (MB/s) 0.573591
16  30565.4340   Rate (MB/s) 0.572859
17  32421.9349   Rate (MB/s) 0.607654
18  34365.3424   Rate (MB/s) 0.644078
19  36289.4518   Rate (MB/s) 0.680139
20  38194.5300   Rate (MB/s) 0.715845
21  40160.4660   Rate (MB/s) 0.75269
22  42062.3931   Rate (MB/s) 0.788336
23  43890.2036   Rate (MB/s) 0.822593
24  45775.4680   Rate (MB/s) 0.857927
25  47708.8770   Rate (MB/s) 0.894163
26  49559.6810   Rate (MB/s) 0.928851
27  51457.5537   Rate (MB/s) 0.964421
28  53528.3420   Rate (MB/s) 1.00323
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20220122/7d6aac9e/attachment-0001.html>


More information about the petsc-dev mailing list