[petsc-users] log_view for the master branch

Matthew Knepley knepley at gmail.com
Wed May 3 13:27:47 CDT 2017


On Wed, May 3, 2017 at 1:24 PM, Kong, Fande <fande.kong at inl.gov> wrote:

> Hi,
>
> I am using the current master branch. The log_view gives me the summary as
> follows, and the "WARNING" box repeats three times. Are we intending to do
> so?
>

Yep, Barry is Really Freaking Serious@ that you should not interpret these
numbers without optimization on.

   Matt


> Thanks,
>
> Fande,
>
>
> ************************************************************
> ************************************************************
> ***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r
> -fCourier9' to print this document            ***
> ************************************************************
> ************************************************************
>
> ---------------------------------------------- PETSc Performance Summary:
> ----------------------------------------------
>
>
>
>       ##########################################################
>       #                                                        #
>       #                          WARNING!!!                    #
>       #                                                        #
>       #   This code was compiled with a debugging option,      #
>       #   To get timing results run ./configure                #
>       #   using --with-debugging=no, the performance will      #
>       #   be generally two or three times faster.              #
>       #                                                        #
>       ##########################################################
>
>
> ./ex29 on a arch-darwin-c-debug-master named FN604208 with 1 processor, by
> kongf Wed May  3 12:28:23 2017
> Using Petsc Development GIT revision: v3.7.6-3529-g76c7fe0  GIT Date:
> 2017-05-03 08:46:23 -0500
>
>                          Max       Max/Min        Avg      Total
> Time (sec):           1.350e-02      1.00000   1.350e-02
> Objects:              4.100e+01      1.00000   4.100e+01
> Flop:                 3.040e+02      1.00000   3.040e+02  3.040e+02
> Flop/sec:            2.251e+04      1.00000   2.251e+04  2.251e+04
> Memory:               1.576e+05      1.00000              1.576e+05
> MPI Messages:         0.000e+00      0.00000   0.000e+00  0.000e+00
> MPI Message Lengths:  0.000e+00      0.00000   0.000e+00  0.000e+00
> MPI Reductions:       0.000e+00      0.00000
>
> Flop counting convention: 1 flop = 1 real number operation of type
> (multiply/divide/add/subtract)
>                             e.g., VecAXPY() for real vectors of length N
> --> 2N flop
>                             and VecAXPY() for complex vectors of length N
> --> 8N flop
>
> Summary of Stages:   ----- Time ------  ----- Flop -----  --- Messages
> ---  -- Message Lengths --  -- Reductions --
>                         Avg     %Total     Avg     %Total   counts
> %Total     Avg         %Total   counts   %Total
>  0:      Main Stage: 1.3483e-02  99.8%  3.0400e+02 100.0%  0.000e+00
> 0.0%  0.000e+00        0.0%  0.000e+00   0.0%
>
> ------------------------------------------------------------
> ------------------------------------------------------------
> See the 'Profiling' chapter of the users' manual for details on
> interpreting output.
> Phase summary info:
>    Count: number of times phase was executed
>    Time and Flop: Max - maximum over all processors
>                    Ratio - ratio of maximum to minimum over all processors
>    Mess: number of messages sent
>    Avg. len: average message length (bytes)
>    Reduct: number of global reductions
>    Global: entire computation
>    Stage: stages of a computation. Set stages with PetscLogStagePush() and
> PetscLogStagePop().
>       %T - percent time in this phase         %F - percent flop in this
> phase
>       %M - percent messages in this phase     %L - percent message lengths
> in this phase
>       %R - percent reductions in this phase
>    Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over
> all processors)
> ------------------------------------------------------------
> ------------------------------------------------------------
>
>
>       ##########################################################
>       #                                                        #
>       #                          WARNING!!!                    #
>       #                                                        #
>       #   This code was compiled with a debugging option,      #
>       #   To get timing results run ./configure                #
>       #   using --with-debugging=no, the performance will      #
>       #   be generally two or three times faster.              #
>       #                                                        #
>       ##########################################################
>
>
> Event                Count      Time (sec)
> Flop                             --- Global ---  --- Stage ---   Total
>                    Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len
> Reduct  %T %F %M %L %R  %T %F %M %L %R Mflop/s
> ------------------------------------------------------------
> ------------------------------------------------------------
>
> --- Event Stage 0: Main Stage
>
> KSPGMRESOrthog         1 1.0 1.3617e-04 1.0 3.50e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  1 12  0  0  0   1 12  0  0  0     0
> KSPSetUp               1 1.0 4.1097e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  3  0  0  0  0   3  0  0  0  0     0
> KSPSolve               1 1.0 1.4596e-03 1.0 2.85e+02 1.0 0.0e+00 0.0e+00
> 0.0e+00 11 94  0  0  0  11 94  0  0  0     0
> VecMDot                1 1.0 1.7958e-05 1.0 1.70e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  6  0  0  0   0  6  0  0  0     1
> VecNorm                2 1.0 1.9152e-05 1.0 3.40e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0 11  0  0  0   0 11  0  0  0     2
> VecScale               1 1.0 4.4771e-05 1.0 9.00e+00 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  3  0  0  0   0  3  0  0  0     0
> VecCopy                1 1.0 1.2218e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> VecSet                10 1.0 7.3789e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  1  0  0  0  0   1  0  0  0  0     0
> VecAXPY                1 1.0 6.3397e-05 1.0 1.80e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  6  0  0  0   0  6  0  0  0     0
> VecMAXPY               2 1.0 4.8989e-05 1.0 3.60e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0 12  0  0  0   0 12  0  0  0     1
> VecAssemblyBegin       2 1.0 7.5148e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> VecAssemblyEnd         2 1.0 7.5093e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> VecNormalize           2 1.0 9.5865e-05 1.0 4.30e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  1 14  0  0  0   1 14  0  0  0     0
> MatMult                1 1.0 1.3781e-05 1.0 5.70e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0 19  0  0  0   0 19  0  0  0     4
> MatSolve               2 1.0 7.4019e-04 1.0 1.14e+02 1.0 0.0e+00 0.0e+00
> 0.0e+00  5 38  0  0  0   5 38  0  0  0     0
> MatLUFactorNum         1 1.0 2.8001e-05 1.0 1.90e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  6  0  0  0   0  6  0  0  0     1
> MatILUFactorSym        1 1.0 9.1556e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  1  0  0  0  0   1  0  0  0  0     0
> MatAssemblyBegin       2 1.0 7.7938e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> MatAssemblyEnd         2 1.0 4.5131e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> MatGetRowIJ            1 1.0 4.0429e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
> MatGetOrdering         1 1.0 1.7907e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  1  0  0  0  0   1  0  0  0  0     0
> PCSetUp                1 1.0 5.8597e-04 1.0 1.90e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  4  6  0  0  0   4  6  0  0  0     0
> PCApply                2 1.0 7.8497e-04 1.0 1.14e+02 1.0 0.0e+00 0.0e+00
> 0.0e+00  6 38  0  0  0   6 38  0  0  0     0
> ------------------------------------------------------------
> ------------------------------------------------------------
>
> Memory usage is given in bytes:
>
> Object Type          Creations   Destructions     Memory  Descendants' Mem.
> Reports information only for process 0.
>
> --- Event Stage 0: Main Stage
>
>        Krylov Solver     1              1        18408     0.
>      DMKSP interface     1              1          648     0.
>               Vector    12             12        19224     0.
>       Vector Scatter     2              2         1312     0.
>               Matrix     2              2         7380     0.
>     Distributed Mesh     3              3        14960     0.
>            Index Set     7              7         5632     0.
>    IS L to G Mapping     2              2         1368     0.
> Star Forest Bipartite Graph     6              6         4864     0.
>      Discrete System     3              3         2596     0.
>       Preconditioner     1              1         1000     0.
>               Viewer     1              0            0     0.
> ============================================================
> ============================================================
> Average time to get PetscTime(): 4.50294e-08
> #PETSc Option Table entries:
> -log_view
> #End of PETSc Option Table entries
> Compiled without FORTRAN kernels
> Compiled with full precision matrices (default)
> sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8
> sizeof(PetscScalar) 8 sizeof(PetscInt) 4
> Configure options: --download-hypre=1 --with-ssl=0 --with-debugging=yes
> --with-pic=1 --with-shared-libraries=1 --with-cc=mpicc --with-cxx=mpicxx
> --with-fc=mpif90 --download-fblaslapack=1 --download-metis=1
> --download-parmetis=1 --download-superlu_dist=1 --download-scalapack=1
> --download-mumps=1 CC=mpicc CXX=mpicxx FC=mpif90 F77=mpif77 F90=mpif90
> CFLAGS="-fPIC -fopenmp" CXXFLAGS="-fPIC -fopenmp" FFLAGS="-fPIC -fopenmp"
> FCFLAGS="-fPIC -fopenmp" F90FLAGS="-fPIC -fopenmp" F77FLAGS="-fPIC
> -fopenmp" PETSC_ARCH=arch-darwin-c-debug-master
> -----------------------------------------
> Libraries compiled on Wed May  3 11:04:44 2017 on FN604208
> Machine characteristics: Darwin-15.5.0-x86_64-i386-64bit
> Using PETSc directory: /Users/kongf/projects/petsc
> Using PETSc arch: arch-darwin-c-debug-master
> -----------------------------------------
>
> Using C compiler: mpicc -fPIC -fopenmp   -g3  ${COPTFLAGS} ${CFLAGS}
> Using Fortran compiler: mpif90 -fPIC -fopenmp  -g   ${FOPTFLAGS} ${FFLAGS}
> -----------------------------------------
>
> Using include paths: -I/Users/kongf/projects/petsc/
> arch-darwin-c-debug-master/include -I/Users/kongf/projects/petsc/include
> -I/Users/kongf/projects/petsc/include -I/Users/kongf/projects/petsc/
> arch-darwin-c-debug-master/include -I/opt/X11/include
> -----------------------------------------
>
> Using C linker: mpicc
> Using Fortran linker: mpif90
> Using libraries: -Wl,-rpath,/Users/kongf/projects/petsc/arch-darwin-c-debug-master/lib
> -L/Users/kongf/projects/petsc/arch-darwin-c-debug-master/lib -lpetsc
> -Wl,-rpath,/Users/kongf/projects/petsc/arch-darwin-c-debug-master/lib
> -L/Users/kongf/projects/petsc/arch-darwin-c-debug-master/lib
> -Wl,-rpath,/opt/X11/lib -L/opt/X11/lib -Wl,-rpath,/opt/moose/mpich/mpich-3.2/clang-opt/lib
> -L/opt/moose/mpich/mpich-3.2/clang-opt/lib -Wl,-rpath,/opt/moose/llvm-3.9.0/lib
> -L/opt/moose/llvm-3.9.0/lib -Wl,-rpath,/opt/moose/llvm-3.9.0/lib/clang/3.9.0/lib/darwin
> -L/opt/moose/llvm-3.9.0/lib/clang/3.9.0/lib/darwin
> -Wl,-rpath,/opt/moose/gcc-6.2.0/lib/gcc/x86_64-apple-darwin15.6.0/6.2.0
> -L/opt/moose/gcc-6.2.0/lib/gcc/x86_64-apple-darwin15.6.0/6.2.0
> -Wl,-rpath,/opt/moose/gcc-6.2.0/lib -L/opt/moose/gcc-6.2.0/lib
> -Wl,-rpath,/opt/moose/llvm-3.9.0/bin/../lib/clang/3.9.0/lib/darwin
> -L/opt/moose/llvm-3.9.0/bin/../lib/clang/3.9.0/lib/darwin -lsuperlu_dist
> -lHYPRE -lcmumps -ldmumps -lsmumps -lzmumps -lmumps_common -lpord
> -lscalapack -lflapack -lfblas -lparmetis -lmetis -lX11 -lclang_rt.osx
> -lmpifort -lgfortran -lgomp -lgcc_ext.10.5 -lquadmath -lm -lclang_rt.osx
> -lmpicxx -lc++ -lclang_rt.osx -ldl -lmpi -lpmpi -lomp -lSystem
> -lclang_rt.osx -ldl
> -----------------------------------------
>
>
>
>       ##########################################################
>       #                                                        #
>       #                          WARNING!!!                    #
>       #                                                        #
>       #   This code was compiled with a debugging option,      #
>       #   To get timing results run ./configure                #
>       #   using --with-debugging=no, the performance will      #
>       #   be generally two or three times faster.              #
>       #                                                        #
>       ##########################################################
>
>
>


-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20170503/f4e67a99/attachment-0001.html>


More information about the petsc-users mailing list