[petsc-users] Weird behavior for log_summary

Michele Rosso mrosso at uci.edu
Mon Jun 8 13:46:47 CDT 2015


Jed,

In petscconf.h I have

#ifndef PETSC_USE_LOG
#define PETSC_USE_LOG 1
#endif

so I guess that is not the problem.
I ran ex50 and attached the output: it does print the summary.
So I guess there is something wrong with my code.
I call mpi_init before petsc_initialize and then I finalize everything
with petsc_finalize. I do not believe I added any other command that
could turn off the log_summary output.
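
For reference, the pattern I am using is roughly the following (a minimal
C sketch for illustration only; my actual code is Fortran and error
checking is omitted):

    #include <petsc.h>

    int main(int argc, char **argv)
    {
      /* MPI is initialized by the application before PETSc. */
      MPI_Init(&argc, &argv);

      /* PETSc detects that MPI is already initialized and reuses it. */
      PetscInitialize(&argc, &argv, NULL, NULL);

      /* ... set up and solve ... */

      /* The -log_summary report is printed during PetscFinalize(). */
      PetscFinalize();

      /* Because MPI was initialized by the caller, PetscFinalize() does not
         call MPI_Finalize(); the application must do it. */
      MPI_Finalize();
      return 0;
    }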

Thanks,
Michele


On Mon, 2015-06-08 at 12:02 -0600, Jed Brown wrote: 

> Michele Rosso <mrosso at uci.edu> writes:
> 
> > Is there any external software I should link at compile time to allow
> > PETSc to provide log infos?
> > Also, is there any define in petscconf.h that controls profiling?
> 
> There is PETSC_USE_LOG, but that should always be on, in which case
> everything is run-time rather than compile-time.  When you run a PETSc
> example, does the log_summary correctly contain events?  Is it possible
> that your code is turning off events somewhere?
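
As a point of reference for the question above, this is my understanding of
the run-time logging controls, written as a minimal C sketch (illustration
only, not taken from my code; the event and class names are made up and
error checking is omitted):

    #include <petsc.h>

    int main(int argc, char **argv)
    {
      PetscLogEvent my_event;
      PetscClassId  classid;

      /* Run with -log_summary; with PETSC_USE_LOG compiled in, this
         run-time option is what enables the report. */
      PetscInitialize(&argc, &argv, NULL, NULL);

      PetscClassIdRegister("UserClass", &classid);
      PetscLogEventRegister("UserSolve", classid, &my_event);

      PetscLogEventBegin(my_event, 0, 0, 0, 0);
      /* ... work to be timed ... */
      PetscLogEventEnd(my_event, 0, 0, 0, 0);

      /* A call like the following, hidden somewhere in the code, would remove
         the event from the -log_summary table even though logging is
         compiled in: */
      /* PetscLogEventDeactivate(my_event); */

      PetscFinalize();
      return 0;
    }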



-------------- next part --------------
  0 KSP Residual norm 4.419256232155e+01 
  1 KSP Residual norm 5.451780353427e-01 
  2 KSP Residual norm 2.277974394337e-02 
  3 KSP Residual norm 2.972141446447e-04 
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./ex50 on a gnu-opt-32idx named À‘þÿÿ with 16 processors, by mrosso Mon Jun  8 13:25:36 2015
Using Petsc Release Version 3.5.4, May, 23, 2015 

                         Max       Max/Min        Avg      Total 
Time (sec):           3.684e+00      1.00250   3.680e+00
Objects:              5.830e+02      1.00000   5.830e+02
Flops:                4.062e+08      1.00571   4.045e+08  6.472e+09
Flops/sec:            1.103e+08      1.00588   1.099e+08  1.759e+09
MPI Messages:         1.823e+03      2.00550   1.372e+03  2.196e+04
MPI Message Lengths:  1.306e+06      1.99861   7.158e+02  1.572e+07
MPI Reductions:       1.307e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total 
 0:      Main Stage: 3.6802e+00 100.0%  6.4725e+09 100.0%  2.196e+04 100.0%  7.158e+02      100.0%  1.306e+03  99.9% 

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %F - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flops                             --- Global ---  --- Stage ---   Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len Reduct  %T %F %M %L %R  %T %F %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

KSPGMRESOrthog        93 1.0 2.7896e-01 1.2 8.37e+07 1.0 0.0e+00 0.0e+00 9.3e+01  7 21  0  0  7   7 21  0  0  7  4778
KSPSetUp              21 1.0 5.7312e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+02  1  0  0  0 12   1  0  0  0 12     0
KSPSolve               1 1.0 3.6051e+00 1.0 4.06e+08 1.0 2.2e+04 7.1e+02 1.3e+03 98100100 99 99  98100100 99 99  1795
VecMDot               93 1.0 1.3218e-01 1.1 4.18e+07 1.0 0.0e+00 0.0e+00 9.3e+01  3 10  0  0  7   3 10  0  0  7  5042
VecNorm              103 1.0 5.8708e-02 4.9 9.84e+06 1.0 0.0e+00 0.0e+00 1.0e+02  1  2  0  0  8   1  2  0  0  8  2671
VecScale             103 1.0 1.5907e-02 1.2 4.92e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  1  0  0  0  4929
VecCopy               46 1.0 2.1967e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0   1  0  0  0  0     0
VecSet               188 1.0 2.1128e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0   1  0  0  0  0     0
VecAXPY               19 1.0 1.0012e-02 1.3 1.93e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  3076
VecAYPX              288 1.0 1.1642e-01 1.1 1.41e+07 1.0 0.0e+00 0.0e+00 0.0e+00  3  3  0  0  0   3  3  0  0  0  1924
VecAXPBYCZ           144 1.0 9.7574e-02 1.1 2.81e+07 1.0 0.0e+00 0.0e+00 0.0e+00  3  7  0  0  0   3  7  0  0  0  4592
VecMAXPY             103 1.0 1.8963e-01 1.5 5.04e+07 1.0 0.0e+00 0.0e+00 0.0e+00  4 12  0  0  0   4 12  0  0  0  4238
VecAssemblyBegin       1 1.0 7.0770e-0348.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAssemblyEnd         1 1.0 1.2207e-04 8.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin      407 1.0 7.0384e-03 1.8 0.00e+00 0.0 1.9e+04 8.1e+02 0.0e+00  0  0 86 97  0   0  0 86 97  0     0
VecScatterEnd        407 1.0 5.3825e-03 1.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize         103 1.0 7.4179e-02 2.9 1.48e+07 1.0 0.0e+00 0.0e+00 1.0e+02  1  4  0  0  8   1  4  0  0  8  3171
MatMult              318 1.0 6.5145e-01 1.1 1.18e+08 1.0 1.5e+04 9.4e+02 0.0e+00 17 29 70 91  0  17 29 70 91  0  2880
MatMultAdd            36 1.0 3.7364e-02 1.0 6.32e+06 1.0 1.2e+03 3.3e+02 0.0e+00  1  2  5  3  0   1  2  5  3  0  2697
MatMultTranspose      45 1.0 4.8568e-02 1.1 7.90e+06 1.0 1.5e+03 3.3e+02 0.0e+00  1  2  7  3  0   1  2  7  3  0  2593
MatSolve               4 1.0 2.3127e-05 1.7 1.57e+03 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1088
MatSOR               315 1.0 1.3988e+00 1.0 1.23e+08 1.0 0.0e+00 0.0e+00 0.0e+00 38 30  0  0  0  38 30  0  0  0  1402
MatLUFactorSym         1 1.0 3.0994e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatLUFactorNum         1 1.0 3.2902e-05 3.3 8.93e+02 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   434
MatConvert             1 1.0 4.2915e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatResidual           36 1.0 8.2138e-02 1.1 1.40e+07 1.0 1.7e+03 9.1e+02 0.0e+00  2  3  8 10  0   2  3  8 10  0  2726
MatAssemblyBegin      30 1.0 5.0912e-02 5.9 0.00e+00 0.0 0.0e+00 0.0e+00 5.8e+01  1  0  0  0  4   1  0  0  0  4     0
MatAssemblyEnd        30 1.0 1.1132e-01 1.1 0.00e+00 0.0 1.6e+03 1.6e+02 1.5e+02  3  0  7  2 12   3  0  7  2 12     0
MatGetRowIJ            1 1.0 2.3127e-05 3.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetSubMatrice       1 1.0 2.5797e-04 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 5.7936e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetRedundant        1 1.0 3.2496e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatMPIConcateSeq       1 1.0 5.2929e-05 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                1 1.0 3.0851e-01 1.0 1.58e+06 1.0 3.2e+03 1.1e+02 5.5e+02  8  0 14  2 42   8  0 14  2 42    82
PCApply                4 1.0 2.7321e+00 1.0 3.86e+08 1.0 1.8e+04 7.9e+02 6.8e+02 74 95 84 93 52  74 95 84 93 52  2251
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

       Krylov Solver    21             21       303808     0
     DMKSP interface     6              6         3936     0
              Vector   315            315     98901528     0
      Vector Scatter    31             31      2845572     0
              Matrix    60             60     61654640     0
   Matrix Null Space    11             11         6556     0
    Distributed Mesh    10             10        48960     0
Star Forest Bipartite Graph    20             20        16000     0
     Discrete System    10             10         7920     0
           Index Set    67             67      1476092     0
   IS L to G Mapping    10             10      1420508     0
      Preconditioner    21             21        19192     0
              Viewer     1              0            0     0
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 1.2207e-05
Average time for zero size MPI_Send(): 1.49012e-06
#PETSc Option Table entries:
-da_grid_x 2049
-da_grid_y 2049
-ksp_monitor
-log_summary
-pc_mg_levels 10
-pc_type mg
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure options: --known-level1-dcache-size=16384 --known-level1-dcache-linesize=64 --known-level1-dcache-assoc=4 --known-memcmp-ok=1 --known-sizeof-char=1 --known-sizeof-void-p=8 --known-sizeof-short=2 --known-sizeof-int=4 --known-sizeof-long=8 --known-sizeof-long-long=8 --known-sizeof-float=4 --known-sizeof-double=8 --known-sizeof-size_t=8 --known-bits-per-byte=8 --known-sizeof-MPI_Comm=4 --known-sizeof-MPI_Fint=4 --known-mpi-long-double=1 --known-mpi-int64_t=1 --known-mpi-c-double-complex=1 --known-sdot-returns-double=0 --known-snrm2-returns-double=0 --with-batch="1 " --known-mpi-shared="0 " --known-mpi-shared-libraries=0 --known-memcmp-ok  --with-blas-lapack-lib="-L/opt/acml/5.3.1/gfortran64/lib  -lacml" --COPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --FOPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --CXXOPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --with-x="0 " --with-debugging=0 --with-clib-autodetect="0 " --with-cxxlib-autodetect="0 " --with-fortranlib-autodetect="0 " --with-shared-libraries="0 " --with-mpi-compilers="1 " --with-cc="cc " --with-cxx="CC " --with-fc="ftn " --download-hypre=1 --download-blacs="1 " --download-scalapack="1 " --download-superlu_dist="1 " --download-metis="1 " --download-parmetis="1 " PETSC_ARCH=gnu-opt-32idx
-----------------------------------------
Libraries compiled on Wed Jun  3 14:35:20 2015 on h2ologin3 
Machine characteristics: Linux-3.0.101-0.46-default-x86_64-with-SuSE-11-x86_64
Using PETSc directory: /mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4
Using PETSc arch: gnu-opt-32idx
-----------------------------------------

Using C compiler: cc  -march=bdver1 -fopenmp -O3 -ffast-math -fPIC  ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn  -march=bdver1 -fopenmp -O3 -ffast-math -fPIC   ${FOPTFLAGS} ${FFLAGS} 
-----------------------------------------

Using include paths: -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/include
-----------------------------------------

Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -L/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -lpetsc -Wl,-rpath,/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -L/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -lsuperlu_dist_3.3 -lHYPRE -L/opt/acml/5.3.1/gfortran64/lib -lacml -lparmetis -lmetis -lpthread -lssl -lcrypto -ldl  
-----------------------------------------

Application 25083726 resources: utime ~57s, stime ~13s, Rss ~165420, inblocks ~19777, outblocks ~47757

