[petsc-users] Weird behavior for log_summary
Michele Rosso
mrosso at uci.edu
Mon Jun 8 13:46:47 CDT 2015
Jed,
In petscconf.h I have
#ifndef PETSC_USE_LOG
#define PETSC_USE_LOG 1
#endif
so I guess that is not the problem.
I ran ex50 and attached the output: it does print the summary.
Then I guess there is something wrong with my code.
I call mpi_init before petsc_initialize and then I finalize everything
with petsc_finalize. I do not believe I added any other command that
could turn off the log_summary output.
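For reference, here is a minimal sketch of that initialization/finalization ordering (shown in C for brevity; the Fortran calls are analogous). One thing worth keeping in mind: when the application calls MPI_Init itself, PetscFinalize still prints the -log_summary report, but it does not call MPI_Finalize, so the application has to do that last.

  #include <petscsys.h>

  int main(int argc, char **argv)
  {
    /* MPI is initialized by the application, as in the code described above. */
    MPI_Init(&argc, &argv);

    /* PetscInitialize detects that MPI is already running and does not
       re-initialize it; -log_summary is parsed here and logging begins. */
    PetscInitialize(&argc, &argv, NULL, NULL);

    /* ... set up and solve ... */

    /* PetscFinalize prints the -log_summary report.  Since MPI was
       initialized by the application, PetscFinalize does not call
       MPI_Finalize; the application must do that itself, afterwards. */
    PetscFinalize();
    MPI_Finalize();
    return 0;
  }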
Thanks,
Michele
On Mon, 2015-06-08 at 12:02 -0600, Jed Brown wrote:
> Michele Rosso <mrosso at uci.edu> writes:
>
> > Is there any external software I should link at compile time to allow
> > PETSc to provide log info?
> > Also, is there any define in petscconf.h that controls profiling?
>
> There is PETSC_USE_LOG, but that should always be on, in which case
> everything is run-time rather than compile-time. When you run a PETSc
> example, does the log_summary correctly contain events? Is it possible
> that your code is turning off events somewhere?
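As a quick run-time check of the logging Jed mentions, a small sketch (with hypothetical event and stage names) that registers a user event and stage: with -log_summary active they should appear in the report, and a stray deactivation call like the commented-out one below is the kind of thing that would silently drop an event from the summary.

  #include <petscsys.h>

  int main(int argc, char **argv)
  {
    PetscClassId  classid;
    PetscLogEvent my_event;   /* hypothetical user event */
    PetscLogStage my_stage;   /* hypothetical user stage */

    PetscInitialize(&argc, &argv, NULL, NULL);

    /* Register a class, an event, and a stage; with -log_summary they
       should show up in the report if logging is active. */
    PetscClassIdRegister("MyApp", &classid);
    PetscLogEventRegister("MySolvePhase", classid, &my_event);
    PetscLogStageRegister("Solve stage", &my_stage);

    PetscLogStagePush(my_stage);
    PetscLogEventBegin(my_event, 0, 0, 0, 0);
    /* ... work to be timed ... */
    PetscLogEventEnd(my_event, 0, 0, 0, 0);
    PetscLogStagePop();

    /* An accidental call like this would remove the event from the
       summary; PetscLogEventActivate(my_event) turns it back on. */
    /* PetscLogEventDeactivate(my_event); */

    PetscFinalize();
    return 0;
  }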
-------------- next part --------------
0 KSP Residual norm 4.419256232155e+01
1 KSP Residual norm 5.451780353427e-01
2 KSP Residual norm 2.277974394337e-02
3 KSP Residual norm 2.972141446447e-04
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************
---------------------------------------------- PETSc Performance Summary: ----------------------------------------------
./ex50 on a gnu-opt-32idx named Àþÿÿ with 16 processors, by mrosso Mon Jun 8 13:25:36 2015
Using Petsc Release Version 3.5.4, May, 23, 2015
Max Max/Min Avg Total
Time (sec): 3.684e+00 1.00250 3.680e+00
Objects: 5.830e+02 1.00000 5.830e+02
Flops: 4.062e+08 1.00571 4.045e+08 6.472e+09
Flops/sec: 1.103e+08 1.00588 1.099e+08 1.759e+09
MPI Messages: 1.823e+03 2.00550 1.372e+03 2.196e+04
MPI Message Lengths: 1.306e+06 1.99861 7.158e+02 1.572e+07
MPI Reductions: 1.307e+03 1.00000
Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
e.g., VecAXPY() for real vectors of length N --> 2N flops
and VecAXPY() for complex vectors of length N --> 8N flops
Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions --
Avg %Total Avg %Total counts %Total Avg %Total counts %Total
0: Main Stage: 3.6802e+00 100.0% 6.4725e+09 100.0% 2.196e+04 100.0% 7.158e+02 100.0% 1.306e+03 99.9%
------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
Count: number of times phase was executed
Time and Flops: Max - maximum over all processors
Ratio - ratio of maximum to minimum over all processors
Mess: number of messages sent
Avg. len: average message length (bytes)
Reduct: number of global reductions
Global: entire computation
Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %F - percent flops in this phase
%M - percent messages in this phase %L - percent message lengths in this phase
%R - percent reductions in this phase
Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event Count Time (sec) Flops --- Global --- --- Stage --- Total
Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------
--- Event Stage 0: Main Stage
KSPGMRESOrthog 93 1.0 2.7896e-01 1.2 8.37e+07 1.0 0.0e+00 0.0e+00 9.3e+01 7 21 0 0 7 7 21 0 0 7 4778
KSPSetUp 21 1.0 5.7312e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+02 1 0 0 0 12 1 0 0 0 12 0
KSPSolve 1 1.0 3.6051e+00 1.0 4.06e+08 1.0 2.2e+04 7.1e+02 1.3e+03 98100100 99 99 98100100 99 99 1795
VecMDot 93 1.0 1.3218e-01 1.1 4.18e+07 1.0 0.0e+00 0.0e+00 9.3e+01 3 10 0 0 7 3 10 0 0 7 5042
VecNorm 103 1.0 5.8708e-02 4.9 9.84e+06 1.0 0.0e+00 0.0e+00 1.0e+02 1 2 0 0 8 1 2 0 0 8 2671
VecScale 103 1.0 1.5907e-02 1.2 4.92e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 4929
VecCopy 46 1.0 2.1967e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0
VecSet 188 1.0 2.1128e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0
VecAXPY 19 1.0 1.0012e-02 1.3 1.93e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3076
VecAYPX 288 1.0 1.1642e-01 1.1 1.41e+07 1.0 0.0e+00 0.0e+00 0.0e+00 3 3 0 0 0 3 3 0 0 0 1924
VecAXPBYCZ 144 1.0 9.7574e-02 1.1 2.81e+07 1.0 0.0e+00 0.0e+00 0.0e+00 3 7 0 0 0 3 7 0 0 0 4592
VecMAXPY 103 1.0 1.8963e-01 1.5 5.04e+07 1.0 0.0e+00 0.0e+00 0.0e+00 4 12 0 0 0 4 12 0 0 0 4238
VecAssemblyBegin 1 1.0 7.0770e-0348.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecAssemblyEnd 1 1.0 1.2207e-04 8.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecScatterBegin 407 1.0 7.0384e-03 1.8 0.00e+00 0.0 1.9e+04 8.1e+02 0.0e+00 0 0 86 97 0 0 0 86 97 0 0
VecScatterEnd 407 1.0 5.3825e-03 1.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecNormalize 103 1.0 7.4179e-02 2.9 1.48e+07 1.0 0.0e+00 0.0e+00 1.0e+02 1 4 0 0 8 1 4 0 0 8 3171
MatMult 318 1.0 6.5145e-01 1.1 1.18e+08 1.0 1.5e+04 9.4e+02 0.0e+00 17 29 70 91 0 17 29 70 91 0 2880
MatMultAdd 36 1.0 3.7364e-02 1.0 6.32e+06 1.0 1.2e+03 3.3e+02 0.0e+00 1 2 5 3 0 1 2 5 3 0 2697
MatMultTranspose 45 1.0 4.8568e-02 1.1 7.90e+06 1.0 1.5e+03 3.3e+02 0.0e+00 1 2 7 3 0 1 2 7 3 0 2593
MatSolve 4 1.0 2.3127e-05 1.7 1.57e+03 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1088
MatSOR 315 1.0 1.3988e+00 1.0 1.23e+08 1.0 0.0e+00 0.0e+00 0.0e+00 38 30 0 0 0 38 30 0 0 0 1402
MatLUFactorSym 1 1.0 3.0994e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatLUFactorNum 1 1.0 3.2902e-05 3.3 8.93e+02 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 434
MatConvert 1 1.0 4.2915e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatResidual 36 1.0 8.2138e-02 1.1 1.40e+07 1.0 1.7e+03 9.1e+02 0.0e+00 2 3 8 10 0 2 3 8 10 0 2726
MatAssemblyBegin 30 1.0 5.0912e-02 5.9 0.00e+00 0.0 0.0e+00 0.0e+00 5.8e+01 1 0 0 0 4 1 0 0 0 4 0
MatAssemblyEnd 30 1.0 1.1132e-01 1.1 0.00e+00 0.0 1.6e+03 1.6e+02 1.5e+02 3 0 7 2 12 3 0 7 2 12 0
MatGetRowIJ 1 1.0 2.3127e-05 3.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetSubMatrice 1 1.0 2.5797e-04 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 1.0 5.7936e-05 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetRedundant 1 1.0 3.2496e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatMPIConcateSeq 1 1.0 5.2929e-05 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
PCSetUp 1 1.0 3.0851e-01 1.0 1.58e+06 1.0 3.2e+03 1.1e+02 5.5e+02 8 0 14 2 42 8 0 14 2 42 82
PCApply 4 1.0 2.7321e+00 1.0 3.86e+08 1.0 1.8e+04 7.9e+02 6.8e+02 74 95 84 93 52 74 95 84 93 52 2251
------------------------------------------------------------------------------------------------------------------------
Memory usage is given in bytes:
Object Type Creations Destructions Memory Descendants' Mem.
Reports information only for process 0.
--- Event Stage 0: Main Stage
Krylov Solver 21 21 303808 0
DMKSP interface 6 6 3936 0
Vector 315 315 98901528 0
Vector Scatter 31 31 2845572 0
Matrix 60 60 61654640 0
Matrix Null Space 11 11 6556 0
Distributed Mesh 10 10 48960 0
Star Forest Bipartite Graph 20 20 16000 0
Discrete System 10 10 7920 0
Index Set 67 67 1476092 0
IS L to G Mapping 10 10 1420508 0
Preconditioner 21 21 19192 0
Viewer 1 0 0 0
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 1.2207e-05
Average time for zero size MPI_Send(): 1.49012e-06
#PETSc Option Table entries:
-da_grid_x 2049
-da_grid_y 2049
-ksp_monitor
-log_summary
-pc_mg_levels 10
-pc_type mg
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure options: --known-level1-dcache-size=16384 --known-level1-dcache-linesize=64 --known-level1-dcache-assoc=4 --known-memcmp-ok=1 --known-sizeof-char=1 --known-sizeof-void-p=8 --known-sizeof-short=2 --known-sizeof-int=4 --known-sizeof-long=8 --known-sizeof-long-long=8 --known-sizeof-float=4 --known-sizeof-double=8 --known-sizeof-size_t=8 --known-bits-per-byte=8 --known-sizeof-MPI_Comm=4 --known-sizeof-MPI_Fint=4 --known-mpi-long-double=1 --known-mpi-int64_t=1 --known-mpi-c-double-complex=1 --known-sdot-returns-double=0 --known-snrm2-returns-double=0 --with-batch="1 " --known-mpi-shared="0 " --known-mpi-shared-libraries=0 --known-memcmp-ok --with-blas-lapack-lib="-L/opt/acml/5.3.1/gfortran64/lib -lacml" --COPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --FOPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --CXXOPTFLAGS="-march=bdver1 -fopenmp -O3 -ffast-math -fPIC " --with-x="0 " --with-debugging=0 --with-clib-autodetect="0 " --with-cxxlib-autodetect="0 " --with-fortranlib-autodetect="0 " --with-shared-libraries="0 " --with-mpi-compilers="1 " --with-cc="cc " --with-cxx="CC " --with-fc="ftn " --download-hypre=1 --download-blacs="1 " --download-scalapack="1 " --download-superlu_dist="1 " --download-metis="1 " --download-parmetis="1 " PETSC_ARCH=gnu-opt-32idx
-----------------------------------------
Libraries compiled on Wed Jun 3 14:35:20 2015 on h2ologin3
Machine characteristics: Linux-3.0.101-0.46-default-x86_64-with-SuSE-11-x86_64
Using PETSc directory: /mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4
Using PETSc arch: gnu-opt-32idx
-----------------------------------------
Using C compiler: cc -march=bdver1 -fopenmp -O3 -ffast-math -fPIC ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn -march=bdver1 -fopenmp -O3 -ffast-math -fPIC ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/include -I/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -L/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -lpetsc -Wl,-rpath,/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -L/mnt/a/u/sciteam/mrosso/LIBS/petsc-3.5.4/gnu-opt-32idx/lib -lsuperlu_dist_3.3 -lHYPRE -L/opt/acml/5.3.1/gfortran64/lib -lacml -lparmetis -lmetis -lpthread -lssl -lcrypto -ldl
-----------------------------------------
Application 25083726 resources: utime ~57s, stime ~13s, Rss ~165420, inblocks ~19777, outblocks ~47757