---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

with 16 processors
Using Petsc Release Version 3.1.0, Patch 0, Thu Mar 25 16:15:27 CDT 2010

                         Max       Max/Min        Avg      Total
Time (sec):           9.823e+01      1.01510   9.706e+01
Objects:              8.190e+02      1.00000   8.190e+02
Flops:                2.975e+10      1.09319   2.909e+10  4.655e+11
Flops/sec:            3.073e+08      1.09622   2.997e+08  4.796e+09
MPI Messages:         4.592e+03      1.94165   4.244e+03  6.791e+04
MPI Message Lengths:  6.462e+08      1.45109   1.349e+05  9.163e+09
MPI Reductions:       2.322e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 1.0776e+00   1.1%  0.0000e+00   0.0%  0.000e+00   0.0%  0.000e+00        0.0%  1.600e+01   0.7%
 1:        Assembly: 6.2783e+00   6.5%  0.0000e+00   0.0%  4.378e+03   6.4%  7.288e+04       54.0%  7.900e+01   3.4%
 2:        Solution: 8.9701e+01  92.4%  4.6547e+11 100.0%  6.353e+04  93.6%  6.206e+04       46.0%  2.155e+03  92.8%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %F - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event               Count    Time (sec)       Flops                                    --- Global ---        --- Stage ---     Total
                   Max Ratio Max        Ratio Max      Ratio Mess    Avg len Reduct   %T  %F  %M  %L  %R   %T  %F  %M  %L  %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

MatZeroEntries       3   1.0 2.5301e-01   1.8 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0   18   0   0   0   0       0
VecSet               5   1.0 5.6458e-04   1.9 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0

--- Event Stage 1: Assembly

MatAssemblyBegin     8   1.0 8.7086e-01   1.4 0.00e+00   0.0 1.2e+03 4.1e+06 1.6e+01   1   0   2  52   1   12   0  26  96  20       0
MatAssemblyEnd       8   1.0 2.6968e+00   1.2 0.00e+00   0.0 2.4e+02 8.7e+03 3.0e+01   2   0   0   0   1   38   0   5   0  38       0
VecAssemblyBegin     8   1.0 5.3251e-01   4.4 0.00e+00   0.0 2.2e+03 8.6e+04 2.4e+01   0   0   3   2   1    6   0  51   4  30       0
VecAssemblyEnd       8   1.0 2.0029e-02   4.3 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0

--- Event Stage 2: Solution

MatMult           1017   1.0 2.4187e+01   1.7 7.37e+09   1.1 3.1e+04 5.5e+04 0.0e+00  21  25  45  18   0   23  25  48  40   0    4737
MatSolve          1022   1.0 3.1326e+01   2.5 9.44e+09   1.2 0.0e+00 0.0e+00 0.0e+00  24  31   0   0   0   26  31   0   0   0    4609
MatLUFactorNum       5   1.0 8.8662e-01   1.4 7.36e+08   1.3 0.0e+00 0.0e+00 0.0e+00   1   2   0   0   0    1   2   0   0   0   12349
MatILUFactorSym      5   1.0 3.0662e-01   2.0 0.00e+00   0.0 0.0e+00 0.0e+00 5.0e+00   0   0   0   0   0    0   0   0   0   0       0
MatScale             5   1.0 1.9215e-01   2.0 2.29e+07   1.1 1.5e+02 3.9e+04 0.0e+00   0   0   0   0   0    0   0   0   0   0    1857
MatAssemblyBegin     5   1.0 6.9141e-06   2.2 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0
MatAssemblyEnd       5   1.0 7.1857e-02   2.1 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0
MatGetRowIJ          5   1.0 1.6689e-05   1.9 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0
MatGetSubMatrice     5   1.0 7.2219e-01   1.9 0.00e+00   0.0 7.5e+02 1.1e+06 2.5e+01   1   0   1   9   1    1   0   1  20   1       0
MatGetOrdering       5   1.0 4.4348e-03   1.4 0.00e+00   0.0 0.0e+00 0.0e+00 1.6e+01   0   0   0   0   1    0   0   0   0   1       0
MatIncreaseOvrlp     5   1.0 1.7671e-01   2.4 0.00e+00   0.0 0.0e+00 0.0e+00 1.0e+01   0   0   0   0   0    0   0   0   0   0       0
VecMDot           1014   1.0 4.0270e+01   2.7 5.99e+09   1.0 0.0e+00 0.0e+00 1.0e+03  27  21   0   0  44   29  21   0   0  47    2381
VecNorm           1025   1.0 8.0000e+00   9.1 6.97e+07   1.0 0.0e+00 0.0e+00 1.0e+03   4   0   0   0  44    5   0   0   0  48     139
VecScale          1028   1.0 3.7416e-02   1.4 3.50e+07   1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0   14949
VecCopy             22   1.0 3.5448e-03   1.7 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0
VecSet            2068   1.0 3.9775e-01   1.9 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0       0
VecAXPY             11   1.0 1.3638e-03   1.7 6.52e+05   1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    7644
VecMAXPY          1025   1.0 1.4375e+01   2.0 6.10e+09   1.0 0.0e+00 0.0e+00 0.0e+00  12  21   0   0   0   13  21   0   0   0    6792
VecPointwiseMult    10   1.0 1.5662e-03   1.6 2.40e+05   1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    2451
VecScatterBegin   3071   1.0 8.9620e-01   2.1 0.00e+00   0.0 6.2e+04 5.5e+04 0.0e+00   1   0  91  37   0    1   0  97  80   0       0
VecScatterEnd     3071   1.0 1.4369e+01  18.5 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   4   0   0   0   0    4   0   0   0   0       0
KSPGMRESOrthog    1014   1.0 4.8250e+01   1.7 1.20e+10   1.0 0.0e+00 0.0e+00 1.0e+03  38  41   0   0  44   41  41   0   0  47    3974
KSPSetup            10   1.0 2.3613e-01   1.8 2.29e+07   1.1 1.5e+02 3.9e+04 0.0e+00   0   0   0   0   0    0   0   0   0   0    1511
KSPSolve             5   1.0 8.9626e+01   1.0 2.97e+10   1.1 6.2e+04 6.7e+04 2.1e+03  92 100  92  46  92  100 100  98 100  99    5193
PCSetUp             10   1.0 1.9494e+00   1.5 7.36e+08   1.3 1.0e+03 7.9e+05 8.6e+01   2   2   2   9   4    2   2   2  20   4    5617
PCSetUpOnBlocks      5   1.0 1.1798e+00   1.5 7.36e+08   1.3 0.0e+00 0.0e+00 2.1e+01   1   2   0   0   1    1   2   0   0   1    9281
PCApply           1022   1.0 3.3250e+01   2.0 9.44e+09   1.2 3.1e+04 5.5e+04 0.0e+00  27  31  45  18   0   29  31  48  40   0    4343

--- Event Stage 3: Unknown

------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

              Matrix        12             12  236175152      0
                 Vec         4              8     833440      0
         Vec Scatter         0              4       3568      0

--- Event Stage 1: Assembly

                 Vec         8              4       5376      0
         Vec Scatter         4              0          0      0
           Index Set         8              8      73152      0

--- Event Stage 2: Solution

              Matrix        10             10  291888908      0
                 Vec       697            697  182154928      0
         Vec Scatter        10             10       8920      0
           Index Set        46             46    3804464      0
       Krylov Solver        10             10    3294600      0
      Preconditioner        10             10       7240      0

--- Event Stage 3: Unknown

========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 2.49863e-05
Average time for zero size MPI_Send(): 1.03116e-05
#PETSc Option Table entries:
-block 0.
-ksp_diagonal_scale
-ksp_gmres_restart 200
-ksp_rtol 1.e-8
-ksp_type lgmres
-log_summary
-on_error_mpiabort
-pc_type asm
-tc 1
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8
Configure run at: Thu May 5 15:41:47 2011
Configure options: PETSC_ARCH=linux-gnu-c --with-debugging=no --with-cc=mpicc --download-scalapack=1 --with-cxx=mpicxx --download-mumps=1 --download-ml=1 --download-parmetis=1 --download-f-blas-lapack=1 --download-blacs=1 --with-fc=mpif90 --download-hypre=1
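
Note on the "Total Mflop/s" column: the legend defines it as (sum of flops over all processors)/(max time over all processors), scaled to Mflop/s. The sketch below is a rough cross-check of the KSPSolve row, not part of the PETSc output. The function name `total_mflops` is hypothetical, the 1e-6 scale factor is an assumption (the legend prints 10e-6, but 1e-6 is the value that reproduces the reported figure), and the Solution stage flop total is used as a stand-in for KSPSolve's flops because that event shows 100 %F within the stage.

```python
def total_mflops(sum_flops: float, max_time_sec: float) -> float:
    """Rate in Mflop/s: flops summed over all processes / max time over all processes.

    The 1e-6 factor (flop/s to Mflop/s) is an assumption chosen to match the log.
    """
    if max_time_sec <= 0.0:
        return 0.0  # guard against events that report zero time
    return 1e-6 * sum_flops / max_time_sec


# Figures copied from the log above.
stage2_flops = 4.6547e+11    # "2: Solution" stage, flops summed over all processes
ksp_solve_time = 8.9626e+01  # KSPSolve row, Time (sec) Max

ksp_mflops = total_mflops(stage2_flops, ksp_solve_time)
print(f"KSPSolve: {ksp_mflops:.0f} Mflop/s")  # the KSPSolve row reports 5193
```

The same function can be applied to other rows once their flop totals over all processes are known; the table itself only lists the per-process maximum and the max/min ratio, so those totals cannot be read directly off a single row.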