---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- with 4 processors Using Petsc Release Version 3.1.0, Patch 0, Thu Mar 25 16:15:27 CDT 2010 Max Max/Min Avg Total Time (sec): 1.730e+02 1.00408 1.728e+02 Objects: 7.120e+02 1.00000 7.120e+02 Flops: 5.410e+10 1.03017 5.318e+10 2.127e+11 Flops/sec: 3.139e+08 1.03438 3.078e+08 1.231e+09 MPI Messages: 2.216e+03 1.95587 1.674e+03 6.698e+03 MPI Message Lengths: 1.111e+09 9.02408 3.680e+05 2.465e+09 MPI Reductions: 1.278e+03 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 2.9893e+00 1.7% 0.0000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 1.600e+01 1.3% 1: Assembly: 1.9585e+01 11.3% 0.0000e+00 0.0% 3.680e+02 5.5% 2.933e+05 79.7% 7.900e+01 6.2% 2: Solution: 1.5020e+02 86.9% 2.1273e+11 100.0% 6.330e+03 94.5% 7.465e+04 20.3% 1.111e+03 86.9% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage MatZeroEntries 3 1.0 9.4196e-01 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 27 0 0 0 0 0 VecSet 5 1.0 2.3990e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 --- Event Stage 1: Assembly MatAssemblyBegin 8 1.0 1.7880e+00 2.7 0.00e+00 0.0 1.2e+02 1.6e+07 1.6e+01 1 0 2 77 1 6 0 33 96 20 0 MatAssemblyEnd 8 1.0 5.6752e+00 1.2 0.00e+00 0.0 4.8e+01 8.6e+03 3.0e+01 3 0 1 0 2 26 0 13 0 38 0 VecAssemblyBegin 8 1.0 1.4468e+0012.1 0.00e+00 0.0 1.5e+02 4.7e+05 2.4e+01 0 0 2 3 2 3 0 41 4 30 0 VecAssemblyEnd 8 1.0 3.5721e-0224.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 --- Event Stage 2: Solution MatMult 495 1.0 4.1255e+01 1.0 1.41e+10 1.0 3.0e+03 5.5e+04 0.0e+00 24 26 44 7 0 27 26 47 33 0 1344 MatSolve 500 1.0 4.5819e+01 1.3 1.52e+10 1.1 0.0e+00 0.0e+00 0.0e+00 23 28 0 0 0 27 28 0 0 0 1280 MatLUFactorNum 5 1.0 2.7440e+00 1.0 2.48e+09 1.1 0.0e+00 0.0e+00 0.0e+00 2 4 0 0 0 2 4 0 0 0 3425 MatILUFactorSym 5 1.0 9.8986e-01 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00 1 0 0 0 0 1 0 0 0 0 0 MatScale 5 1.0 7.1962e-01 1.0 9.09e+07 1.0 3.0e+01 3.9e+04 0.0e+00 0 0 0 0 0 0 0 0 0 0 496 MatAssemblyBegin 5 1.0 5.2452e-06 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 5 1.0 2.4028e-01 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRowIJ 5 1.0 1.5020e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetSubMatrice 5 1.0 1.6018e+00 1.3 0.00e+00 0.0 1.5e+02 1.1e+06 2.5e+01 1 0 2 7 2 1 0 2 33 2 0 MatGetOrdering 5 1.0 1.3533e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.6e+01 0 0 0 0 1 0 0 0 0 1 0 MatIncreaseOvrlp 5 1.0 2.1005e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01 0 0 0 0 1 0 0 0 0 1 0 VecMDot 495 1.0 3.7390e+01 1.5 1.09e+10 1.0 0.0e+00 0.0e+00 5.0e+02 18 21 0 0 39 21 21 0 0 45 1171 VecNorm 500 1.0 5.8201e+00 3.7 1.35e+08 1.0 0.0e+00 0.0e+00 5.0e+02 2 0 0 0 39 2 0 0 0 45 93 VecScale 500 1.0 6.6864e-02 1.3 6.76e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 4043 VecCopy 10 1.0 4.0643e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1015 1.0 6.2747e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 5 1.0 2.1534e-03 1.1 9.60e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1782 VecMAXPY 500 1.0 2.6078e+01 1.2 1.11e+10 1.0 0.0e+00 0.0e+00 0.0e+00 14 21 0 0 0 16 21 0 0 0 1699 VecPointwiseMult 10 1.0 5.2419e-03 1.1 9.60e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 732 VecScatterBegin 1505 1.0 1.4740e+00 1.2 0.00e+00 0.0 6.0e+03 5.5e+04 0.0e+00 1 0 90 13 0 1 0 95 66 0 0 VecScatterEnd 1505 1.0 1.0546e+01 8.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 3 0 0 0 0 4 0 0 0 0 0 KSPGMRESOrthog 495 1.0 5.9058e+01 1.1 2.19e+10 1.0 0.0e+00 0.0e+00 5.0e+02 32 41 0 0 39 37 41 0 0 45 1483 KSPSetup 10 1.0 9.0940e-01 1.0 9.09e+07 1.0 3.0e+01 3.9e+04 0.0e+00 1 0 0 0 0 1 0 0 0 0 392 KSPSolve 5 1.0 1.5000e+02 1.0 5.41e+10 1.0 6.2e+03 7.9e+04 1.1e+03 87100 93 20 85 100100 98 99 97 1418 PCSetUp 10 1.0 5.5895e+00 1.1 2.48e+09 1.1 2.1e+02 7.9e+05 8.6e+01 3 4 3 7 7 4 4 3 33 8 1681 PCSetUpOnBlocks 5 1.0 3.7476e+00 1.1 2.48e+09 1.1 0.0e+00 0.0e+00 2.1e+01 2 4 0 0 2 2 4 0 0 2 2507 PCApply 500 1.0 4.8406e+01 1.3 1.52e+10 1.1 3.0e+03 5.5e+04 0.0e+00 25 28 45 7 0 29 28 47 33 0 1211 --- Event Stage 3: Unknown ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Matrix 12 12 944404432 0 Vec 4 8 2888640 0 Vec Scatter 0 4 3568 0 --- Event Stage 1: Assembly Vec 8 4 5376 0 Vec Scatter 4 0 0 0 Index Set 8 8 73152 0 --- Event Stage 2: Solution Matrix 10 10 1110188204 0 Vec 590 590 606715160 0 Vec Scatter 10 10 8920 0 Index Set 46 46 13540336 0 Krylov Solver 10 10 3294600 0 Preconditioner 10 10 7240 0 --- Event Stage 3: Unknown ======================================================================================================================== Average time to get PetscTime(): 9.53674e-08 Average time for MPI_Barrier(): 1.20163e-05 Average time for zero size MPI_Send(): 1.12057e-05 #PETSc Option Table entries: -block 0. -ksp_diagonal_scale -ksp_gmres_restart 200 -ksp_rtol 1.e-8 -ksp_type lgmres -log_summary -on_error_mpiabort -pc_type asm -tc 1 #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 Configure run at: Thu May 5 15:41:47 2011 Configure options: PETSC_ARCH=linux-gnu-c --with-debugging=no --with-cc=mpicc --download-scalapack=1 --with-cxx=mpicxx --download-mumps=1 --download-ml=1 --download-parmetis=1 --download-f-blas-lapack=1 --download-blacs=1 --with-fc=mpif90 --download-hypre=1