---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

with 8 processors
Using Petsc Release Version 3.1.0, Patch 0, Thu Mar 25 16:15:27 CDT 2010

                         Max       Max/Min        Avg      Total
Time (sec):           1.235e+02      1.00349   1.233e+02
Objects:              8.190e+02      1.00000   8.190e+02
Flops:                3.783e+10      1.05254   3.705e+10  2.964e+11
Flops/sec:            3.066e+08      1.05345   3.004e+08  2.403e+09
MPI Messages:         2.912e+03      1.96359   2.544e+03  2.035e+04
MPI Message Lengths:  8.684e+08      1.81405   2.684e+05  5.461e+09
MPI Reductions:       1.572e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 1.5740e+00   1.3%  0.0000e+00   0.0%  0.000e+00   0.0%  0.000e+00        0.0%  1.600e+01   1.0%
 1:        Assembly: 1.2683e+01  10.3%  0.0000e+00   0.0%  1.330e+03   6.5%  1.998e+05       74.4%  7.900e+01   5.0%
 2:        Solution: 1.0906e+02  88.4%  2.9638e+11 100.0%  1.902e+04  93.5%  6.860e+04       25.6%  1.405e+03  89.4%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %F - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event               Count       Time (sec)        Flops                                 --- Global ---       --- Stage ---       Total
                   Max Ratio        Max Ratio      Max Ratio    Mess Avg len  Reduct   %T  %F  %M  %L  %R   %T  %F  %M  %L  %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

MatZeroEntries       3   1.0 4.7752e-01   1.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0   23   0   0   0   0       0
VecSet               5   1.0 1.1029e-03   1.7 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0

--- Event Stage 1: Assembly

MatAssemblyBegin     8   1.0 1.5275e+00   1.5 0.00e+00   0.0 4.1e+02 9.4e+06 1.6e+01    1   0   2  71   1    9   0  31  96  20       0
MatAssemblyEnd       8   1.0 5.5453e+00   1.2 0.00e+00   0.0 1.1e+02 8.7e+03 3.0e+01    4   0   1   0   2   40   0   8   0  38       0
VecAssemblyBegin     8   1.0 1.1837e+00  11.2 0.00e+00   0.0 6.1e+02 2.6e+05 2.4e+01    1   0   3   3   2    6   0  46   4  30       0
VecAssemblyEnd       8   1.0 4.0169e-02   3.2 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0

--- Event Stage 2: Solution

MatMult            642   1.0 2.9808e+01   1.5 9.27e+09   1.1 9.0e+03 5.5e+04 0.0e+00   20  24  44   9   0   23  24  47  35   0    2420
MatSolve           647   1.0 3.4762e+01   2.0 1.06e+10   1.1 0.0e+00 0.0e+00 0.0e+00   20  27   0   0   0   23  27   0   0   0    2336
MatLUFactorNum       5   1.0 1.5293e+00   1.2 1.33e+09   1.2 0.0e+00 0.0e+00 0.0e+00    1   3   0   0   0    1   3   0   0   0    6486
MatILUFactorSym      5   1.0 5.4954e-01   1.8 0.00e+00   0.0 0.0e+00 0.0e+00 5.0e+00    0   0   0   0   0    0   0   0   0   0       0
MatScale             5   1.0 3.9973e-01   1.5 4.58e+07   1.1 7.0e+01 3.9e+04 0.0e+00    0   0   0   0   0    0   0   0   0   0     893
MatAssemblyBegin     5   1.0 6.4373e-06   1.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0
MatAssemblyEnd       5   1.0 1.2016e-01   1.7 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0
MatGetRowIJ          5   1.0 1.2875e-05   1.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0
MatGetSubMatrice     5   1.0 9.7579e-01   1.4 0.00e+00   0.0 3.5e+02 1.1e+06 2.5e+01    1   0   2   7   2    1   0   2  28   2       0
MatGetOrdering       5   1.0 7.7188e-03   1.5 0.00e+00   0.0 0.0e+00 0.0e+00 1.6e+01    0   0   0   0   1    0   0   0   0   1       0
MatIncreaseOvrlp     5   1.0 2.6123e-01   2.2 0.00e+00   0.0 0.0e+00 0.0e+00 1.0e+01    0   0   0   0   1    0   0   0   0   1       0
VecMDot            639   1.0 4.5762e+01   2.5 8.15e+09   1.0 0.0e+00 0.0e+00 6.4e+02   25  22   0   0  41   29  22   0   0  45    1424
VecNorm            650   1.0 9.0227e+00  14.8 8.82e+07   1.0 0.0e+00 0.0e+00 6.5e+02    4   0   0   0  41    5   0   0   0  46      78
VecScale           653   1.0 4.1008e-02   1.2 4.43e+07   1.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0    8640
VecCopy             22   1.0 7.0348e-03   1.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0
VecSet            1318   1.0 4.6461e-01   1.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0       0
VecAXPY             11   1.0 2.9740e-03   1.6 1.30e+06   1.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0    3505
VecMAXPY           650   1.0 1.9291e+01   1.7 8.31e+09   1.0 0.0e+00 0.0e+00 0.0e+00   12  22   0   0   0   14  22   0   0   0    3448
VecPointwiseMult    10   1.0 2.8253e-03   1.4 4.80e+05   1.0 0.0e+00 0.0e+00 0.0e+00    0   0   0   0   0    0   0   0   0   0    1358
VecScatterBegin   1946   1.0 1.1120e+00   1.8 0.00e+00   0.0 1.8e+04 5.5e+04 0.0e+00    1   0  90  18   0    1   0  96  72   0       0
VecScatterEnd     1946   1.0 1.1044e+01  12.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00    5   0   0   0   0    5   0   0   0   0       0
KSPGMRESOrthog     639   1.0 5.7099e+01   1.5 1.63e+10   1.0 0.0e+00 0.0e+00 6.4e+02   37  44   0   0  41   42  44   0   0  45    2283
KSPSetup            10   1.0 4.8600e-01   1.4 4.58e+07   1.1 7.0e+01 3.9e+04 0.0e+00    0   0   0   0   0    0   0   0   0   0     734
KSPSolve             5   1.0 1.0894e+02   1.0 3.78e+10   1.1 1.9e+04 7.4e+04 1.4e+03   88 100  91  25  87  100 100  98  99  98    2720
PCSetUp             10   1.0 3.1730e+00   1.2 1.33e+09   1.2 4.9e+02 7.9e+05 8.6e+01    2   3   2   7   5    3   3   3  28   6    3126
PCSetUpOnBlocks      5   1.0 2.0869e+00   1.3 1.33e+09   1.2 0.0e+00 0.0e+00 2.1e+01    1   3   0   0   1    2   3   0   0   1    4753
PCApply            647   1.0 3.6608e+01   1.8 1.06e+10   1.1 9.1e+03 5.5e+04 0.0e+00   23  27  45   9   0   26  27  48  36   0    2218

--- Event Stage 3: Unknown

------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions      Memory   Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

              Matrix        12             12   472253768     0
                 Vec         4              8     1518528     0
         Vec Scatter         0              4        3568     0

--- Event Stage 1: Assembly

                 Vec         8              4        5376     0
         Vec Scatter         4              0           0     0
           Index Set         8              8       73160     0

--- Event Stage 2: Solution

              Matrix        10             10   564665548     0
                 Vec       697            697   362979408     0
         Vec Scatter        10             10        8920     0
           Index Set        46             46     7149096     0
       Krylov Solver        10             10     3294600     0
      Preconditioner        10             10        7240     0

--- Event Stage 3: Unknown

========================================================================================================================
Average time to get PetscTime(): 0
Average time for MPI_Barrier(): 1.94073e-05
Average time for zero size MPI_Send(): 1.13547e-05
#PETSc Option Table entries:
-block 0.
-ksp_diagonal_scale
-ksp_gmres_restart 200
-ksp_rtol 1.e-8
-ksp_type lgmres
-log_summary
-on_error_mpiabort
-pc_type asm
-tc 1
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8
Configure run at: Thu May 5 15:41:47 2011
Configure options: PETSC_ARCH=linux-gnu-c --with-debugging=no --with-cc=mpicc --download-scalapack=1 --with-cxx=mpicxx --download-mumps=1 --download-ml=1 --download-parmetis=1 --download-f-blas-lapack=1 --download-blacs=1 --with-fc=mpif90 --download-hypre=1
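
The "Total Mflop/s" and "%Total" columns above can be cross-checked by hand. The sketch below is only an illustration, not part of the PETSc output: the helper names are made up, and the numbers are copied manually from the log. It assumes that the summed flops for KSPSolve equal the Solution stage total (the log reports KSPSolve at 100% of that stage's flops) and that the per-stage average times add up to the total run time.

```python
# Values copied by hand from the log above (not read from a file).
SOLUTION_STAGE_FLOPS = 2.9638e11  # "Summary of Stages", stage 2, summed over all processes
KSPSOLVE_MAX_TIME = 1.0894e02     # KSPSolve row, "Time (sec)" Max column
STAGE_AVG_TIMES = {               # "Summary of Stages", Time Avg column
    "Main Stage": 1.5740e00,
    "Assembly": 1.2683e01,
    "Solution": 1.0906e02,
}


def total_mflops(sum_flops, max_time):
    """Aggregate rate in Mflop/s: 1e-6 * (sum of flops) / (max time)."""
    return 1e-6 * sum_flops / max_time


def stage_time_percentages(avg_times):
    """Share of run time per stage, assuming the stage times sum to the total."""
    total = sum(avg_times.values())
    return {name: round(100.0 * t / total, 1) for name, t in avg_times.items()}


# Assumption: KSPSolve accounts for all flops of the Solution stage (%F = 100).
ksp_rate = total_mflops(SOLUTION_STAGE_FLOPS, KSPSOLVE_MAX_TIME)
percentages = stage_time_percentages(STAGE_AVG_TIMES)

if __name__ == "__main__":
    print(f"KSPSolve: {ksp_rate:.0f} Mflop/s")
    for name, pct in percentages.items():
        print(f"{name}: {pct}% of run time")
```

With these inputs the computed KSPSolve rate lands within 1 Mflop/s of the 2720 reported in the event table, and the stage shares round to the 1.3%, 10.3% and 88.4% shown in the stage summary. Note that the legend prints the scale factor as "10e-6", while the logged values are consistent with a factor of 1e-6.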