Residual norm 2.08391e-07 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ./ex45 on a arch-linu named kazan with 2 processors, by fpoulin Fri Feb 24 21:29:48 2012 Using Petsc Release Version 3.2.0, Patch 6, Wed Jan 11 09:28:45 CST 2012 Max Max/Min Avg Total Time (sec): 1.112e+02 1.00000 1.112e+02 Objects: 3.200e+02 1.00000 3.200e+02 Flops: 3.627e+09 1.00833 3.612e+09 7.224e+09 Flops/sec: 3.260e+07 1.00833 3.247e+07 6.494e+07 MPI Messages: 1.260e+02 1.00000 1.260e+02 2.520e+02 MPI Message Lengths: 9.335e+06 1.00005 7.408e+04 1.867e+07 MPI Reductions: 5.890e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 1.1124e+02 100.0% 7.2242e+09 100.0% 2.520e+02 100.0% 7.408e+04 100.0% 5.880e+02 99.8% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %f - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage KSPGMRESOrthog 3 1.0 4.9800e+00 1.2 2.04e+08 1.0 0.0e+00 0.0e+00 3.0e+00 4 6 0 0 1 4 6 0 0 1 82 KSPSetup 9 1.0 6.2236e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.4e+01 1 0 0 0 6 1 0 0 0 6 0 KSPSolve 1 1.0 4.9219e+01 1.0 3.42e+09 1.0 1.6e+02 1.0e+05 7.5e+01 44 94 63 85 13 44 94 63 85 13 138 VecMDot 3 1.0 4.2009e+00 1.2 1.02e+08 1.0 0.0e+00 0.0e+00 3.0e+00 3 3 0 0 1 3 3 0 0 1 48 VecNorm 5 1.0 1.1645e+00 1.4 8.52e+07 1.0 0.0e+00 0.0e+00 5.0e+00 1 2 0 0 1 1 2 0 0 1 146 VecScale 28 1.0 2.8942e-01 1.0 3.44e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 237 VecCopy 1 1.0 9.2836e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 105 1.0 6.8287e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecAXPY 2 1.0 2.1710e-01 1.0 3.41e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 313 VecAYPX 24 1.0 1.5245e+00 1.0 3.91e+07 1.0 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 1 1 0 0 0 51 VecMAXPY 4 1.0 1.0060e+00 1.0 1.53e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 4 0 0 0 1 4 0 0 0 304 VecScatterBegin 114 1.0 3.6135e-02 1.4 0.00e+00 0.0 1.7e+02 1.0e+05 0.0e+00 0 0 66 92 0 0 0 66 92 0 0 VecScatterEnd 114 1.0 7.3426e-0122.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 4 1.0 1.2534e+00 1.3 1.02e+08 1.0 0.0e+00 0.0e+00 4.0e+00 1 3 0 0 1 1 3 0 0 1 163 PCSetUp 1 1.0 2.0011e+01 1.0 6.55e+07 1.0 8.4e+01 7.6e+03 4.6e+02 18 2 33 3 78 18 2 33 3 78 7 PCApply 4 1.0 3.9776e+01 1.0 2.71e+09 1.0 1.5e+02 8.4e+04 4.8e+01 35 75 60 68 8 35 75 60 68 8 136 MatMult 28 1.0 6.5616e+00 1.1 9.47e+08 1.0 5.6e+01 1.8e+05 0.0e+00 6 26 22 53 0 6 26 22 53 0 287 MatMultAdd 24 1.0 3.2982e+00 1.0 2.62e+08 1.0 2.4e+01 3.0e+04 0.0e+00 3 7 10 4 0 3 7 10 4 0 158 MatMultTranspose 30 1.0 6.6899e+00 1.0 3.28e+08 1.0 3.0e+01 3.0e+04 0.0e+00 6 9 12 5 0 6 9 12 5 0 98 MatSolve 4 1.0 3.1781e-04 1.1 2.35e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 148 MatSOR 48 1.0 2.5600e+01 1.1 1.64e+09 1.0 4.8e+01 1.2e+05 4.8e+01 22 45 19 30 8 22 45 19 30 8 128 MatLUFactorSym 1 1.0 3.0804e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatLUFactorNum 1 1.0 2.5415e-04 1.0 9.23e+03 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 73 MatAssemblyBegin 21 1.0 4.3024e-01 3.3 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+01 0 0 0 0 7 0 0 0 0 7 0 MatAssemblyEnd 21 1.0 4.2605e+00 1.0 0.00e+00 0.0 4.0e+01 2.0e+04 1.2e+02 4 0 16 4 21 4 0 16 4 21 0 MatGetRowIJ 1 1.0 3.6001e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetOrdering 1 1.0 1.6093e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRedundant 1 1.0 3.2711e-04 1.0 0.00e+00 0.0 6.0e+00 1.6e+03 4.0e+00 0 0 2 0 1 0 0 2 0 1 0 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Container 1 1 548 0 Krylov Solver 9 9 26960 0 Distributed Mesh 7 7 78845168 0 Vector 148 148 1980495064 0 Vector Scatter 29 29 30044 0 Index Set 68 68 39458232 0 IS L to G Mapping 7 7 39409564 0 Preconditioner 9 9 8232 0 Matrix 41 41 2069550948 0 Viewer 1 0 0 0 ======================================================================================================================== Average time to get PetscTime(): 5.96046e-07 Average time for MPI_Barrier(): 1.57356e-06 Average time for zero size MPI_Send(): 2.75373e-05 #PETSc Option Table entries: -da_grid_x 5 -da_grid_y 5 -da_grid_z 5 -da_refine 6 -log_summary -mg_levels_ksp_type richardson -mg_levels_pc_type sor -pc_type mg #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 Configure run at: Fri Feb 24 18:10:35 2012 Configure options: --with-blas-lapack-dir=/opt/intel/mkl/10.1.1.019/lib/64/ --with-mpi-dir=/opt/sgi/mpt/mpt-1.26/ --with-cc=icc --with-fc=ifort --with-debugging=0 ----------------------------------------- Libraries compiled on Fri Feb 24 18:10:35 2012 on kazan Machine characteristics: Linux-2.6.16.60-0.54.5-default-ia64-with-SuSE-10-ia64 Using PETSc directory: /home/fpoulin/soft/petsc-3.2-p6 Using PETSc arch: arch-linux2-c-debug ----------------------------------------- Using C compiler: icc -wd1572 -Qoption,cpp,--extended_float_type -O3 ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: ifort -O3 ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/home/fpoulin/soft/petsc-3.2-p6/arch-linux2-c-debug/include -I/home/fpoulin/soft/petsc-3.2-p6/include -I/home/fpoulin/soft/petsc-3.2-p6/include -I/home/fpoulin/soft/petsc-3.2-p6/arch-linux2-c-debug/include -I/opt/sgi/mpt/mpt-1.26/include ----------------------------------------- Using C linker: icc Using Fortran linker: ifort Using libraries: -Wl,-rpath,/home/fpoulin/soft/petsc-3.2-p6/arch-linux2-c-debug/lib -L/home/fpoulin/soft/petsc-3.2-p6/arch-linux2-c-debug/lib -lpetsc -lpthread -Wl,-rpath,/opt/intel/mkl/10.1.1.019/lib/64 -L/opt/intel/mkl/10.1.1.019/lib/64 -lmkl_lapack -lmkl -lguide -lpthread -Wl,-rpath,/opt/sgi/mpt/mpt-1.26/lib -L/opt/sgi/mpt/mpt-1.26/lib -lfmpich2g -lmpi -lPEPCF90 -ldl -L/opt/intel/Compiler/11.0/074/ipp/ia64/lib -L/opt/intel/Compiler/11.0/074/mkl/lib/64 -L/opt/intel/Compiler/11.0/074/tbb/itanium/cc4.1.0_libc2.4_kernel2.6.16.21/lib -L/opt/intel/Compiler/11.0/074/lib/ia64 -L/home/fpoulin/soft/petsc-3.2-p6/opt/intel/Compiler/11.0/074/mkl/lib/64 -L/usr/lib/gcc/ia64-suse-linux/4.1.2 -L/usr/ia64-suse-linux/lib -limf -lipgo -lirc -lipr -lgcc_s -lirc_s -L/opt/intel/fc/10.1.021/lib -lifport -lifcore -lm -lm -ldl -limf -lipgo -lirc -lipr -lgcc_s -lirc_s -ldl -----------------------------------------