Number of iterations = 5 Residual norm 0.0724 Setup time: 1.1593278885e+01 Solve time: 1.8146839142e+00 Total: 1.3407962799e+01 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ./ex10 on a intel-opt-precise-O3 named lagrange.tomato with 1 processor, by jfe Fri Aug 17 11:47:59 2012 Using Petsc Development HG revision: f9c6cac2d69c724a2258d4e0dc2f51b0825aa875 HG Date: Thu Aug 16 08:37:18 2012 -0700 Max Max/Min Avg Total Time (sec): 1.360e+01 1.00000 1.360e+01 Objects: 1.790e+02 1.00000 1.790e+02 Flops: 2.270e+09 1.00000 2.270e+09 2.270e+09 Flops/sec: 1.669e+08 1.00000 1.669e+08 1.669e+08 MPI Messages: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Message Lengths: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Reductions: 2.010e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 5.3024e-04 0.0% 0.0000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 1: Load system: 1.5889e-01 1.2% 0.0000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 8.000e+00 4.0% 2: KSPSetUpSolve: 1.3441e+01 98.8% 2.2704e+09 100.0% 0.000e+00 0.0% 0.000e+00 0.0% 1.920e+02 95.5% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %f - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage PetscBarrier 1 1.0 2.1458e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 --- Event Stage 1: Load system MatAssemblyBegin 1 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 1 1.0 2.9479e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 19 0 0 0 0 0 MatLoad 1 1.0 1.4315e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 1 0 0 0 1 90 0 0 0 25 0 VecSet 6 1.0 8.7931e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 6 0 0 0 0 0 VecAssemblyBegin 2 1.0 9.5367e-07 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAssemblyEnd 2 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecLoad 2 1.0 7.6361e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 5 0 0 0 12 0 --- Event Stage 2: KSPSetUpSolve MatMult 156 1.0 1.8240e+00 1.0 1.37e+09 1.0 0.0e+00 0.0e+00 0.0e+00 13 60 0 0 0 14 60 0 0 0 749 MatMultAdd 18 1.0 2.9168e-02 1.0 9.16e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 314 MatMultTranspose 18 1.0 2.2335e-02 1.0 9.16e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 410 MatSolve 12 1.0 7.6771e-05 1.0 4.92e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 641 MatLUFactorSym 1 1.0 7.9870e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 2 0 MatLUFactorNum 1 1.0 6.8903e-05 1.0 1.20e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 174 MatConvert 3 1.0 4.7931e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatScale 3 1.0 4.8071e-02 1.0 2.45e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 510 MatAssemblyBegin 27 1.0 6.9141e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 27 1.0 2.1622e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 MatGetRow 1526152 1.0 1.0088e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 MatGetRowIJ 1 1.0 2.0027e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetOrdering 1 1.0 1.2207e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatCoarsen 3 1.0 9.2258e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.0e+00 1 0 0 0 4 1 0 0 0 5 0 MatPtAP 3 1.0 4.2210e-01 1.0 5.12e+07 1.0 0.0e+00 0.0e+00 1.8e+01 3 2 0 0 9 3 2 0 0 9 121 MatPtAPSymbolic 3 1.0 2.6616e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.8e+01 2 0 0 0 9 2 0 0 0 9 0 MatPtAPNumeric 3 1.0 1.5593e-01 1.0 5.12e+07 1.0 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 1 2 0 0 0 328 MatMatTrnMultSym 3 1.0 2.4065e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.2e+01 18 0 0 0 6 18 0 0 0 6 0 MatMatTrnMultNum 3 1.0 6.7791e+00 1.0 3.19e+08 1.0 0.0e+00 0.0e+00 0.0e+00 50 14 0 0 0 50 14 0 0 0 47 MatGetSymTrans 3 1.0 1.6632e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecMDot 41 1.0 4.5545e-02 1.0 1.06e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 5 0 0 0 0 5 0 0 0 2322 VecNorm 60 1.0 1.0339e-02 1.0 3.43e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 3313 VecScale 123 1.0 1.8651e-02 1.0 3.11e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 1666 VecCopy 34 1.0 1.6087e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 193 1.0 5.4778e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 160 1.0 5.1041e-02 1.0 8.50e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 4 0 0 0 0 4 0 0 0 1665 VecAYPX 150 1.0 7.4659e-02 1.0 5.02e+07 1.0 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 1 2 0 0 0 672 VecMAXPY 56 1.0 8.7279e-02 1.0 1.50e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 7 0 0 0 1 7 0 0 0 1720 VecAssemblyBegin 3 1.0 2.3842e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAssemblyEnd 3 1.0 9.5367e-07 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecPointwiseMult 141 1.0 8.4999e-02 1.0 3.59e+07 1.0 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 1 2 0 0 0 422 VecSetRandom 3 1.0 1.1648e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 51 1.0 1.3630e-02 1.0 3.83e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 2809 KSPGMRESOrthog 41 1.0 1.0680e-01 1.0 2.12e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 9 0 0 0 1 9 0 0 0 1981 KSPSetUp 9 1.0 1.6894e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.4e+01 0 0 0 0 22 0 0 0 0 23 0 KSPSolve 1 1.0 1.8147e+00 1.0 1.39e+09 1.0 0.0e+00 0.0e+00 1.5e+01 13 61 0 0 7 14 61 0 0 8 767 PCSetUp 2 1.0 1.1584e+01 1.0 8.50e+08 1.0 0.0e+00 0.0e+00 1.8e+02 85 37 0 0 87 86 37 0 0 91 73 PCSetUpOnBlocks 6 1.0 3.5286e-04 1.0 1.20e+04 1.0 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 2 0 0 0 0 3 34 PCApply 6 1.0 1.3992e+00 1.0 1.04e+09 1.0 0.0e+00 0.0e+00 5.0e+00 10 46 0 0 2 10 46 0 0 3 743 PCGAMGgraph_AGG 3 1.0 1.0654e+00 1.0 2.45e+07 1.0 0.0e+00 0.0e+00 1.2e+01 8 1 0 0 6 8 1 0 0 6 23 PCGAMGcoarse_AGG 3 1.0 9.3799e+00 1.0 3.19e+08 1.0 0.0e+00 0.0e+00 2.4e+01 69 14 0 0 12 70 14 0 0 12 34 PCGAMGProl_AGG 3 1.0 1.8540e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.2e+01 1 0 0 0 6 1 0 0 0 6 0 PCGAMGPOpt_AGG 3 1.0 9.0599e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Viewer 1 0 0 0 --- Event Stage 1: Load system Viewer 1 1 728 0 Matrix 1 0 0 0 Vector 6 0 0 0 --- Event Stage 2: KSPSetUpSolve Container 3 3 1668 0 Matrix 25 26 501725708 0 Matrix Coarsen 3 3 1860 0 Vector 109 115 287547640 0 Krylov Solver 9 9 131888 0 Preconditioner 9 9 8836 0 Index Set 9 9 8008 0 PetscRandom 3 3 1848 0 ======================================================================================================================== Average time to get PetscTime(): 0 #PETSc Option Table entries: -Pressure_ksp_type preonly -Pressure_pc_factor_mat_solver_package mumps -Pressure_pc_type lu -f0 Pressure__3_19_0.mtx -ksp_type gmres -log_summary -pc_type gamg #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4 Configure run at: Fri Aug 17 09:14:50 2012 Configure options: --with-x=0 --download-f-blas-lapack=0 --with-blas-lapack-dir=/opt/intel/Compiler/11.1/072/mkl/lib/em64t --with-mpi=1 --with-mpi-shared=1 --with-mpi=1 --download-mpich=no --with-debugging=0 --with-gnu-compilers=no --with-vendor-compilers=intel --with-cc=/usr/local/encap/platform_mpi-8.01/bin/mpicc --with-cxx=/usr/local/encap/platform_mpi-8.01/bin/mpiCC --with-fc=/usr/local/encap/platform_mpi-8.01/bin/mpif90 --with-shared-libraries=1 --with-c++-support --with-clanguage=C --COPTFLAGS="-fPIC -O3 -xSSE4.2 -fp-model precise -g -debug inline_debug_info" --CXXOPTFLAGS="-fPIC -O3 -xSSE4.2 -fp-model precise -g -debug inline_debug_info" --FOPTFLAGS="-fPIC -O3 -xSSE4.2 -fp-model precise -g -debug inline_debug_info" --download-scalapack=1 --download-blacs=1 --with-blacs=1 --download-umfpack=1 --download-parmetis=1 --download-metis=1 --download-superlu=1 --download-superlu_dist=1 --download-mumps=1 --download-ml=1 --download-hypre=1 ----------------------------------------- Libraries compiled on Fri Aug 17 09:14:50 2012 on lagrange.tomato Machine characteristics: Linux-2.6.32-279.2.1.el6.x86_64-x86_64-with-centos-6.3-Final Using PETSc directory: /home/jfe/local/petsc-dev Using PETSc arch: intel-opt-precise-O3 ----------------------------------------- Using C compiler: /usr/local/encap/platform_mpi-8.01/bin/mpicc -fPIC -wd1572 -fPIC -O3 -xSSE4.2 -fp-model precise -g -debug inline_debug_info ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: /usr/local/encap/platform_mpi-8.01/bin/mpif90 -fPIC -fPIC -O3 -xSSE4.2 -fp-model precise -g -debug inline_debug_info ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/home/jfe/local/petsc-dev/intel-opt-precise-O3/include -I/home/jfe/local/petsc-dev/include -I/home/jfe/local/petsc-dev/include -I/home/jfe/local/petsc-dev/intel-opt-precise-O3/include -I/usr/local/encap/platform_mpi-8.01/include ----------------------------------------- Using C linker: /usr/local/encap/platform_mpi-8.01/bin/mpicc Using Fortran linker: /usr/local/encap/platform_mpi-8.01/bin/mpif90 Using libraries: -Wl,-rpath,/home/jfe/local/petsc-dev/intel-opt-precise-O3/lib -L/home/jfe/local/petsc-dev/intel-opt-precise-O3/lib -lpetsc -Wl,-rpath,/home/jfe/local/petsc-dev/intel-opt-precise-O3/lib -L/home/jfe/local/petsc-dev/intel-opt-precise-O3/lib -lcmumps -ldmumps -lsmumps -lzmumps -lmumps_common -lpord -lscalapack -lblacs -lml -Wl,-rpath,/usr/local/encap/platform_mpi-8.01/lib/linux_amd64 -L/usr/local/encap/platform_mpi-8.01/lib/linux_amd64 -lmpiCC -Wl,-rpath,/opt/intel/Compiler/11.1/072/lib/intel64 -L/opt/intel/Compiler/11.1/072/lib/intel64 -Wl,-rpath,/opt/intel/Compiler/11.1/072/ipp/em64t/lib -L/opt/intel/Compiler/11.1/072/ipp/em64t/lib -Wl,-rpath,/opt/intel/Compiler/11.1/072/mkl/lib/em64t -L/opt/intel/Compiler/11.1/072/mkl/lib/em64t -Wl,-rpath,/opt/intel/Compiler/11.1/072/tbb/intel64/cc4.1.0_libc2.4_kernel2.6.16.21/lib -L/opt/intel/Compiler/11.1/072/tbb/intel64/cc4.1.0_libc2.4_kernel2.6.16.21/lib -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -lstdc++ -lsuperlu_dist_3.0 -lparmetis -lmetis -lpthread -lsuperlu_4.3 -lHYPRE -lmpiCC -lstdc++ -lumfpack -lamd -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lifport -lifcore -lm -lpthread -lm -lmpiCC -lstdc++ -lmpiCC -lstdc++ -lpcmpio -lpcmpi -ldl -limf -lsvml -lipgo -ldecimal -lgcc_s -lirc -lirc_s -ldl -----------------------------------------