0 SNES Function norm 1.030411923746e+00 0 KSP Residual norm 2.680835659267e+02 1 KSP Residual norm 2.243073242747e-03 2 KSP Residual norm 2.811562261046e-06 3 KSP Residual norm 3.085475519999e-08 1 SNES Function norm 3.234994471908e-05 0 KSP Residual norm 2.088849268118e+01 1 KSP Residual norm 4.119966896731e-08 2 KSP Residual norm 1.188412980322e-10 2 SNES Function norm 2.483839649458e-07 0 KSP Residual norm 1.711387426354e-01 1 KSP Residual norm 3.369265081263e-10 2 KSP Residual norm 7.740323265210e-13 3 SNES Function norm 1.686690741501e-11 SNES Object: 16 MPI processes type: newtonls maximum iterations=50, maximum function evaluations=10000 tolerances: relative=1e-08, absolute=1e-50, solution=1e-08 total number of linear solver iterations=7 total number of function evaluations=4 norm schedule ALWAYS SNESLineSearch Object: 16 MPI processes type: bt interpolation: cubic alpha=1.000000e-04 maxstep=1.000000e+08, minlambda=1.000000e-12 tolerances: relative=1.000000e-08, absolute=1.000000e-15, lambda=1.000000e-08 maximum iterations=40 KSP Object: 16 MPI processes type: gmres GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement GMRES: happy breakdown tolerance 1e-30 maximum iterations=10000, initial guess is zero tolerances: relative=1e-09, absolute=1e-50, divergence=10000 left preconditioning using PRECONDITIONED norm type for convergence test PC Object: 16 MPI processes type: mg MG: type is FULL, levels=2 cycles=v Not using Galerkin computed coarse grid matrices Coarse grid solver -- level ------------------------------- KSP Object: (mg_coarse_) 16 MPI processes type: preonly maximum iterations=1, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_) 16 MPI processes type: redundant Redundant preconditioner: First (color=0) of 16 PCs follows KSP Object: (mg_coarse_redundant_) 1 MPI processes type: preonly maximum iterations=10000, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_redundant_) 1 MPI processes type: lu LU: out-of-place factorization tolerance for zero pivot 2.22045e-14 using diagonal shift on blocks to prevent zero pivot [INBLOCKS] matrix ordering: nd factor fill ratio given 5, needed 13.7197 Factored matrix follows: Mat Object: 1 MPI processes type: seqaij rows=1640961, cols=1640961 package used to perform factorization: petsc total: nonzeros=1.12497e+08, allocated nonzeros=1.12497e+08 total number of mallocs used during MatSetValues calls =0 not using I-node routines linear system matrix = precond matrix: Mat Object: 1 MPI processes type: seqaij rows=1640961, cols=1640961 total: nonzeros=8.19968e+06, allocated nonzeros=8.19968e+06 total number of mallocs used during MatSetValues calls =0 not using I-node routines linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=1640961, cols=1640961 total: nonzeros=8.19968e+06, allocated nonzeros=8.19968e+06 total number of mallocs used during MatSetValues calls =0 Down solver (pre-smoother) on level 1 ------------------------------- KSP Object: (mg_levels_1_) 16 MPI processes type: richardson Richardson: damping factor=1 maximum iterations=2 tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using nonzero initial guess using NONE norm type for convergence test PC Object: (mg_levels_1_) 16 MPI processes type: sor SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1 linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=6558721, cols=6558721 total: nonzeros=3.27834e+07, allocated nonzeros=3.27834e+07 total number of mallocs used during MatSetValues calls =0 Up solver (post-smoother) same as down solver (pre-smoother) linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=6558721, cols=6558721 total: nonzeros=3.27834e+07, allocated nonzeros=3.27834e+07 total number of mallocs used during MatSetValues calls =0 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ./ex5 on a arch-linux2-c-opt named helios91 with 16 processors, by tnicolas Thu Oct 15 14:20:07 2015 Using Petsc Release Version 3.6.0, Jun, 09, 2015 Max Max/Min Avg Total Time (sec): 1.755e+02 1.00014 1.755e+02 Objects: 1.670e+02 1.00000 1.670e+02 Flops: 1.450e+11 1.00001 1.450e+11 2.320e+12 Flops/sec: 8.263e+08 1.00014 8.262e+08 1.322e+10 MPI Messages: 8.020e+02 1.46886 6.769e+02 1.083e+04 MPI Message Lengths: 2.694e+08 1.00312 3.976e+05 4.306e+09 MPI Reductions: 2.670e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 1.7554e+02 100.0% 2.3205e+12 100.0% 1.083e+04 100.0% 3.976e+05 100.0% 2.660e+02 99.6% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage SNESSolve 1 1.0 1.7492e+02 1.0 1.45e+11 1.0 1.1e+04 4.1e+05 2.2e+02100100 97100 83 100100 97100 83 13266 SNESFunctionEval 4 1.0 5.4719e-02 1.2 1.81e+07 1.0 1.9e+02 5.1e+03 0.0e+00 0 0 2 0 0 0 0 2 0 0 5274 SNESJacobianEval 6 1.0 4.4689e-01 1.0 0.00e+00 0.0 2.9e+02 3.8e+03 1.2e+01 0 0 3 0 4 0 0 3 0 5 0 SNESLineSearch 3 1.0 1.1804e-01 1.1 3.82e+07 1.0 2.9e+02 5.1e+03 1.2e+01 0 0 3 0 4 0 0 3 0 5 5167 VecDot 3 1.0 1.8432e-02 1.4 2.47e+06 1.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 2135 VecMDot 7 1.0 4.5608e-02 3.0 9.86e+06 1.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 3 0 0 0 0 3 3451 VecNorm 17 1.0 3.9799e-02 1.7 1.40e+07 1.0 0.0e+00 0.0e+00 1.7e+01 0 0 0 0 6 0 0 0 0 6 5603 VecScale 50 1.0 7.3025e-03 1.2 4.20e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 9150 VecCopy 9 1.0 1.8358e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 122 1.0 1.2891e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 3 1.0 1.5691e-02 3.6 2.47e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2508 VecAYPX 10 1.0 1.7606e-02 1.7 4.11e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3725 VecWAXPY 3 1.0 7.6582e-03 1.3 1.23e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2569 VecMAXPY 10 1.0 3.6096e-02 1.4 1.56e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 6905 VecPointwiseMult 3 1.0 9.2316e-04 1.3 3.09e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5333 VecScatterBegin 155 1.0 1.7556e-01 1.1 0.00e+00 0.0 9.6e+03 4.1e+05 0.0e+00 0 0 89 92 0 0 0 89 92 0 0 VecScatterEnd 155 1.0 4.5882e+00 9.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 VecReduceArith 6 1.0 4.5867e-03 1.2 4.93e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 17159 VecReduceComm 3 1.0 9.5990e-0338.2 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 0 VecNormalize 10 1.0 1.9212e-02 1.5 1.23e+07 1.0 0.0e+00 0.0e+00 1.0e+01 0 0 0 0 4 0 0 0 0 4 10241 MatMult 30 1.0 2.6071e-01 1.4 8.83e+07 1.0 1.3e+03 4.3e+03 0.0e+00 0 0 12 0 0 0 0 12 0 0 5407 MatMultAdd 10 1.0 9.3473e-02 3.2 1.85e+07 1.0 3.3e+02 1.9e+03 0.0e+00 0 0 3 0 0 0 0 3 0 0 3157 MatMultTranspose 24 1.0 1.9701e+0021.0 4.43e+07 1.0 7.9e+02 1.9e+03 0.0e+00 0 0 7 0 0 0 0 7 0 0 359 MatSolve 20 1.0 6.1714e+00 1.0 4.47e+09 1.0 0.0e+00 0.0e+00 0.0e+00 3 3 0 0 0 3 3 0 0 0 11581 MatSOR 20 1.0 1.1135e+00 1.2 2.48e+08 1.0 1.9e+03 5.1e+03 4.0e+01 1 0 18 0 15 1 0 18 0 15 3551 MatLUFactorSym 1 1.0 2.9993e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 MatLUFactorNum 3 1.0 1.6080e+02 1.0 1.40e+11 1.0 0.0e+00 0.0e+00 0.0e+00 90 97 0 0 0 90 97 0 0 0 13939 MatCopy 2 1.0 6.3402e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatConvert 1 1.0 9.6691e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatResidual 10 1.0 1.0360e-01 1.3 4.11e+07 1.0 4.8e+02 5.1e+03 0.0e+00 0 0 4 0 0 0 0 4 0 0 6329 MatAssemblyBegin 10 1.0 7.3902e-02 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 1.8e+01 0 0 0 0 7 0 0 0 0 7 0 MatAssemblyEnd 10 1.0 1.3596e-01 1.0 0.00e+00 0.0 2.6e+02 8.4e+02 2.4e+01 0 0 2 0 9 0 0 2 0 9 0 MatGetRowIJ 1 1.0 6.0591e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetSubMatrice 3 1.0 3.1719e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 3 0 0 0 0 3 0 MatGetOrdering 1 1.0 1.5876e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 MatView 5 1.7 4.7398e-04 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatRedundantMat 3 1.0 4.7453e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 3 0 0 0 0 3 0 KSPGMRESOrthog 7 1.0 6.1690e-02 2.0 1.97e+07 1.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 3 0 0 0 0 3 5103 KSPSetUp 12 1.0 1.4829e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.2e+01 0 0 0 0 4 0 0 0 0 5 0 KSPSolve 3 1.0 1.7442e+02 1.0 1.45e+11 1.0 1.0e+04 4.3e+05 2.0e+02 99100 93100 74 99100 93100 74 13300 PCSetUp 3 1.0 1.6621e+02 1.0 1.40e+11 1.0 1.2e+03 2.9e+05 1.1e+02 93 97 11 8 42 93 97 11 8 42 13486 PCApply 10 1.0 1.2185e+01 1.5 4.83e+09 1.0 8.5e+03 4.6e+05 4.0e+01 6 3 79 92 15 6 3 79 92 15 6335 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage SNES 1 1 1332 0 SNESLineSearch 1 1 864 0 DMSNES 3 3 2072 0 Vector 78 78 217893048 0 Vector Scatter 10 10 11530944 0 Matrix 13 13 1680657104 0 Distributed Mesh 5 5 24416 0 Star Forest Bipartite Graph 10 10 8448 0 Discrete System 5 5 4240 0 Index Set 25 25 30882964 0 IS L to G Mapping 4 4 4129280 0 Krylov Solver 4 4 22016 0 DMKSP interface 2 2 1296 0 Preconditioner 4 4 3944 0 Viewer 2 1 760 0 ======================================================================================================================== Average time to get PetscTime(): 0 Average time for MPI_Barrier(): 1.81198e-06 Average time for zero size MPI_Send(): 5.55813e-06 #PETSc Option Table entries: -da_grid_x 21 -da_grid_y 21 -da_refine 7 -ksp_monitor -ksp_rtol 1e-9 -log_summary -mg_levels_ksp_type richardson -pc_mg_levels 2 -pc_mg_type full -pc_type mg -snes_monitor -snes_view #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4 Configure options: --prefix=/csc/softs/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real --with-debugging=0 --with-x=0 --with-cc=mpicc --with-fc=mpif90 --with-cxx=mpicxx --with-fortran --known-mpi-shared-libraries=1 --with-scalar-type=real --with-precision=double --CFLAGS="-g -O3 -mavx -mkl" --CXXFLAGS="-g -O3 -mavx -mkl" --FFLAGS="-g -O3 -mavx -mkl" ----------------------------------------- Libraries compiled on Mon Sep 28 20:22:47 2015 on helios85 Machine characteristics: Linux-2.6.32-573.1.1.el6.Bull.80.x86_64-x86_64-with-redhat-6.4-Santiago Using PETSc directory: /csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0 Using PETSc arch: arch-linux2-c-opt ----------------------------------------- Using C compiler: mpicc -g -O3 -mavx -mkl -fPIC ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: mpif90 -g -O3 -mavx -mkl -fPIC ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/include -I/opt/mpi/bullxmpi/1.2.8.2/include ----------------------------------------- Using C linker: mpicc Using Fortran linker: mpif90 Using libraries: -Wl,-rpath,/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/lib -L/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/lib -lpetsc -lhwloc -lxml2 -lssl -lcrypto -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -lmpi_f90 -lmpi_f77 -lm -lifport -lifcore -lm -lmpi_cxx -ldl -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -lmpi -lnuma -lrt -lnsl -lutil -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -limf -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -lsvml -lirng -lipgo -ldecimal -lcilkrts -lstdc++ -lgcc_s -lirc -lpthread -lirc_s -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -ldl -----------------------------------------