0 KSP unpreconditioned resid norm 1.065032289254e+02 true resid norm 1.065032289254e+02 ||r(i)||/||b|| 1.000000000000e+00 1 KSP unpreconditioned resid norm 1.949685988175e+01 true resid norm 1.949685988175e+01 ||r(i)||/||b|| 1.830635566496e-01 2 KSP unpreconditioned resid norm 1.016426068648e+00 true resid norm 1.009355385792e+00 ||r(i)||/||b|| 9.477228023751e-03 3 KSP unpreconditioned resid norm 1.022877169297e-01 true resid norm 1.065926359951e-01 ||r(i)||/||b|| 1.000839477550e-03 4 KSP unpreconditioned resid norm 3.622030309251e-03 true resid norm 2.184543702448e-02 ||r(i)||/||b|| 2.051152556115e-04 5 KSP unpreconditioned resid norm 2.641360660213e-04 true resid norm 2.069673481279e-02 ||r(i)||/||b|| 1.943296463554e-04 Linear solve converged due to CONVERGED_RTOL iterations 5 KSP Object: 2 MPI processes type: gmres GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement GMRES: happy breakdown tolerance 1e-30 maximum iterations=100, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 right preconditioning using UNPRECONDITIONED norm type for convergence test PC Object: 2 MPI processes type: asm Additive Schwarz: total subdomain blocks = 2, amount of overlap = 1 Additive Schwarz: restriction/interpolation type - RESTRICT Local solve is same for all blocks, in the following KSP and PC objects: KSP Object: (sub_) 1 MPI processes type: gmres GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement GMRES: happy breakdown tolerance 1e-30 maximum iterations=1000, initial guess is zero tolerances: relative=0.001, absolute=1e-30, divergence=10000 left preconditioning using PRECONDITIONED norm type for convergence test PC Object: (sub_) 1 MPI processes type: ilu ILU: out-of-place factorization 0 levels of fill tolerance for zero pivot 2.22045e-14 using diagonal shift to prevent zero pivot matrix ordering: natural factor fill ratio given 1.9, needed 1 Factored matrix follows: Matrix Object: 1 MPI processes type: seqaij rows=43875, cols=43875 package used to perform factorization: petsc total: nonzeros=36905625, allocated nonzeros=36905625 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 8775 nodes, limit used is 5 linear system matrix = precond matrix: Matrix Object: 1 MPI processes type: seqaij rows=43875, cols=43875 total: nonzeros=36905625, allocated nonzeros=36905625 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 8775 nodes, limit used is 5 linear system matrix = precond matrix: Matrix Object: 2 MPI processes type: mpiaij rows=64800, cols=64800 total: nonzeros=57736800, allocated nonzeros=57736800 total number of mallocs used during MatSetValues calls =0 using I-node (on process 0) routines: found 6480 nodes, limit used is 5 PetscSolve converged by 2 its=5 error = 2.069673e-02 Cpu of petsc solve=1.355602312088013e+01 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /users/stoneszone/hpMusic_opt on a linux-gnu-c-opt named n361 with 2 processors, by stoneszone Thu Jun 25 15:13:56 2015 Using Petsc Release Version 3.3.0, Patch 3, Wed Aug 29 11:26:24 CDT 2012 Max Max/Min Avg Total Time (sec): 3.086e+01 1.00007 3.086e+01 Objects: 7.400e+01 1.00000 7.400e+01 Flops: 2.351e+10 1.03840 2.307e+10 4.615e+10 Flops/sec: 7.619e+08 1.03848 7.478e+08 1.496e+09 MPI Messages: 3.700e+01 1.00000 3.700e+01 7.400e+01 MPI Message Lengths: 1.215e+08 1.00000 3.284e+06 2.430e+08 MPI Reductions: 1.000e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 3.0856e+01 100.0% 4.6147e+10 100.0% 7.400e+01 100.0% 3.284e+06 100.0% 9.900e+01 99.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %f - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage MatMult 78 1.1 3.2893e+00 1.1 5.53e+09 1.1 2.4e+01 8.9e+04 0.0e+00 10 23 32 1 0 10 23 32 1 0 3192 MatSolve 77 1.1 3.2892e+00 1.1 5.64e+09 1.1 0.0e+00 0.0e+00 0.0e+00 10 23 0 0 0 10 23 0 0 0 3263 MatLUFactorNum 1 1.0 5.4255e+00 1.0 1.25e+10 1.0 0.0e+00 0.0e+00 0.0e+00 17 54 0 0 0 17 54 0 0 0 4571 MatILUFactorSym 1 1.0 1.4881e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatAssemblyBegin 3 1.0 1.1272e-011136.5 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 4 0 0 0 0 4 0 MatAssemblyEnd 3 1.0 1.1539e-01 1.0 0.00e+00 0.0 4.0e+00 2.2e+04 8.0e+00 0 0 5 0 8 0 0 5 0 8 0 MatGetRowIJ 1 1.0 2.1458e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetSubMatrice 1 1.0 1.5943e+00 1.1 0.00e+00 0.0 1.0e+01 2.4e+07 7.0e+00 5 0 14 98 7 5 0 14 98 7 0 MatGetOrdering 1 1.0 6.2704e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 4 0 0 0 0 4 0 MatIncreaseOvrlp 1 1.0 9.2284e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 2 0 0 0 0 2 0 MatView 3 3.0 9.7752e-05 2.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 1 0 VecMDot 71 1.1 7.6318e-03 1.2 2.09e+07 1.2 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 5 0 0 0 0 5 4974 VecNorm 91 1.1 5.5225e-0214.1 7.56e+06 1.1 0.0e+00 0.0e+00 1.4e+01 0 0 0 0 14 0 0 0 0 14 263 VecScale 83 1.1 2.1131e-03 1.1 3.52e+06 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3191 VecCopy 25 1.0 9.8109e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 59 1.0 3.4337e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 18 1.0 7.4458e-04 1.0 1.42e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3791 VecAYPX 6 1.0 2.4581e-04 1.0 1.94e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1582 VecMAXPY 88 1.1 9.5332e-03 1.2 2.79e+07 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5386 VecScatterBegin 34 1.0 1.5125e-03 1.0 0.00e+00 0.0 4.6e+01 8.9e+04 0.0e+00 0 0 62 2 0 0 0 62 2 0 0 VecScatterEnd 34 1.0 5.6801e-0131.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecNormalize 83 1.1 5.6778e-0210.0 1.06e+07 1.1 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 6 0 0 0 0 6 356 KSPGMRESOrthog 71 1.1 1.4941e-02 1.2 4.19e+07 1.2 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 5 0 0 0 0 5 5081 KSPSetUp 2 1.0 1.4548e-03 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 7 0 0 0 0 7 0 KSPSolve 1 1.0 1.3514e+01 1.0 2.35e+10 1.0 5.8e+01 4.2e+06 6.0e+01 44100 78100 60 44100 78100 61 3406 PCSetUp 2 1.0 7.1735e+00 1.0 1.25e+10 1.0 1.4e+01 1.7e+07 2.5e+01 23 54 19 98 25 23 54 19 98 25 3457 PCSetUpOnBlocks 1 1.0 5.5756e+00 1.0 1.25e+10 1.0 0.0e+00 0.0e+00 1.2e+01 18 54 0 0 12 18 54 0 0 12 4448 PCApply 11 1.0 5.9663e+00 1.1 1.05e+10 1.1 2.2e+01 8.9e+04 1.0e+01 18 43 30 1 10 18 43 30 1 10 3344 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Matrix 5 5 1234655228 0 Vector 51 51 14520080 0 Vector Scatter 2 2 2072 0 Index Set 10 10 614916 0 Krylov Solver 2 2 36816 0 Preconditioner 2 2 1824 0 Viewer 2 1 712 0 ======================================================================================================================== Average time to get PetscTime(): 0 Average time for MPI_Barrier(): 1.38283e-06 Average time for zero size MPI_Send(): 3.45707e-06 #PETSc Option Table entries: -ksp_atol 1e-50 -ksp_converged_reason -ksp_gmres_restart 30 -ksp_lgmres_augment 10 -ksp_max_it 100 -ksp_monitor_true_residual -ksp_pc_side right -ksp_rtol 1e-5 -ksp_type gmres -ksp_view -log_summary -pc_type asm -sub_ksp_atol 1e-30 -sub_ksp_max_it 1000 -sub_ksp_rtol 0.001 -sub_ksp_type gmres -sub_pc_factor_fill 1.9 -sub_pc_factor_levels 0 -sub_pc_type ilu #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4 Configure run at: Fri Sep 21 15:34:01 2012 Configure options: --download-f2cblaslapack=1 --download-mpicc=1 --download-mpich=1 --with-debugging=0 --with-cc=gcc --with-cxx=g++ --with-fc=0 --with-x=0 ----------------------------------------- Libraries compiled on Fri Sep 21 15:34:01 2012 on 3165CLinux1 Machine characteristics: Linux-3.2.0-3-amd64-x86_64-with-debian-wheezy-sid Using PETSc directory: /home/czhou/usr/petsc-3.3-p3-opt Using PETSc arch: linux-gnu-c-opt ----------------------------------------- Using C compiler: /home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/bin/mpicc -Wall -Wwrite-strings -Wno-strict-aliasing -Wno-unknown-pragmas -O ${COPTFLAGS} ${CFLAGS} ----------------------------------------- Using include paths: -I/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/include -I/home/czhou/usr/petsc-3.3-p3-opt/include -I/home/czhou/usr/petsc-3.3-p3-opt/include -I/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/include ----------------------------------------- Using C linker: /home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/bin/mpicc Using libraries: -Wl,-rpath,/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/lib -L/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/lib -lpetsc -lpthread -Wl,-rpath,/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/lib -L/home/czhou/usr/petsc-3.3-p3-opt/linux-gnu-c-opt/lib -lf2clapack -lf2cblas -lm -lm -ldl -----------------------------------------