0 KSP preconditioned resid norm 1.696540029979e+06 true resid norm 2.242813827253e+12 ||r(i)||/||b|| 1.000000000000e+00 1 KSP preconditioned resid norm 2.075079768083e+00 true resid norm 1.882606961535e+04 ||r(i)||/||b|| 8.393951110251e-09 Linear solve converged due to CONVERGED_RTOL iterations 1 KSP Object: 1 MPI processes type: gmres restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement happy breakdown tolerance 1e-30 maximum iterations=10000, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning using PRECONDITIONED norm type for convergence test PC Object: 1 MPI processes type: asm total subdomain blocks = 1, amount of overlap = 3 restriction/interpolation type - RESTRICT Local solve is same for all blocks, in the following KSP and PC objects: KSP Object: (sub_) 1 MPI processes type: preonly maximum iterations=10000, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000. left preconditioning using NONE norm type for convergence test PC Object: (sub_) 1 MPI processes type: lu out-of-place factorization tolerance for zero pivot 2.22045e-14 matrix ordering: amd factor fill ratio given 5., needed 1.02897 Factored matrix follows: Mat Object: 1 MPI processes type: seqaij rows=320745, cols=320745 package used to perform factorization: petsc total: nonzeros=1984497, allocated nonzeros=1984497 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 185662 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: 1 MPI processes type: seqaij rows=320745, cols=320745 total: nonzeros=1928617, allocated nonzeros=1928617 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 186383 nodes, limit used is 5 linear system matrix = precond matrix: Mat Object: 1 MPI processes type: seqaij rows=320745, cols=320745 total: nonzeros=1928617, allocated nonzeros=1928617 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 186383 nodes, limit used is 5 Time: 3.118e-01 seconds ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /lustre/eaglefs/projects/dss/./main on a arch-intel-complex-opt named r7i7n29 with 1 processor, by jchang Wed Feb 6 13:45:30 2019 Using Petsc Development GIT revision: v3.10.3-1312-g058c394 GIT Date: 2019-01-23 16:37:18 -0600 Max Max/Min Avg Total Time (sec): 3.757e-01 1.000 3.757e-01 Objects: 4.100e+01 1.000 4.100e+01 Flop: 1.285e+08 1.000 1.285e+08 1.285e+08 Flop/sec: 3.419e+08 1.000 3.419e+08 3.419e+08 MPI Messages: 0.000e+00 0.000 0.000e+00 0.000e+00 MPI Message Lengths: 0.000e+00 0.000 0.000e+00 0.000e+00 MPI Reductions: 0.000e+00 0.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flop and VecAXPY() for complex vectors of length N --> 8N flop Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total 0: Main Stage: 3.7572e-01 100.0% 1.2847e+08 100.0% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flop: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent AvgLen: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage ---- Total Max Ratio Max Ratio Max Ratio Mess AvgLen Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage MatMult 3 1.0 2.9458e-02 1.0 4.24e+07 1.0 0.0e+00 0.0e+00 0.0e+00 8 33 0 0 0 8 33 0 0 0 1441 MatSolve 2 1.0 2.5982e-02 1.0 2.92e+07 1.0 0.0e+00 0.0e+00 0.0e+00 7 23 0 0 0 7 23 0 0 0 1123 MatLUFactorSym 1 1.0 3.7406e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 10 0 0 0 0 10 0 0 0 0 0 MatLUFactorNum 1 1.0 6.7826e-02 1.0 2.35e+07 1.0 0.0e+00 0.0e+00 0.0e+00 18 18 0 0 0 18 18 0 0 0 346 MatAssemblyBegin 2 1.0 2.1458e-06 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 2 1.0 1.0680e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 3 0 0 0 0 3 0 0 0 0 0 MatGetRowIJ 1 1.0 9.5060e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 3 0 0 0 0 3 0 0 0 0 0 MatCreateSubMats 1 1.0 3.2060e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 9 0 0 0 0 9 0 0 0 0 0 MatGetOrdering 1 1.0 4.0774e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 11 0 0 0 0 11 0 0 0 0 0 MatIncreaseOvrlp 1 1.0 6.4042e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 MatLoad 1 1.0 5.0304e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 13 0 0 0 0 13 0 0 0 0 0 MatView 3 1.0 7.1049e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecMDot 1 1.0 6.6113e-04 1.0 2.57e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 3881 VecNorm 5 1.0 1.5202e-03 1.0 1.28e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 10 0 0 0 0 10 0 0 0 8440 VecScale 2 1.0 2.7363e-02 1.0 2.57e+06 1.0 0.0e+00 0.0e+00 0.0e+00 7 2 0 0 0 7 2 0 0 0 94 VecCopy 3 1.0 2.2202e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecSet 25 1.0 2.7504e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 7 0 0 0 0 7 0 0 0 0 0 VecAXPY 2 1.0 1.0650e-03 1.0 5.13e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 4 0 0 0 0 4 0 0 0 4819 VecAYPX 2 1.0 1.2941e-03 1.0 2.57e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 1983 VecMAXPY 3 1.0 1.5440e-03 1.0 7.70e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 6 0 0 0 0 6 0 0 0 4986 VecAssemblyBegin 1 1.0 9.5367e-07 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAssemblyEnd 1 1.0 0.0000e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecLoad 1 1.0 8.4050e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 VecScatterBegin 8 1.0 5.9762e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 VecNormalize 2 1.0 2.7950e-02 1.0 7.70e+06 1.0 0.0e+00 0.0e+00 0.0e+00 7 6 0 0 0 7 6 0 0 0 275 KSPSetUp 2 1.0 7.5660e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 KSPSolve 1 1.0 3.1165e-01 1.0 1.28e+08 1.0 0.0e+00 0.0e+00 0.0e+00 83100 0 0 0 83100 0 0 0 412 KSPGMRESOrthog 1 1.0 1.2100e-03 1.0 5.13e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 4 0 0 0 0 4 0 0 0 4241 PCSetUp 2 1.0 1.9671e-01 1.0 2.35e+07 1.0 0.0e+00 0.0e+00 0.0e+00 52 18 0 0 0 52 18 0 0 0 119 PCSetUpOnBlocks 1 1.0 1.4610e-01 1.0 2.35e+07 1.0 0.0e+00 0.0e+00 0.0e+00 39 18 0 0 0 39 18 0 0 0 161 PCApply 2 1.0 3.6120e-02 1.0 2.92e+07 1.0 0.0e+00 0.0e+00 0.0e+00 10 23 0 0 0 10 23 0 0 0 808 PCApplyOnBlocks 2 1.0 2.7388e-02 1.0 2.92e+07 1.0 0.0e+00 0.0e+00 0.0e+00 7 23 0 0 0 7 23 0 0 0 1066 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Viewer 4 3 2520 0. Matrix 3 3 129674124 0. Vector 16 16 82135296 0. Krylov Solver 2 2 36864 0. Preconditioner 2 2 1984 0. Index Set 11 11 9197656 0. IS L to G Mapping 1 1 2566632 0. Vec Scatter 2 2 1408 0. ======================================================================================================================== Average time to get PetscTime(): 0. #PETSc Option Table entries: -A DS3_urbansuburban.matrix -b DS3_urbansuburban.vector -ksp_converged_reason -ksp_monitor_true_residual -ksp_type gmres -ksp_view -log_view -pc_asm_overlap 3 -pc_type asm -sub_pc_factor_mat_ordering_type amd -sub_pc_type lu #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 16 sizeof(PetscInt) 4 Configure options: --COPTFLAGS="-g -xCORE-AVX512 -O3" --CXXOPTFLAGS="-g -xCORE-AVX512 -O3" --FOPTFLAGS="-g -xCORE-AVX512 -O3" --download-hwloc=1 --download-metis --download-mumps --download-parmetis --download-scalapack --download-suitesparse --download-zlib --with-avx512-kernels=1 --with-blaslapack-dir=/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/mkl --with-cc=mpiicc --with-cxx=mpiicpc --with-debugging=0 --with-fc=mpiifort --with-mpiexec=srun --with-openmp=1 --with-scalar-type=complex --with-shared-libraries=1 PETSC_ARCH=arch-intel-complex-opt ----------------------------------------- Libraries compiled on 2019-02-01 22:32:57 on el1 Machine characteristics: Linux-3.10.0-693.el7.x86_64-x86_64-with-centos-7.4.1708-Core Using PETSc directory: /lustre/eaglefs/projects/dss/petsc-dev Using PETSc arch: arch-intel-complex-opt ----------------------------------------- Using C compiler: mpiicc -fPIC -wd1572 -g -xCORE-AVX512 -O3 -fopenmp Using Fortran compiler: mpiifort -fPIC -g -xCORE-AVX512 -O3 -fopenmp ----------------------------------------- Using include paths: -I/lustre/eaglefs/projects/dss/petsc-dev/include -I/lustre/eaglefs/projects/dss/petsc-dev/arch-intel-complex-opt/include ----------------------------------------- Using C linker: mpiicc Using Fortran linker: mpiifort Using libraries: -Wl,-rpath,/lustre/eaglefs/projects/dss/petsc-dev/arch-intel-complex-opt/lib -L/lustre/eaglefs/projects/dss/petsc-dev/arch-intel-complex-opt/lib -lpetsc -Wl,-rpath,/lustre/eaglefs/projects/dss/petsc-dev/arch-intel-complex-opt/lib -L/lustre/eaglefs/projects/dss/petsc-dev/arch-intel-complex-opt/lib -Wl,-rpath,/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/mkl/lib/intel64 -L/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/mkl/lib/intel64 -Wl,-rpath,/nopt/nrel/apps/base/2018-12-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mpi-2018.3.222-6hbmyhwcn27yjvb6og6iypamd6hb3tb4/compilers_and_libraries_2018.3.222/linux/mpi/intel64/lib/debug_mt -L/nopt/nrel/apps/base/2018-12-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mpi-2018.3.222-6hbmyhwcn27yjvb6og6iypamd6hb3tb4/compilers_and_libraries_2018.3.222/linux/mpi/intel64/lib/debug_mt -Wl,-rpath,/nopt/nrel/apps/base/2018-12-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mpi-2018.3.222-6hbmyhwcn27yjvb6og6iypamd6hb3tb4/compilers_and_libraries_2018.3.222/linux/mpi/intel64/lib -L/nopt/nrel/apps/base/2018-12-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mpi-2018.3.222-6hbmyhwcn27yjvb6og6iypamd6hb3tb4/compilers_and_libraries_2018.3.222/linux/mpi/intel64/lib -Wl,-rpath,/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/compiler/lib/intel64_lin -L/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/compiler/lib/intel64_lin -Wl,-rpath,/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/mkl/lib/intel64_lin -L/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/compilers_and_libraries_2018.3.222/linux/mkl/lib/intel64_lin -Wl,-rpath,/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/lib -L/nopt/nrel/apps/base/2019-01-02/spack/opt/spack/linux-centos7-x86_64/intel-18.0.3/intel-mkl-2018.3.222-dzfj7xvn6uy7tqmmgzwfcjkucomyxkui/lib -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/ipp/lib/intel64 -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/ipp/lib/intel64 -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/tbb/lib/intel64/gcc4.7 -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/tbb/lib/intel64/gcc4.7 -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/daal/lib/intel64_lin -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/daal/lib/intel64_lin -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/tbb/lib/intel64_lin/gcc4.4 -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/tbb/lib/intel64_lin/gcc4.4 -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/lib -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/lib -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib64 -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib64 -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/compiler/lib/intel64_lin -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/intel-parallel-studio-cluster.2018.3-6wq2vvslzhamadvc66fecse5bgcdhjzt/compilers_and_libraries_2018.3.222/linux/compiler/lib/intel64_lin -Wl,-rpath,/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib/gcc/x86_64-pc-linux-gnu/7.3.0 -L/nopt/nrel/apps/compilers/spack/opt/spack/linux-centos7-x86_64/gcc-4.8.5/gcc-7.3.0-vydnujncq3lpwhhnxmauinsqxkhxy4gn/lib/gcc/x86_64-pc-linux-gnu/7.3.0 -Wl,-rpath,/opt/intel/mpi-rt/2017.0.0/intel64/lib/debug_mt -Wl,-rpath,/opt/intel/mpi-rt/2017.0.0/intel64/lib -lcmumps -ldmumps -lsmumps -lzmumps -lmumps_common -lpord -lscalapack -lumfpack -lklu -lcholmod -lbtf -lccolamd -lcolamd -lcamd -lamd -lsuitesparseconfig -lmkl_intel_lp64 -lmkl_sequential -lmkl_core -lpthread -lparmetis -lmetis -lz -lX11 -lhwloc -lstdc++ -ldl -lmpifort -lmpi -lmpigi -lrt -lpthread -lifport -lifcoremt_pic -limf -lsvml -lm -lipgo -lirc -lgcc_s -lirc_s -lstdc++ -ldl -----------------------------------------