ModuleCmd_Switch.c(179):ERROR:152: Module 'PrgEnv-intel/6.0.3' is currently not loaded
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 45
|X|_2 3996.69   -7.13274e-12 <= u <= 24.479   -3.10124 <= v <= 3.10124   8.69993e-15 <= c <= 24.479
Surface statistics: u in [1.214710e+01, 2.447899e+01] mean 2.010957e+01
Global eta range 2.96251e+10 to 9.2273e+12 converged range 2.96251e+10 to 2.44973e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 5.029e+00 seconds
Degrees-of-freedom: 115200
FLOPS: 1.079e+10
L1 misses: 1.470e+09
Intensity: 7.342e+00
Rate: 2.291e+04
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/global/cscratch1/sd/jychang/Icesheet/./ex48cori on a arch-cori-c-opt named nid00359 with 32 processors, by jychang Tue Apr 4 20:23:49 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           5.150e+00      1.00014   5.150e+00
Objects:              2.216e+03      1.00636   2.202e+03
Flop:                 3.418e+08      1.01625   3.373e+08  1.079e+10
Flop/sec:             6.637e+07      1.01628   6.550e+07  2.096e+09
MPI Messages:         1.349e+04      1.29226   1.117e+04  3.573e+05
MPI Message Lengths:  1.098e+07      1.11423   9.058e+02  3.236e+08
MPI Reductions:       3.718e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                          e.g., VecAXPY() for real vectors of length N --> 2N flop
                          and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages:   ----- Time ------  ----- Flop -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 5.1423e+00  99.9%  1.0795e+10 100.0%  3.573e+05 100.0%  9.058e+02      100.0%  3.717e+03 100.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase
      %F - percent flop in this phase
      %M - percent messages in this phase
      %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)      Flop                                --- Global ---       --- Stage ---      Total
                   Max Ratio    Max      Ratio    Max   Ratio  Mess   Avg len  Reduct   %T  %F  %M  %L  %R   %T  %F  %M  %L  %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

BuildTwoSided        14 1.0 2.7452e-02  1.0 0.00e+00 0.0 9.5e+02 8.0e+00 0.0e+00   1   0   0   0   0    1   0   0   0   0      0
VecDot                7 1.0 2.7990e-04  3.8 5.04e+04 1.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0   5761
VecMDot             395 1.0 2.8620e-02  1.7 4.89e+06 1.0 0.0e+00 0.0e+00 4.0e+02   0   1   0   0  11    0   1   0   0  11   5354
VecNorm             452 1.0 7.3320e-02  1.2 1.21e+06 1.0 0.0e+00 0.0e+00 4.5e+02   1   0   0   0  12    1   0   0   0  12    521
VecScale            437 1.0 4.4966e-04  1.2 5.53e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  38649
VecCopy             212 1.0 8.0061e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecSet             1036 1.0 1.1883e-02  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecAXPY              42 1.0 4.2272e-03  1.1 1.17e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    871
VecAYPX            1248 1.0 1.8425e-03  1.3 2.17e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   1   0   0   0    0   1   0   0   0  37127
VecAXPBYCZ          624 1.0 1.2541e-03  1.5 4.34e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   1   0   0   0    0   1   0   0   0 109095
VecWAXPY              7 1.0 4.2677e-05  1.9 2.52e+04 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  18895
VecMAXPY            437 1.0 5.2407e-03  1.1 5.88e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   2   0   0   0    0   2   0   0   0  35178
VecAssemblyBegin    112 1.0 1.7178e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02   0   0   0   0   8    0   0   0   0   8      0
VecAssemblyEnd      112 1.0 6.5804e-05  1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecPointwiseMult    161 1.0 4.2541e-03  1.1 4.77e+04 1.1 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    322
VecScatterBegin    1821 1.0 1.8016e-02  1.2 0.00e+00 0.0 2.8e+05 5.4e+02 0.0e+00   0   0  79  47   0    0   0  79  47   0      0
VecScatterEnd      1821 1.0 3.0510e-02  2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecSetRandom         14 1.0 1.3423e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecReduceArith       14 1.0 7.8201e-05  1.3 1.01e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  41242
VecReduceComm         7 1.0 2.1839e-04 11.2 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecNormalize        437 1.0 2.1846e-02  2.5 1.66e+06 1.0 0.0e+00 0.0e+00 4.4e+02   0   0   0   0  12    0   0   0   0  12   2387
MatMult            1338 1.0 1.2559e-01  1.1 1.71e+08 1.0 2.3e+05 5.9e+02 0.0e+00   2  50  66  43   0    2  50  66  43   0  42897
MatMultAdd          156 1.0 2.9215e-02  4.3 1.63e+06 1.0 1.0e+04 2.0e+02 0.0e+00   1   0   3   1   0    1   0   3   1   0   1729
MatMultTranspose    164 1.0 1.4852e-02  2.0 1.81e+06 1.0 1.1e+04 2.1e+02 0.0e+00   0   1   3   1   0    0   1   3   1   0   3805
MatSolve             52 0.0 4.0698e-04  0.0 1.63e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    402
MatSOR             1167 1.0 1.4839e-01  1.1 1.42e+08 1.0 0.0e+00 0.0e+00 0.0e+00   3  42   0   0   0    3  42   0   0   0  30450
MatLUFactorSym        7 1.0 1.0984e-02 32.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatLUFactorNum        7 1.0 1.4591e-02  4.4 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0     20
MatScale             42 1.0 1.4336e-02  1.0 1.47e+05 1.2 1.9e+03 1.7e+02 0.0e+00   0   0   1   0   0    0   0   1   0   0    295
MatResidual         156 1.0 1.2203e-02  1.2 2.05e+07 1.0 2.7e+04 5.9e+02 0.0e+00   0   6   8   5   0    0   6   8   5   0  53190
MatAssemblyBegin    360 1.0 3.1530e-02  1.2 0.00e+00 0.0 1.1e+04 1.2e+04 3.0e+02   1   0   3  39   8    1   0   3  39   8      0
MatAssemblyEnd      360 1.0 1.0410e-01  1.0 0.00e+00 0.0 2.3e+04 7.5e+01 8.6e+02   2   0   6   1  23    2   0   6   1  23      0
MatGetRow         20090 1.1 7.4577e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetRowIJ           7 0.0 5.7969e-03  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCreateSubMat      28 1.0 5.1333e-02  1.0 0.00e+00 0.0 3.0e+03 6.3e+02 4.8e+02   1   0   1   1  13    1   0   1   1  13      0
MatGetOrdering        7 0.0 2.5012e-02  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCoarsen           14 1.0 6.5708e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01   1   0   3   1   2    1   0   3   1   2      0
MatZeroEntries       28 1.0 4.7405e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatAXPY              14 1.0 8.1284e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01   0   0   0   0   1    0   0   0   0   1      0
MatMatMult           14 1.0 2.5482e-02  1.0 6.36e+05 1.1 1.1e+04 3.6e+02 2.2e+02   0   0   3   1   6    0   0   3   1   6    712
MatMatMultSym        14 1.0 2.3641e-02  1.0 0.00e+00 0.0 8.7e+03 3.2e+02 2.0e+02   0   0   2   1   5    0   0   2   1   5      0
MatMatMultNum        14 1.0 1.8179e-03  1.0 6.36e+05 1.1 1.9e+03 5.5e+02 2.8e+01   0   0   1   0   1    0   0   1   0   1   9975
MatPtAP              14 1.0 6.4708e-02  1.0 3.73e+06 1.3 1.9e+04 7.2e+02 2.4e+02   1   1   5   4   6    1   1   5   4   6   1542
MatPtAPSymbolic      14 1.0 5.0878e-02  1.0 0.00e+00 0.0 1.0e+04 1.0e+03 9.8e+01   1   0   3   3   3    1   0   3   3   3      0
MatPtAPNumeric       14 1.0 1.3831e-02  1.0 3.73e+06 1.3 8.8e+03 3.8e+02 1.4e+02   0   1   2   1   4    0   1   2   1   4   7216
MatTrnMatMult         7 1.0 7.4459e-02  1.0 1.57e+06 1.0 1.1e+04 2.3e+03 1.3e+02   1   0   3   7   4    1   0   3   7   4    673
MatTrnMatMultSym      7 1.0 5.0304e-02  1.0 0.00e+00 0.0 9.0e+03 1.4e+03 1.2e+02   1   0   3   4   3    1   0   3   4   3      0
MatTrnMatMultNum      7 1.0 1.1883e-02  1.0 1.57e+06 1.0 1.8e+03 6.5e+03 1.4e+01   0   0   1   4   0    0   0   1   4   0   4216
MatGetLocalMat       56 1.0 8.2507e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetBrAoCol        42 1.0 6.7179e-03  1.1 0.00e+00 0.0 1.3e+04 9.8e+02 0.0e+00   0   0   4   4   0    0   0   4   4   0      0
DMCoarsen             1 1.0 8.9252e-03  1.0 0.00e+00 0.0 1.0e+03 5.3e+01 2.2e+01   0   0   0   0   1    0   0   0   0   1      0
DMCreateInterp        1 1.0 1.6762e-01  1.0 2.34e+04 1.0 4.8e+02 1.5e+02 2.5e+01   3   0   0   0   1    3   0   0   0   1      4
SFSetGraph           14 1.0 1.2898e-04  1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SFBcastBegin         98 1.0 5.5178e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 0.0e+00   1   0   3   1   0    1   0   3   1   0      0
SFBcastEnd           98 1.0 3.9101e-04  2.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SNESSolve             1 1.0 2.4672e+00  1.0 3.42e+08 1.0 3.6e+05 9.1e+02 3.7e+03  48 100 100 100  99   48 100 100 100  99   4375
SNESFunctionEval      8 1.0 6.9380e-02  1.0 0.00e+00 0.0 4.1e+03 8.6e+02 0.0e+00   1   0   1   1   0    1   0   1   1   0      0
SNESJacobianEval     14 1.0 1.6903e-01  1.0 0.00e+00 0.0 7.6e+03 1.6e+04 4.2e+01   3   0   2  38   1    3   0   2  38   1      0
SNESLineSearch        7 1.0 3.2927e-02  1.0 2.65e+06 1.0 5.4e+03 8.6e+02 2.8e+01   1   1   2   1   1    1   1   2   1   1   2574
KSPGMRESOrthog      395 1.0 3.9574e-02  1.4 9.77e+06 1.0 0.0e+00 0.0e+00 4.0e+02   1   3   0   0  11    1   3   0   0  11   7745
KSPSetUp             78 1.0 7.8094e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+01   0   0   0   0   1    0   0   0   0   1      0
KSPSolve              7 1.0 2.1647e+00  1.0 3.39e+08 1.0 3.5e+05 6.8e+02 3.6e+03  42  99  97  73  97   42  99  97  73  97   4947
PCGAMGGraph_AGG      14 1.0 6.8386e-02  1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02   1   0   3   0  10    1   0   3   0  10     39
PCGAMGCoarse_AGG     14 1.0 2.2352e-01  1.0 1.57e+06 1.0 3.1e+04 9.6e+02 3.2e+02   4   0   9   9   8    4   0   9   9   8    224
PCGAMGProl_AGG       14 1.0 2.8829e-01  1.0 0.00e+00 0.0 1.3e+04 3.1e+02 5.5e+02   6   0   4   1  15    6   0   4   1  15      0
PCGAMGPOpt_AGG       14 1.0 4.8292e-01  1.0 5.48e+06 1.1 3.0e+04 3.5e+02 6.6e+02   9   1   8   3  18    9   1   8   3  18    327
GAMG: createProl     14 1.0 1.0821e+00  1.0 7.14e+06 1.1 8.3e+04 5.4e+02 1.9e+03  21   2  23  14  51   21   2  23  14  51    195
Graph                28 1.0 5.7079e-02  1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02   1   0   3   0  10    1   0   3   0  10     47
MIS/Agg              14 1.0 8.5649e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01   2   0   3   1   2    2   0   3   1   2      0
SA: col data         14 1.0 6.4638e-03  1.0 0.00e+00 0.0 9.5e+03 3.0e+02 3.5e+02   0   0   3   1   9    0   0   3   1   9      0
SA: frmProl0         14 1.0 2.8070e-01  1.0 0.00e+00 0.0 3.5e+03 3.6e+02 1.4e+02   5   0   1   0   4    5   0   1   0   4      0
SA: smooth           14 1.0 5.0714e-02  1.0 6.91e+05 1.2 1.1e+04 3.6e+02 2.8e+02   1   0   3   1   8    1   0   3   1   8    388
GAMG: partLevel      14 1.0 1.6271e-01  1.0 3.73e+06 1.3 2.3e+04 6.9e+02 9.8e+02   3   1   6   5  26    3   1   6   5  26    613
repartition          14 1.0 1.6053e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01   0   0   0   0   2    0   0   0   0   2      0
Invert-Sort          14 1.0 9.0451e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01   0   0   0   0   2    0   0   0   0   2      0
Move A               14 1.0 5.0542e-02  1.0 0.00e+00 0.0 1.6e+03 1.2e+03 2.5e+02   1   0   0   1   7    1   0   0   1   7      0
Move P               14 1.0 5.6243e-03  1.0 0.00e+00 0.0 1.4e+03 4.8e+01 2.5e+02   0   0   0   0   7    0   0   0   0   7      0
PCSetUp              14 1.0 1.6199e+00  1.0 1.11e+07 1.2 1.1e+05 9.0e+02 3.0e+03  31   3  31  31  82   31   3  31  31  82    196
PCSetUpOnBlocks      52 1.0 4.0324e-02  1.6 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      7
PCApply              52 1.0 5.6706e-01  1.0 3.09e+08 1.0 2.2e+05 5.6e+02 4.4e+02  11  91  62  38  12   11  91  62  38  12  17276
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type                 Creations   Destructions       Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

Toy Hydrostatic Ice             1              1          832     0.
Vector                       1036           1036      4624504     0.
Vector Scatter                127            127       192456     0.
Matrix                        509            509     13321800     0.
Matrix Coarsen                 14             14         9464     0.
Distributed Mesh                4              4        21744     0.
Index Set                     415            415       459144     0.
IS L to G Mapping               4              4        28000     0.
Star Forest Bipartite Graph    22             22        19728     0.
Discrete System                 4              4         3720     0.
SNES                            1              1         1488     0.
SNESLineSearch                  1              1         1040     0.
DMSNES                          2              2         1504     0.
Krylov Solver                  24             24       894048     0.
DMKSP interface                 2              2         1392     0.
Preconditioner                 21             21        21848     0.
PetscRandom                    28             28        19208     0.
Viewer                          1              0            0     0.
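Note on the derived figures for this run: the sketch below recomputes three of them from values printed above. It assumes, since the log does not say so, that "Intensity" is FLOPS divided by L1 misses and that "Rate" is degrees of freedom per second of wall-clock time; both assumptions reproduce the printed values to within rounding of the inputs. The Mflop/s helper uses a factor of 1e-6 (the legend writes 10e-6), which is the factor that matches the reported total Flop/sec of 2.096e+09. All function names are illustrative and are not part of PETSc or ex48.

```python
# Recompute derived metrics for the -da_refine 1 run from numbers printed in the log.
# ASSUMPTION: these definitions are inferred from the printed values, not stated by the log.

def arithmetic_intensity(flops: float, l1_misses: float) -> float:
    """Flops per L1 cache miss (assumed meaning of 'Intensity')."""
    return flops / l1_misses


def dof_rate(dofs: int, wall_seconds: float) -> float:
    """Degrees of freedom per second of wall-clock time (assumed meaning of 'Rate')."""
    return dofs / wall_seconds


def total_mflops(total_flop: float, max_time: float) -> float:
    """Mflop/s as 1e-6 * (sum of flop over all ranks) / (max time over all ranks)."""
    return 1e-6 * total_flop / max_time


# Inputs copied from the run above.
FLOPS = 1.079e10        # "FLOPS" line, also the Total column of the "Flop" row
L1_MISSES = 1.470e9     # "L1 misses" line
DOFS = 115200           # "Degrees-of-freedom" line
WALL_CLOCK = 5.029      # "Wall-clock time" line, seconds
PETSC_MAX_TIME = 5.150  # Max column of the "Time (sec)" row

print(f"Intensity: {arithmetic_intensity(FLOPS, L1_MISSES):.3f}")
print(f"Rate:      {dof_rate(DOFS, WALL_CLOCK):.4e}")
print(f"Mflop/s:   {total_mflops(FLOPS, PETSC_MAX_TIME):.1f}")
```

Small differences from the printed values in the last digit are expected, because the inputs are themselves rounded to four significant figures in the log.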
========================================================================================================================
Average time to get PetscTime(): 1.19209e-07
Average time for MPI_Barrier(): 2.00272e-06
Average time for zero size MPI_Send(): 1.62423e-06
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 1
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8
Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt
-----------------------------------------
Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04
Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64
Using PETSc directory: /global/homes/j/jychang/Software/petsc
Using PETSc arch: arch-cori-c-opt
-----------------------------------------
Using C compiler: cc ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl
-----------------------------------------

Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 44
|X|_2 11109.6   -4.74462e-12 <= u <= 24.5864   -3.11696 <= v <= 3.11696   1.10135e-14 <= c <= 24.5864
Surface statistics: u in [1.222055e+01, 2.458639e+01] mean 2.020187e+01
Global eta range 2.76911e+10 to 9.2273e+12 converged range 2.76911e+10 to 4.45572e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 1.029e+01 seconds
Degrees-of-freedom: 870400
FLOPS: 8.459e+10
L1 misses: 1.068e+10
Intensity: 7.917e+00
Rate: 8.456e+04

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/global/cscratch1/sd/jychang/Icesheet/./ex48cori on a arch-cori-c-opt named nid00359 with 32 processors, by jychang Tue Apr 4 20:24:17 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           1.040e+01      1.00005   1.040e+01
Objects:              2.294e+03      1.00614   2.280e+03
Flop:                 2.648e+09      1.00205   2.643e+09  8.459e+10
Flop/sec:             2.547e+08      1.00205   2.542e+08  8.136e+09
MPI Messages:         1.687e+04      1.21831   1.456e+04  4.660e+05
MPI Message Lengths:  3.247e+07      1.03568   2.170e+03  1.011e+09
MPI Reductions:       4.000e+03      1.00000

Summary of Stages:   ----- Time ------  ----- Flop -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 1.0389e+01  99.9%  8.4588e+10 100.0%  4.660e+05 100.0%  2.170e+03      100.0%  3.999e+03 100.0%

------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)      Flop                                --- Global ---       --- Stage ---      Total
                   Max Ratio    Max      Ratio    Max   Ratio  Mess   Avg len  Reduct   %T  %F  %M  %L  %R   %T  %F  %M  %L  %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

BuildTwoSided        14 1.0 3.2408e-02  1.0 0.00e+00 0.0 9.5e+02 8.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecDot                7 1.0 1.3173e-03  2.4 3.81e+05 1.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0   9251
VecMDot             464 1.0 9.6009e-02  1.9 3.35e+07 1.0 0.0e+00 0.0e+00 4.6e+02   1   1   0   0  12    1   1   0   0  12  11140
VecNorm             528 1.0 7.3724e-02  1.3 8.51e+06 1.0 0.0e+00 0.0e+00 5.3e+02   1   0   0   0  13    1   0   0   0  13   3686
VecScale            513 1.0 1.6105e-03  1.1 3.85e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  76249
VecCopy             267 1.0 8.3096e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecSet             1211 1.0 1.6581e-02  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecAXPY              49 1.0 4.5407e-03  1.1 8.28e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0   5823
VecAYPX            1632 1.0 4.8082e-02  1.1 1.60e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0   1   0   0   0    0   1   0   0   0  10628
VecAXPBYCZ          816 1.0 2.7187e-02  1.2 3.20e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0   1   0   0   0    0   1   0   0   0  37592
VecWAXPY              7 1.0 8.7214e-04  1.1 1.90e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0   6986
VecMAXPY            513 1.0 1.5888e-02  1.1 4.04e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0   2   0   0   0    0   2   0   0   0  81112
VecAssemblyBegin    112 1.0 2.1532e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02   0   0   0   0   7    0   0   0   0   7      0
VecAssemblyEnd      112 1.0 5.8174e-05  1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecPointwiseMult    168 1.0 4.4577e-03  1.1 7.29e+04 1.1 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    488
VecScatterBegin    2289 1.0 5.4971e-02  1.1 0.00e+00 0.0 3.9e+05 1.3e+03 0.0e+00   1   0  83  51   0    1   0  83  51   0      0
VecScatterEnd      2289 1.0 1.6494e-01  2.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   1   0   0   0   0    1   0   0   0   0      0
VecSetRandom         14 1.0 1.3494e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecReduceArith       14 1.0 4.4966e-04  1.2 7.62e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  54198
VecReduceComm         7 1.0 2.8491e-04  4.2 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecNormalize        513 1.0 2.8581e-02  1.9 1.15e+07 1.0 0.0e+00 0.0e+00 5.1e+02   0   0   0   0  13    0   0   0   0  13  12890
MatMult            1695 1.0 1.8132e+00  1.0 1.31e+09 1.0 3.3e+05 1.4e+03 0.0e+00  17  50  70  46   0   17  50  70  46   0  23157
MatMultAdd          204 1.0 4.5427e-02  1.9 1.08e+07 1.0 1.5e+04 5.0e+02 0.0e+00   0   0   3   1   0    0   0   3   1   0   7557
MatMultTranspose    220 1.0 5.0090e-02  1.9 1.24e+07 1.0 1.6e+04 5.2e+02 0.0e+00   0   0   4   1   0    0   0   4   1   0   7893
MatSolve             51 0.0 4.8900e-04  0.0 1.60e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    328
MatSOR             1532 1.0 2.9772e+00  1.0 1.17e+09 1.0 0.0e+00 0.0e+00 0.0e+00  28  44   0   0   0   28  44   0   0   0  12539
MatLUFactorSym        7 1.0 1.4097e-02 14.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatLUFactorNum        7 1.0 1.9433e-02  4.1 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0     15
MatScale             42 1.0 2.1112e-02  1.0 1.47e+05 1.2 1.9e+03 1.7e+02 0.0e+00   0   0   0   0   0    0   0   0   0   0    200
MatResidual         204 1.0 2.1771e-01  1.1 1.61e+08 1.0 4.0e+04 1.4e+03 0.0e+00   2   6   9   6   0    2   6   9   6   0  23558
MatAssemblyBegin    369 1.0 5.7523e-02  1.3 0.00e+00 0.0 1.3e+04 3.6e+04 3.4e+02   0   0   3  45   8    0   0   3  45   8      0
MatAssemblyEnd      369 1.0 1.3202e-01  1.0 0.00e+00 0.0 2.4e+04 9.9e+01 8.8e+02   1   0   5   0  22    1   0   5   0  22      0
MatGetRow         20090 1.1 7.6849e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetRowIJ           7 0.0 6.0282e-03  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCreateSubMat      28 1.0 5.2260e-02  1.0 0.00e+00 0.0 3.0e+03 6.3e+02 4.8e+02   1   0   1   0  12    1   0   1   0  12      0
MatGetOrdering        7 0.0 2.5805e-02  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCoarsen           14 1.0 7.4183e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01   1   0   2   0   2    1   0   2   0   2      0
MatZeroEntries       35 1.0 5.2785e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   1   0   0   0   0    1   0   0   0   0      0
MatAXPY              14 1.0 9.9297e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01   0   0   0   0   1    0   0   0   0   1      0
MatMatMult           14 1.0 2.4828e-02  1.0 6.36e+05 1.1 1.1e+04 3.6e+02 2.2e+02   0   0   2   0   6    0   0   2   0   6    730
MatMatMultSym        14 1.0 2.3042e-02  1.0 0.00e+00 0.0 8.7e+03 3.2e+02 2.0e+02   0   0   2   0   5    0   0   2   0   5      0
MatMatMultNum        14 1.0 1.7328e-03  1.0 6.36e+05 1.1 1.9e+03 5.5e+02 2.8e+01   0   0   0   0   1    0   0   0   0   1  10465
MatPtAP              14 1.0 6.5125e-02  1.0 3.73e+06 1.3 1.9e+04 7.2e+02 2.4e+02   1   0   4   1   6    1   0   4   1   6   1533
MatPtAPSymbolic      14 1.0 5.1410e-02  1.0 0.00e+00 0.0 1.0e+04 1.0e+03 9.8e+01   0   0   2   1   2    0   0   2   1   2      0
MatPtAPNumeric       14 1.0 1.3699e-02  1.0 3.73e+06 1.3 8.8e+03 3.8e+02 1.4e+02   0   0   2   0   4    0   0   2   0   4   7286
MatTrnMatMult         7 1.0 8.3159e-02  1.0 1.57e+06 1.0 1.1e+04 2.3e+03 1.3e+02   1   0   2   2   3    1   0   2   2   3    602
MatTrnMatMultSym      7 1.0 5.9392e-02  1.0 0.00e+00 0.0 9.0e+03 1.4e+03 1.2e+02   1   0   2   1   3    1   0   2   1   3      0
MatTrnMatMultNum      7 1.0 1.2359e-02  1.0 1.57e+06 1.0 1.8e+03 6.5e+03 1.4e+01   0   0   0   1   0    0   0   0   1   0   4053
MatGetLocalMat       56 1.0 9.6574e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetBrAoCol        42 1.0 7.0934e-03  1.2 0.00e+00 0.0 1.3e+04 9.8e+02 0.0e+00   0   0   3   1   0    0   0   3   1   0      0
DMCoarsen             2 1.0 1.8643e-02  1.0 0.00e+00 0.0 2.0e+03 1.1e+02 4.4e+01   0   0   0   0   1    0   0   0   0   1      0
DMCreateInterp        2 1.0 2.0458e-01  1.0 2.03e+05 1.0 9.6e+02 3.4e+02 5.0e+01   2   0   0   0   1    2   0   0   0   1     32
SFSetGraph           14 1.0 1.1492e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SFBcastBegin         98 1.0 6.1451e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 0.0e+00   1   0   2   0   0    1   0   2   0   0      0
SFBcastEnd           98 1.0 3.1519e-04  2.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SNESSolve             1 1.0 8.1851e+00  1.0 2.65e+09 1.0 4.6e+05 2.2e+03 4.0e+03  79 100 100 100  99   79 100 100 100  99  10334
SNESFunctionEval      8 1.0 2.9148e-01  1.0 0.00e+00 0.0 4.1e+03 3.2e+03 0.0e+00   3   0   1   1   0    3   0   1   1   0      0
SNESJacobianEval     21 1.0 1.1486e+00  1.0 0.00e+00 0.0 1.1e+04 4.0e+04 7.0e+01  11   0   2  45   2   11   0   2  45   2      0
SNESLineSearch        7 1.0 2.5393e-01  1.0 2.12e+07 1.0 5.4e+03 3.2e+03 2.8e+01   2   1   1   2   1    2   1   1   2   1   2668
KSPGMRESOrthog      464 1.0 1.1742e-01  1.6 6.70e+07 1.0 0.0e+00 0.0e+00 4.6e+02   1   3   0   0  12    1   3   0   0  12  18218
KSPSetUp             86 1.0 1.1960e-02  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01   0   0   0   0   1    0   0   0   0   1      0
KSPSolve              7 1.0 6.7683e+00  1.0 2.63e+09 1.0 4.5e+05 1.4e+03 3.9e+03  65  99  98  65  97   65  99  98  65  97  12397
PCGAMGGraph_AGG      14 1.0 8.5653e-02  1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02   1   0   2   0   9    1   0   2   0   9     31
PCGAMGCoarse_AGG     14 1.0 2.2754e-01  1.0 1.57e+06 1.0 3.1e+04 9.6e+02 3.2e+02   2   0   7   3   8    2   0   7   3   8    220
PCGAMGProl_AGG       14 1.0 1.7076e-01  1.0 0.00e+00 0.0 1.3e+04 3.1e+02 5.5e+02   2   0   3   0  14    2   0   3   0  14      0
PCGAMGPOpt_AGG       14 1.0 3.8462e-01  1.0 5.48e+06 1.1 3.0e+04 3.5e+02 6.6e+02   4   0   6   1  16    4   0   6   1  16    411
GAMG: createProl     14 1.0 8.7617e-01  1.0 7.14e+06 1.1 8.3e+04 5.4e+02 1.9e+03   8   0  18   4  47    8   0  18   4  47    241
Graph                28 1.0 6.8999e-02  1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02   1   0   2   0   9    1   0   2   0   9     39
MIS/Agg              14 1.0 8.2166e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01   1   0   2   0   2    1   0   2   0   2      0
SA: col data         14 1.0 6.0670e-03  1.0 0.00e+00 0.0 9.5e+03 3.0e+02 3.5e+02   0   0   2   0   9    0   0   2   0   9      0
SA: frmProl0         14 1.0 1.6367e-01  1.0 0.00e+00 0.0 3.5e+03 3.6e+02 1.4e+02   2   0   1   0   4    2   0   1   0   4      0
SA: smooth           14 1.0 5.5566e-02  1.0 6.91e+05 1.2 1.1e+04 3.6e+02 2.8e+02   1   0   2   0   7    1   0   2   0   7    354
GAMG: partLevel      14 1.0 1.6794e-01  1.0 3.73e+06 1.3 2.3e+04 6.9e+02 9.8e+02   2   0   5   2  24    2   0   5   2  25    594
repartition          14 1.0 1.6151e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01   0   0   0   0   2    0   0   0   0   2      0
Invert-Sort          14 1.0 8.8663e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01   0   0   0   0   1    0   0   0   0   1      0
Move A               14 1.0 5.1425e-02  1.0 0.00e+00 0.0 1.6e+03 1.2e+03 2.5e+02   0   0   0   0   6    0   0   0   0   6      0
Move P               14 1.0 5.5583e-03  1.0 0.00e+00 0.0 1.4e+03 4.8e+01 2.5e+02   0   0   0   0   6    0   0   0   0   6      0
PCSetUp              14 1.0 1.6160e+00  1.0 1.25e+07 1.1 1.2e+05 1.6e+03 3.2e+03  15   0  26  18  79   15   0  26  18  79    225
PCSetUpOnBlocks      51 1.0 4.6449e-02  1.5 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      6
PCApply              51 1.0 5.0000e+00  1.0 2.47e+09 1.0 3.2e+05 1.3e+03 6.2e+02  48  93  70  43  15   48  93  70  43  15  15783
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type                 Creations   Destructions       Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

Toy Hydrostatic Ice             1              1          832     0.
Vector                       1075           1075     13927680     0.
Vector Scatter                132            132       428896     0.
Matrix                        518            518     30921736     0.
Matrix Coarsen                 14             14         9464     0.
Distributed Mesh                6              6        32576     0.
Index Set                     425            425       638824     0.
IS L to G Mapping               6              6       162480     0.
Star Forest Bipartite Graph    26             26        23152     0.
Discrete System                 6              6         5576     0.
SNES                            1              1         1488     0.
SNESLineSearch                  1              1         1040     0.
DMSNES                          3              3         2296     0.
Krylov Solver                  26             26       925984     0.
DMKSP interface                 3              3         2088     0.
Preconditioner                 22             22        22888     0.
PetscRandom                    28             28        19208     0.
Viewer                          1              0            0     0.
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 2.19345e-06
Average time for zero size MPI_Send(): 1.65403e-06
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 2
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries

Level 3 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 320 x 320 x 33 (3379200), size (m) 31.25 x 31.25 x 31.25
Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 44
|X|_2 31108.6   -1.9031e-12 <= u <= 24.6134   -3.12097 <= v <= 3.12097   8.36597e-16 <= c <= 24.6134
Surface statistics: u in [1.223876e+01, 2.461340e+01] mean 2.022494e+01
Global eta range 2.6772e+10 to 9.2273e+12 converged range 2.6772e+10 to 6.96406e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 5.850e+01 seconds
Degrees-of-freedom: 6758400
FLOPS: 6.801e+11
L1 misses: 8.556e+10
Intensity: 7.948e+00
Rate: 1.155e+05

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/global/cscratch1/sd/jychang/Icesheet/./ex48cori on a arch-cori-c-opt named nid00359 with 32 processors, by jychang Tue Apr 4 20:25:23 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           5.860e+01      1.00001   5.860e+01
Objects:              2.372e+03      1.00594   2.358e+03
Flop:                 2.126e+10      1.00025   2.125e+10  6.801e+11
Flop/sec:             3.627e+08      1.00026   3.627e+08  1.161e+10
MPI Messages:         2.038e+04      1.17409   1.808e+04  5.786e+05
MPI Message Lengths:  1.169e+08      1.00966   6.417e+03  3.713e+09
MPI Reductions:       4.284e+03      1.00000

Summary of Stages:   ----- Time ------  ----- Flop -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 5.8594e+01 100.0%  6.8008e+11 100.0%  5.786e+05 100.0%  6.417e+03      100.0%  4.283e+03 100.0%

------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)      Flop                                --- Global ---       --- Stage ---      Total
                   Max Ratio    Max      Ratio    Max   Ratio  Mess   Avg len  Reduct   %T  %F  %M  %L  %R   %T  %F  %M  %L  %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

BuildTwoSided        14 1.0 3.2702e-02  1.0 0.00e+00 0.0 9.5e+02 8.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecDot                7 1.0 1.0628e-02  2.2 2.96e+06 1.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0   8903
VecMDot             534 1.0 8.4907e-01  4.5 2.56e+08 1.0 0.0e+00 0.0e+00 5.3e+02   1   1   0   0  12    1   1   0   0  12   9663
VecNorm             605 1.0 2.3112e-01  1.7 6.53e+07 1.0 0.0e+00 0.0e+00 6.0e+02   0   0   0   0  14    0   0   0   0  14   9042
VecScale            590 1.0 1.8753e-02  1.3 2.95e+07 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  50310
VecCopy             325 1.0 1.0678e-01  1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecSet             1400 1.0 8.8583e-02  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecAXPY              56 1.0 1.9573e-02  1.0 6.36e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  10397
VecAYPX            2040 1.0 4.1538e-01  1.7 1.24e+08 1.0 0.0e+00 0.0e+00 0.0e+00   1   1   0   0   0    1   1   0   0   0   9528
VecAXPBYCZ         1020 1.0 2.6012e-01  1.3 2.47e+08 1.0 0.0e+00 0.0e+00 0.0e+00   0   1   0   0   0    0   1   0   0   0  30431
VecWAXPY              7 1.0 1.0615e-02  1.0 1.48e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0   4457
VecMAXPY            590 1.0 4.9074e-01  1.1 3.09e+08 1.0 0.0e+00 0.0e+00 0.0e+00   1   1   0   0   0    1   1   0   0   0  20150
VecAssemblyBegin    112 1.0 1.9824e-02  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02   0   0   0   0   7    0   0   0   0   7      0
VecAssemblyEnd      112 1.0 6.2466e-05  1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecPointwiseMult    175 1.0 4.1749e-03  1.1 2.63e+05 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0   1981
VecScatterBegin    2782 1.0 1.4979e-01  1.1 0.00e+00 0.0 5.0e+05 3.8e+03 0.0e+00   0   0  86  51   0    0   0  86  51   0      0
VecScatterEnd      2782 1.0 1.5741e+00  7.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   2   0   0   0   0    2   0   0   0   0      0
VecSetRandom         14 1.0 1.3471e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecReduceArith       14 1.0 5.8839e-03  1.1 5.91e+06 1.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0  32161
VecReduceComm         7 1.0 1.4040e-03 15.4 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00   0   0   0   0   0    0   0   0   0   0      0
VecNormalize        590 1.0 1.8667e-01  1.8 8.85e+07 1.0 0.0e+00 0.0e+00 5.9e+02   0   0   0   0  14    0   0   0   0  14  15163
MatMult            2071 1.0 1.5491e+01  1.1 1.05e+10 1.0 4.2e+05 4.1e+03 0.0e+00  26  49  73  47   0   26  49  73  47   0  21706
MatMultAdd          255 1.0 2.3932e-01  1.2 8.27e+07 1.0 2.0e+04 1.4e+03 0.0e+00   0   0   3   1   0    0   0   3   1   0  11058
MatMultTranspose    279 1.0 4.4844e-01  2.9 9.57e+07 1.0 2.2e+04 1.4e+03 0.0e+00   0   0   4   1   0    0   0   4   1   0   6823
MatSolve             51 0.0 4.9591e-04  0.0 1.60e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0    323
MatSOR             1915 1.0 2.7142e+01  1.1 9.51e+09 1.0 0.0e+00 0.0e+00 0.0e+00  45  45   0   0   0   45  45   0   0   0  11216
MatLUFactorSym        7 1.0 1.0582e-02 30.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatLUFactorNum        7 1.0 1.5340e-02  4.4 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0     19
MatScale             42 1.0 1.9720e-02  1.0 1.47e+05 1.2 1.9e+03 1.7e+02 0.0e+00   0   0   0   0   0    0   0   0   0   0    214
MatResidual         255 1.0 1.9884e+00  1.2 1.29e+09 1.0 5.3e+04 4.1e+03 0.0e+00   3   6   9   6   0    3   6   9   6   0  20694
MatAssemblyBegin    378 1.0 2.4761e-01  2.0 0.00e+00 0.0 1.5e+04 1.2e+05 3.7e+02   0   0   3  48   9    0   0   3  48   9      0
MatAssemblyEnd      378 1.0 2.5680e-01  1.0 0.00e+00 0.0 2.4e+04 1.9e+02 8.9e+02   0   0   4   0  21    0   0   4   0  21      0
MatGetRow         20090 1.1 6.7832e-03  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetRowIJ           7 0.0 7.5543e-03  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCreateSubMat      28 1.0 3.4220e-02  1.0 0.00e+00 0.0 3.0e+03 6.3e+02 4.8e+02   0   0   1   0  11    0   0   1   0  11      0
MatGetOrdering        7 0.0 2.6727e-02  0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatCoarsen           14 1.0 7.1902e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01   0   0   2   0   2    0   0   2   0   2      0
MatZeroEntries       42 1.0 4.1994e-01  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   1   0   0   0   0    1   0   0   0   0      0
MatAXPY              14 1.0 1.0168e-02  1.1 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01   0   0   0   0   1    0   0   0   0   1      0
MatMatMult           14 1.0 2.5724e-02  1.0 6.36e+05 1.1 1.1e+04 3.6e+02 2.2e+02   0   0   2   0   5    0   0   2   0   5    705
MatMatMultSym        14 1.0 2.3988e-02  1.0 0.00e+00 0.0 8.7e+03 3.2e+02 2.0e+02   0   0   2   0   5    0   0   2   0   5      0
MatMatMultNum        14 1.0 1.6725e-03  1.0 6.36e+05 1.1 1.9e+03 5.5e+02 2.8e+01   0   0   0   0   1    0   0   0   0   1  10843
MatPtAP              14 1.0 6.9330e-02  1.0 3.73e+06 1.3 1.9e+04 7.2e+02 2.4e+02   0   0   3   0   6    0   0   3   0   6   1440
MatPtAPSymbolic      14 1.0 5.5696e-02  1.0 0.00e+00 0.0 1.0e+04 1.0e+03 9.8e+01   0   0   2   0   2    0   0   2   0   2      0
MatPtAPNumeric       14 1.0 1.3617e-02  1.0 3.73e+06 1.3 8.8e+03 3.8e+02 1.4e+02   0   0   2   0   3    0   0   2   0   3   7330
MatTrnMatMult         7 1.0 7.8797e-02  1.0 1.57e+06 1.0 1.1e+04 2.3e+03 1.3e+02   0   0   2   1   3    0   0   2   1   3    636
MatTrnMatMultSym      7 1.0 5.4526e-02  1.0 0.00e+00 0.0 9.0e+03 1.4e+03 1.2e+02   0   0   2   0   3    0   0   2   0   3      0
MatTrnMatMultNum      7 1.0 1.2680e-02  1.0 1.57e+06 1.0 1.8e+03 6.5e+03 1.4e+01   0   0   0   0   0    0   0   0   0   0   3951
MatGetLocalMat       56 1.0 9.1653e-03  1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
MatGetBrAoCol        42 1.0 7.7808e-03  1.2 0.00e+00 0.0 1.3e+04 9.8e+02 0.0e+00   0   0   2   0   0    0   0   2   0   0      0
DMCoarsen             3 1.0 1.0102e-02  1.0 0.00e+00 0.0 3.1e+03 2.6e+02 6.6e+01   0   0   1   0   2    0   0   1   0   2      0
DMCreateInterp        3 1.0 2.0824e-01  1.0 1.61e+06 1.0 1.4e+03 8.7e+02 7.5e+01   0   0   0   0   2    0   0   0   0   2    248
SFSetGraph           14 1.0 1.1158e-04  1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SFBcastBegin         98 1.0 6.0109e-02  1.0 0.00e+00 0.0 1.1e+04 2.6e+02 0.0e+00   0   0   2   0   0    0   0   2   0   0      0
SFBcastEnd           98 1.0 3.0899e-04  2.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00   0   0   0   0   0    0   0   0   0   0      0
SNESSolve             1 1.0 5.6206e+01  1.0 2.13e+10
1.0 5.8e+05 6.4e+03 4.2e+03 96100100100 99 96100100100 99 12100 SNESFunctionEval 8 1.0 1.9475e+00 1.0 0.00e+00 0.0 4.1e+03 1.2e+04 0.0e+00 3 0 1 1 0 3 0 1 1 0 0 SNESJacobianEval 28 1.0 8.6584e+00 1.0 0.00e+00 0.0 1.5e+04 1.2e+05 9.8e+01 15 0 3 48 2 15 0 3 48 2 0 SNESLineSearch 7 1.0 2.0236e+00 1.0 1.69e+08 1.0 5.4e+03 1.2e+04 2.8e+01 3 1 1 2 1 3 1 1 2 1 2677 KSPGMRESOrthog 534 1.0 1.2579e+00 2.3 5.13e+08 1.0 0.0e+00 0.0e+00 5.3e+02 2 2 0 0 12 2 2 0 0 12 13046 KSPSetUp 94 1.0 2.5028e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 7.2e+01 0 0 0 0 2 0 0 0 0 2 0 KSPSolve 7 1.0 4.6249e+01 1.0 2.11e+10 1.0 5.7e+05 4.1e+03 4.2e+03 79 99 98 62 98 79 99 98 62 98 14587 PCGAMGGraph_AGG 14 1.0 7.3272e-02 1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02 0 0 2 0 8 0 0 2 0 8 36 PCGAMGCoarse_AGG 14 1.0 2.1786e-01 1.0 1.57e+06 1.0 3.1e+04 9.6e+02 3.2e+02 0 0 5 1 7 0 0 5 1 7 230 PCGAMGProl_AGG 14 1.0 1.6291e-01 1.0 0.00e+00 0.0 1.3e+04 3.1e+02 5.5e+02 0 0 2 0 13 0 0 2 0 13 0 PCGAMGPOpt_AGG 14 1.0 3.2663e-01 1.0 5.48e+06 1.1 3.0e+04 3.5e+02 6.6e+02 1 0 5 0 15 1 0 5 0 15 484 GAMG: createProl 14 1.0 7.8852e-01 1.0 7.14e+06 1.1 8.3e+04 5.4e+02 1.9e+03 1 0 14 1 44 1 0 14 1 44 267 Graph 28 1.0 6.0457e-02 1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02 0 0 2 0 8 0 0 2 0 8 44 MIS/Agg 14 1.0 8.0180e-02 1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01 0 0 2 0 2 0 0 2 0 2 0 SA: col data 14 1.0 6.7234e-03 1.0 0.00e+00 0.0 9.5e+03 3.0e+02 3.5e+02 0 0 2 0 8 0 0 2 0 8 0 SA: frmProl0 14 1.0 1.5508e-01 1.0 0.00e+00 0.0 3.5e+03 3.6e+02 1.4e+02 0 0 1 0 3 0 0 1 0 3 0 SA: smooth 14 1.0 5.3384e-02 1.0 6.91e+05 1.2 1.1e+04 3.6e+02 2.8e+02 0 0 2 0 7 0 0 2 0 7 369 GAMG: partLevel 14 1.0 1.5463e-01 1.0 3.73e+06 1.3 2.3e+04 6.9e+02 9.8e+02 0 0 4 0 23 0 0 4 0 23 645 repartition 14 1.0 1.6106e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01 0 0 0 0 2 0 0 0 0 2 0 Invert-Sort 14 1.0 8.8561e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 1 0 0 0 0 1 0 Move A 14 1.0 3.3668e-02 1.0 0.00e+00 0.0 1.6e+03 1.2e+03 2.5e+02 0 0 0 0 6 0 0 0 0 6 0 Move P 
14 1.0 5.4650e-03 1.0 0.00e+00 0.0 1.4e+03 4.8e+01 2.5e+02 0 0 0 0 6 0 0 0 0 6 0 PCSetUp 14 1.0 2.4866e+00 1.0 2.40e+07 1.1 1.3e+05 4.2e+03 3.3e+03 4 0 22 14 76 4 0 22 14 76 294 PCSetUpOnBlocks 51 1.0 4.2618e-02 1.9 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 7 PCApply 51 1.0 4.2158e+01 1.0 1.99e+10 1.0 4.3e+05 3.8e+03 8.0e+02 72 94 74 44 19 72 94 74 44 19 15110 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Toy Hydrostatic Ice 1 1 832 0. Vector 1114 1114 85317528 0. Vector Scatter 137 137 2175736 0. Matrix 527 527 169532088 0. Matrix Coarsen 14 14 9464 0. Distributed Mesh 8 8 43408 0. Index Set 435 435 1729448 0. IS L to G Mapping 8 8 1100672 0. Star Forest Bipartite Graph 30 30 26576 0. Discrete System 8 8 7432 0. SNES 1 1 1488 0. SNESLineSearch 1 1 1040 0. DMSNES 4 4 3088 0. Krylov Solver 28 28 957920 0. DMKSP interface 4 4 2784 0. Preconditioner 23 23 23928 0. PetscRandom 28 28 19208 0. Viewer 1 0 0 0. 
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 2.19345e-06
Average time for zero size MPI_Send(): 1.72108e-06
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 3
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8
Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt
-----------------------------------------
Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04
Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64
Using PETSc directory: /global/homes/j/jychang/Software/petsc
Using PETSc arch: arch-cori-c-opt
-----------------------------------------
Using C compiler: cc ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl
-----------------------------------------

Level 4 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 640 x 640 x 65 (26624000), size (m) 15.625 x 15.625 x 15.625
Level 3 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 320 x 320 x 33 (3379200), size (m) 31.25 x 31.25 x 31.25
Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 43
|X|_2 87518.2 -8.01069e-13 <= u <= 24.6202 -3.12198 <= v <= 3.12198 2.63324e-16 <= c <= 24.6202
Surface statistics: u in [1.224323e+01, 2.462018e+01] mean 2.023071e+01
Global eta range 2.63231e+10 to 9.2273e+12 converged range 2.63231e+10 to 8.50855e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 4.437e+02 seconds
Degrees-of-freedom: 53248000
FLOPS: 5.372e+12
L1 misses: 6.905e+11
Intensity: 7.780e+00
Rate: 1.200e+05
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/global/cscratch1/sd/jychang/Icesheet/./ex48cori on a arch-cori-c-opt named nid00359 with 32 processors, by jychang Tue Apr 4 20:32:50 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                      Max        Max/Min   Avg        Total
Time (sec):           4.439e+02  1.00000   4.439e+02
Objects:              2.450e+03  1.00575   2.436e+03
Flop:                 1.679e+11  1.00003   1.679e+11  5.372e+12
Flop/sec:             3.782e+08  1.00003   3.782e+08  1.210e+10
MPI Messages:         2.360e+04  1.14529   2.132e+04  6.821e+05
MPI Message Lengths:  4.474e+08  1.00249   2.095e+04  1.429e+10
MPI Reductions:       4.566e+03  1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
e.g., VecAXPY() for real vectors of length N --> 2N flop
and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages: ----- Time ------ ----- Flop ----- --- Messages --- -- Message Lengths -- -- Reductions --
Avg %Total Avg %Total counts %Total Avg %Total counts %Total
0: Main Stage: 4.4384e+02 100.0% 5.3722e+12 100.0% 6.821e+05 100.0% 2.095e+04 100.0% 4.565e+03 100.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
Count: number of times phase was executed
Time and Flop: Max - maximum over all processors
Ratio - ratio of maximum to minimum over all processors
Mess: number of messages sent
Avg. len: average message length (bytes)
Reduct: number of global reductions
Global: entire computation
Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %F - percent flop in this phase
%M - percent messages in this phase %L - percent message lengths in this phase
%R - percent reductions in this phase
Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event Count Time (sec) Flop --- Global --- --- Stage --- Total
Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

BuildTwoSided 14 1.0 2.9377e-02 1.0 0.00e+00 0.0 9.5e+02 8.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecDot 7 1.0 7.1556e-02 2.3 2.33e+07 1.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 10418
VecMDot 603 1.0 5.2095e+00 3.5 1.99e+09 1.0 0.0e+00 0.0e+00 6.0e+02 1 1 0 0 13 1 1 0 0 13 12229
VecNorm 681 1.0 1.4214e+00 2.5 5.10e+08 1.0 0.0e+00 0.0e+00 6.8e+02 0 0 0 0 15 0 0 0 0 15 11482
VecScale 666 1.0 9.0572e-01 1.0 2.30e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 8128
VecCopy 377 1.0 7.2079e-01 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecSet 1566 1.0 8.3475e-01 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecAXPY 63 1.0 1.6098e-01 1.0 5.00e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 9938
VecAYPX 2400 1.0 3.5579e+00 1.7 9.53e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 1 1 0 0 0 8574
VecAXPBYCZ 1200 1.0 2.2154e+00 1.3 1.91e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 27539
VecWAXPY 7 1.0 9.9269e-02 1.0 1.16e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3755
VecMAXPY 666 1.0 4.2151e+00 1.0 2.40e+09 1.0 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 1 1 0 0 0 18228
VecAssemblyBegin 112 1.0 2.1347e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 6 0 0 0 0 6 0
VecAssemblyEnd 112 1.0 5.4836e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecPointwiseMult 182 1.0 7.6704e-03 1.3 1.74e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 7246
VecScatterBegin 3226 1.0 6.8904e-01 1.2 0.00e+00 0.0 6.0e+05 1.2e+04 0.0e+00 0 0 87 50 0 0 0 87 50 0 0
VecScatterEnd 3226 1.0 8.5603e+00 10.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0
VecSetRandom 14 1.0 1.3447e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecReduceArith 14 1.0 5.0733e-02 1.1 4.66e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 29388
VecReduceComm 7 1.0 8.0802e-03 17.9 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecNormalize 666 1.0 2.0227e+00 1.4 6.90e+08 1.0 0.0e+00 0.0e+00 6.7e+02 0 0 0 0 15 0 0 0 0 15 10918
MatMult 2410 1.0 1.2050e+02 1.1 8.28e+10 1.0 5.1e+05 1.3e+04 0.0e+00 27 49 75 46 0 27 49 75 46 0 21983
MatMultAdd 300 1.0 1.7866e+00 1.1 6.40e+08 1.0 2.4e+04 4.3e+03 0.0e+00 0 0 4 1 0 0 0 4 1 0 11460
MatMultTranspose 332 1.0 2.7783e+00 2.0 7.42e+08 1.0 2.7e+04 4.4e+03 0.0e+00 0 0 4 1 0 0 0 4 1 0 8547
MatSolve 50 0.0 4.8852e-04 0.0 1.57e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 322
MatSOR 2262 1.0 2.2116e+02 1.1 7.56e+10 1.0 0.0e+00 0.0e+00 0.0e+00 48 45 0 0 0 48 45 0 0 0 10936
MatLUFactorSym 7 1.0 1.3500e-02 42.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatLUFactorNum 7 1.0 1.7295e-02 4.1 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 17
MatScale 42 1.0 1.9886e-02 1.1 1.47e+05 1.2 1.9e+03 1.7e+02 0.0e+00 0 0 0 0 0 0 0 0 0 0 213
MatResidual 300 1.0 1.5420e+01 1.2 1.01e+10 1.0 6.5e+04 1.3e+04 0.0e+00 3 6 10 6 0 3 6 10 6 0 20957
MatAssemblyBegin 387 1.0 1.5306e+00 4.7 0.00e+00 0.0 1.7e+04 4.2e+05 4.0e+02 0 0 2 49 9 0 0 2 49 9 0
MatAssemblyEnd 387 1.0 1.0585e+00 1.0 0.00e+00 0.0 2.5e+04 5.5e+02 9.0e+02 0 0 4 0 20 0 0 4 0 20 0
MatGetRow 20090 1.1 7.6053e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetRowIJ 7 0.0 6.1123e-03 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatCreateSubMat 28 1.0 5.6903e-02 1.0 0.00e+00 0.0 3.0e+03 6.3e+02 4.8e+02 0 0 0 0 10 0 0 0 0 10 0
MatGetOrdering 7 0.0 2.7789e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatCoarsen 14 1.0 6.8002e-02 1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01 0 0 2 0 2 0 0 2 0 2 0
MatZeroEntries 49 1.0 3.3608e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0
MatAXPY 14 1.0 9.8815e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01 0 0 0 0 1 0 0 0 0 1 0
MatMatMult 14 1.0 2.3916e-02 1.0 6.36e+05 1.1 1.1e+04 3.6e+02 2.2e+02 0 0 2 0 5 0 0 2 0 5 758
MatMatMultSym 14 1.0 2.2179e-02 1.0 0.00e+00 0.0 8.7e+03 3.2e+02 2.0e+02 0 0 1 0 4 0 0 1 0 4 0
MatMatMultNum 14 1.0 1.6932e-03 1.0 6.36e+05 1.1 1.9e+03 5.5e+02 2.8e+01 0 0 0 0 1 0 0 0 0 1 10710
MatPtAP 14 1.0 6.5053e-02 1.0 3.73e+06 1.3 1.9e+04 7.2e+02 2.4e+02 0 0 3 0 5 0 0 3 0 5 1534
MatPtAPSymbolic 14 1.0 5.1435e-02 1.0 0.00e+00 0.0 1.0e+04 1.0e+03 9.8e+01 0 0 1 0 2 0 0 1 0 2 0
MatPtAPNumeric 14 1.0 1.3652e-02 1.0 3.73e+06 1.3 8.8e+03 3.8e+02 1.4e+02 0 0 1 0 3 0 0 1 0 3 7311
MatTrnMatMult 7 1.0 8.5686e-02 1.0 1.57e+06 1.0 1.1e+04 2.3e+03 1.3e+02 0 0 2 0 3 0 0 2 0 3 585
MatTrnMatMultSym 7 1.0 5.9986e-02 1.0 0.00e+00 0.0 9.0e+03 1.4e+03 1.2e+02 0 0 1 0 3 0 0 1 0 3 0
MatTrnMatMultNum 7 1.0 1.2683e-02 1.0 1.57e+06 1.0 1.8e+03 6.5e+03 1.4e+01 0 0 0 0 0 0 0 0 0 0 3950
MatGetLocalMat 56 1.0 9.4686e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetBrAoCol 42 1.0 6.6102e-03 1.2 0.00e+00 0.0 1.3e+04 9.8e+02 0.0e+00 0 0 2 0 0 0 0 2 0 0 0
DMCoarsen 4 1.0 1.4358e-02 1.0 0.00e+00 0.0 4.1e+03 7.1e+02 8.8e+01 0 0 1 0 2 0 0 1 0 2 0
DMCreateInterp 4 1.0 4.6864e-01 1.0 1.28e+07 1.0 1.9e+03 2.5e+03 1.0e+02 0 0 0 0 2 0 0 0 0 2 873
SFSetGraph 14 1.0 1.1373e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
SFBcastBegin 98 1.0 5.5954e-02 1.0 0.00e+00 0.0 1.1e+04 2.6e+02 0.0e+00 0 0 2 0 0 0 0 2 0 0 0
SFBcastEnd 98 1.0 5.0378e-04 4.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
SNESSolve 1 1.0 4.3749e+02 1.0 1.68e+11 1.0 6.8e+05 2.1e+04 4.5e+03 99 100 100 100 99 99 100 100 100 99 12280
SNESFunctionEval 8 1.0 1.5460e+01 1.0 0.00e+00 0.0 4.1e+03 4.7e+04 0.0e+00 3 0 1 1 0 3 0 1 1 0 0
SNESJacobianEval 35 1.0 6.8994e+01 1.0 0.00e+00 0.0 1.9e+04 3.8e+05 1.3e+02 16 0 3 50 3 16 0 3 50 3 0
SNESLineSearch 7 1.0 1.5535e+01 1.0 1.35e+09 1.0 5.4e+03 4.7e+04 2.8e+01 3 1 1 2 1 3 1 1 2 1 2789
KSPGMRESOrthog 603 1.0 8.6574e+00 1.7 3.98e+09 1.0 0.0e+00 0.0e+00 6.0e+02 2 2 0 0 13 2 2 0 0 13 14718
KSPSetUp 102 1.0 1.2778e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 8.8e+01 0 0 0 0 2 0 0 0 0 2 0
KSPSolve 7 1.0 3.5937e+02 1.0 1.67e+11 1.0 6.7e+05 1.3e+04 4.5e+03 81 99 98 60 98 81 99 98 60 98 14828
PCGAMGGraph_AGG 14 1.0 7.9316e-02 1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02 0 0 1 0 8 0 0 1 0 8 34
PCGAMGCoarse_AGG 14 1.0 2.2306e-01 1.0 1.57e+06 1.0 3.1e+04 9.6e+02 3.2e+02 0 0 5 0 7 0 0 5 0 7 225
PCGAMGProl_AGG 14 1.0 1.6573e-01 1.0 0.00e+00 0.0 1.3e+04 3.1e+02 5.5e+02 0 0 2 0 12 0 0 2 0 12 0
PCGAMGPOpt_AGG 14 1.0 3.2763e-01 1.0 5.48e+06 1.1 3.0e+04 3.5e+02 6.6e+02 0 0 4 0 14 0 0 4 0 14 482
GAMG: createProl 14 1.0 8.0299e-01 1.0 7.14e+06 1.1 8.3e+04 5.4e+02 1.9e+03 0 0 12 0 41 0 0 12 0 41 263
Graph 28 1.0 6.5082e-02 1.0 9.21e+04 1.1 9.5e+03 9.7e+01 3.6e+02 0 0 1 0 8 0 0 1 0 8 41
MIS/Agg 14 1.0 7.6058e-02 1.0 0.00e+00 0.0 1.1e+04 2.6e+02 7.0e+01 0 0 2 0 2 0 0 2 0 2 0
SA: col data 14 1.0 6.3233e-03 1.0 0.00e+00 0.0 9.5e+03 3.0e+02 3.5e+02 0 0 1 0 8 0 0 1 0 8 0
SA: frmProl0 14 1.0 1.5851e-01 1.0 0.00e+00 0.0 3.5e+03 3.6e+02 1.4e+02 0 0 1 0 3 0 0 1 0 3 0
SA: smooth 14 1.0 5.2157e-02 1.0 6.91e+05 1.2 1.1e+04 3.6e+02 2.8e+02 0 0 2 0 6 0 0 2 0 6 378
GAMG: partLevel 14 1.0 1.7075e-01 1.0 3.73e+06 1.3 2.3e+04 6.9e+02 9.8e+02 0 0 3 0 21 0 0 3 0 21 585
repartition 14 1.0 1.4938e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01 0 0 0 0 2 0 0 0 0 2 0
Invert-Sort 14 1.0 8.9898e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 1 0 0 0 0 1 0
Move A 14 1.0 5.6126e-02 1.0 0.00e+00 0.0 1.6e+03 1.2e+03 2.5e+02 0 0 0 0 6 0 0 0 0 6 0
Move P 14 1.0 5.5060e-03 1.0 0.00e+00 0.0 1.4e+03 4.8e+01 2.5e+02 0 0 0 0 6 0 0 0 0 6 0
PCSetUp 14 1.0 1.0860e+01 1.0 1.15e+08 1.0 1.3e+05 1.4e+04 3.4e+03 2 0 19 13 74 2 0 19 13 74 335
PCSetUpOnBlocks 50 1.0 4.5601e-02 1.6 2.89e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 6
PCApply 50 1.0 3.3545e+02 1.0 1.57e+11 1.0 5.3e+05 1.2e+04 9.7e+02 75 94 77 44 21 75 94 77 44 21 15017
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type Creations Destructions Memory Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

Toy Hydrostatic Ice 1 1 832 0.
Vector 1153 1153 645876560 0.
Vector Scatter 142 142 15698576 0.
Matrix 536 536 1270286152 0.
Matrix Coarsen 14 14 9464 0.
Distributed Mesh 10 10 54240 0.
Index Set 445 445 9319080 0.
IS L to G Mapping 10 10 8116048 0.
Star Forest Bipartite Graph 34 34 30000 0.
Discrete System 10 10 9288 0.
SNES 1 1 1488 0.
SNESLineSearch 1 1 1040 0.
DMSNES 5 5 3880 0.
Krylov Solver 30 30 989856 0.
DMKSP interface 5 5 3480 0.
Preconditioner 24 24 24968 0.
PetscRandom 28 28 19208 0.
Viewer 1 0 0 0.
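
A note on reading the "Total Mflop/s" column of the -da_refine 4 run: the legend prints the prefactor as "10e-6", but the tabulated values look consistent with a factor of 1e-6. The following is a minimal Python sketch that rechecks two numbers from this log. The function names are hypothetical, and the per-event flop sum is approximated as (max flop per process) x (32 processes), an assumption that is only reasonable because the reported flop ratio for that event is 1.0.

```python
# Sanity check of flop rates reported by -log_view (values copied from this log).
# Assumption: for an event with flop ratio 1.0, the sum over processes is
# approximated by max_flop_per_process * number_of_processes.

NPROCS = 32  # "with 32 processors" in the run header


def aggregate_flop_rate(total_flop, max_time_s):
    """Total flop over all processes divided by the max wall time (flop/s)."""
    return total_flop / max_time_s


def event_mflops(max_flop_per_proc, max_time_s, nprocs=NPROCS):
    """Approximate an event's Total Mflop/s, using a 1e-6 prefactor."""
    return 1e-6 * (max_flop_per_proc * nprocs) / max_time_s


# Whole run: "Flop" Total column and "Time (sec)" Max column.
run_rate = aggregate_flop_rate(5.372e12, 4.439e02)

# KSPSolve row: max flop 1.67e+11, max time 3.5937e+02 s, reported 14828 Mflop/s.
ksp_rate = event_mflops(1.67e11, 3.5937e02)

print(f"run flop/s   ~ {run_rate:.3e} (log reports 1.210e+10)")
print(f"KSPSolve     ~ {ksp_rate:.0f} Mflop/s (log reports 14828)")
```

The small difference for KSPSolve is expected under this approximation, since the log rounds the per-process maximum to three digits and the true sum over processes is not exactly 32 times the maximum.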
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 2.38419e-06
Average time for zero size MPI_Send(): 1.71363e-06
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 4
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8
Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt
-----------------------------------------
Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04
Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64
Using PETSc directory: /global/homes/j/jychang/Software/petsc
Using PETSc arch: arch-cori-c-opt
-----------------------------------------
Using C compiler: cc ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl
-----------------------------------------
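
The application-level "Rate" and "Intensity" figures printed before the performance summary of the -da_refine 4 run are not defined anywhere in this log. As a hedged reading, they appear consistent with degrees-of-freedom per second of wall-clock time and flop per L1 miss, respectively. The short Python sketch below rechecks that interpretation against the printed values; the function names are hypothetical and the definitions are an inference from the numbers, not something the log states.

```python
# Recheck of the derived metrics printed by the -da_refine 4 run.
# Assumed definitions (inferred from the numbers, not stated in the log):
#   Rate      = degrees of freedom / wall-clock time
#   Intensity = FLOPS / L1 misses


def dof_rate(dofs, wall_time_s):
    """Degrees of freedom processed per second of wall-clock time."""
    return dofs / wall_time_s


def intensity(flops, l1_misses):
    """Flop per L1 cache miss."""
    return flops / l1_misses


# Values copied from the statistics block of the -da_refine 4 run.
rate = dof_rate(53248000, 4.437e02)     # log reports Rate: 1.200e+05
inten = intensity(5.372e12, 6.905e11)   # log reports Intensity: 7.780e+00

print(f"Rate      ~ {rate:.3e}")
print(f"Intensity ~ {inten:.3e}")
```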