ModuleCmd_Switch.c(179):ERROR:152: Module 'PrgEnv-intel/6.0.3' is currently not loaded
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 46
|X|_2 3996.69 -1.01384e-10 <= u <= 24.479 -3.10124 <= v <= 3.10124 1.53208e-13 <= c <= 24.479
Surface statistics: u in [1.214710e+01, 2.447899e+01] mean 2.010957e+01
Global eta range 2.96251e+10 to 9.2273e+12 converged range 2.96251e+10 to 2.44973e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 8.647e+00 seconds
Degrees-of-freedom: 115200
FLOPS: 1.069e+10
L1 misses: 4.078e+08
Intensity: 2.622e+01
Rate: 1.332e+04
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./ex48cori on a arch-cori-c-opt named nid12477 with 64 processors, by jychang Tue Apr 4 18:18:58 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           8.830e+00      1.00041   8.828e+00
Objects:              2.216e+03      1.00636   2.202e+03
Flop:                 1.717e+08      1.03060   1.671e+08  1.069e+10
Flop/sec:             1.945e+07      1.03058   1.893e+07  1.211e+09
MPI Messages:         1.419e+04      1.36956   1.080e+04  6.912e+05
MPI Message Lengths:  7.700e+06      1.14498   6.343e+02  4.384e+08
MPI Reductions:       3.706e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                          e.g., VecAXPY() for real vectors of length N --> 2N flop
                          and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages:  ----- Time ------   ----- Flop -----   --- Messages ---   -- Message Lengths --   -- Reductions --
                       Avg      %Total     Avg      %Total   counts    %Total     Avg       %Total     counts    %Total
 0:  Main Stage:    8.8155e+00   99.9%  1.0695e+10  100.0%  6.912e+05  100.0%  6.343e+02    100.0%   3.705e+03  100.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage BuildTwoSided 14 1.0 5.5515e-02 1.0 0.00e+00 0.0 1.8e+03 8.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecDot 7 1.0 7.6556e-04 2.5 2.52e+04 1.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 2106 VecMDot 396 1.0 4.0381e-02 1.2 2.51e+06 1.0 0.0e+00 0.0e+00 4.0e+02 0 1 0 0 11 0 1 0 0 11 3820 VecNorm 453 1.0 1.1778e-01 1.1 6.20e+05 1.0 0.0e+00 0.0e+00 4.5e+02 1 0 0 0 12 1 0 0 0 12 326 VecScale 438 1.0 3.4211e-03 1.2 2.83e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5112 VecCopy 215 1.0 1.0784e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1050 1.0 2.1647e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 42 1.0 7.9656e-03 1.3 5.93e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 462 VecAYPX 1272 1.0 4.2536e-03 1.1 1.12e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 16386 VecAXPBYCZ 636 1.0 3.0270e-03 1.1 2.25e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 46052 VecWAXPY 7 1.0 8.1778e-05 1.4 1.26e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 9861 VecMAXPY 438 1.0 1.2118e-02 1.2 3.02e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 15321 VecAssemblyBegin 112 1.0 3.8507e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 8 0 0 0 0 8 0 VecAssemblyEnd 112 1.0 3.8528e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 
0 0 0 VecPointwiseMult 161 1.0 7.1597e-03 1.3 2.62e+04 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 191 VecScatterBegin 1846 1.0 8.8701e-02 1.1 0.00e+00 0.0 5.6e+05 3.8e+02 0.0e+00 1 0 81 49 0 1 0 81 49 0 0 VecScatterEnd 1846 1.0 7.1634e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecSetRandom 14 1.0 3.7098e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecReduceArith 14 1.0 7.3910e-05 1.2 5.04e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 43630 VecReduceComm 7 1.0 1.7388e-0312.4 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 438 1.0 3.3288e-02 1.5 8.49e+05 1.0 0.0e+00 0.0e+00 4.4e+02 0 0 0 0 12 0 0 0 0 12 1576 MatMult 1357 1.0 2.4321e-01 1.1 8.66e+07 1.0 4.7e+05 4.1e+02 0.0e+00 3 50 68 44 0 3 50 68 44 0 22168 MatMultAdd 159 1.0 5.4475e-02 4.3 8.44e+05 1.1 1.8e+04 1.6e+02 0.0e+00 1 0 3 1 0 1 0 3 1 0 934 MatMultTranspose 167 1.0 2.1050e-02 1.4 9.38e+05 1.1 2.0e+04 1.6e+02 0.0e+00 0 1 3 1 0 0 1 3 1 0 2701 MatSolve 53 0.0 1.2574e-03 0.0 9.38e+04 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 75 MatSOR 1185 1.0 2.8965e-01 1.1 7.01e+07 1.0 0.0e+00 0.0e+00 0.0e+00 3 41 0 0 0 3 41 0 0 0 15231 MatLUFactorSym 7 1.0 1.7575e-0218.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatLUFactorNum 7 1.0 1.7965e-02 3.9 1.23e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 7 MatScale 42 1.0 3.2594e-02 1.1 8.02e+04 1.3 3.7e+03 1.2e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 127 MatResidual 159 1.0 2.4995e-02 1.1 1.05e+07 1.0 5.5e+04 4.2e+02 0.0e+00 0 6 8 5 0 0 6 8 5 0 26079 MatAssemblyBegin 360 1.0 9.2436e-02 1.2 0.00e+00 0.0 1.7e+04 9.5e+03 3.0e+02 1 0 3 38 8 1 0 3 38 8 0 MatAssemblyEnd 360 1.0 2.3271e-01 1.1 0.00e+00 0.0 4.1e+04 5.8e+01 8.6e+02 3 0 6 1 23 3 0 6 1 23 0 MatGetRow 11130 1.3 1.5826e-02 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRowIJ 7 0.0 1.0793e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCreateSubMat 28 1.0 1.0475e-01 1.0 0.00e+00 0.0 
4.7e+03 3.4e+02 4.8e+02 1 0 1 0 13 1 0 1 0 13 0 MatGetOrdering 7 0.0 4.4314e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCoarsen 14 1.0 1.2710e-01 1.0 0.00e+00 0.0 1.7e+04 2.0e+02 5.6e+01 1 0 3 1 2 1 0 3 1 2 0 MatZeroEntries 28 1.0 8.1992e-03 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAXPY 14 1.0 2.0459e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01 0 0 0 0 1 0 0 0 0 1 0 MatMatMult 14 1.0 5.8835e-02 1.0 3.55e+05 1.3 1.9e+04 2.8e+02 2.2e+02 1 0 3 1 6 1 0 3 1 6 307 MatMatMultSym 14 1.0 5.3425e-02 1.0 0.00e+00 0.0 1.5e+04 2.6e+02 2.0e+02 1 0 2 1 5 1 0 2 1 5 0 MatMatMultNum 14 1.0 5.0399e-03 1.0 3.55e+05 1.3 3.7e+03 3.9e+02 2.8e+01 0 0 1 0 1 0 0 1 0 1 3586 MatPtAP 14 1.0 1.3213e-01 1.0 2.01e+06 1.6 3.1e+04 5.7e+02 2.4e+02 1 1 5 4 6 1 1 5 4 6 717 MatPtAPSymbolic 14 1.0 9.9159e-02 1.0 0.00e+00 0.0 1.7e+04 8.1e+02 9.8e+01 1 0 2 3 3 1 0 2 3 3 0 MatPtAPNumeric 14 1.0 3.2885e-02 1.0 2.01e+06 1.6 1.4e+04 2.8e+02 1.4e+02 0 1 2 1 4 0 1 2 1 4 2879 MatTrnMatMult 7 1.0 1.4646e-01 1.0 7.99e+05 1.0 2.2e+04 1.5e+03 1.3e+02 2 0 3 8 4 2 0 3 8 4 349 MatTrnMatMultSym 7 1.0 1.0330e-01 1.0 0.00e+00 0.0 1.8e+04 9.6e+02 1.2e+02 1 0 3 4 3 1 0 3 4 3 0 MatTrnMatMultNum 7 1.0 2.4621e-02 1.0 7.99e+05 1.0 3.6e+03 4.4e+03 1.4e+01 0 0 1 4 0 0 0 1 4 0 2076 MatGetLocalMat 56 1.0 1.6483e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetBrAoCol 42 1.0 1.7893e-02 1.3 0.00e+00 0.0 2.6e+04 6.8e+02 0.0e+00 0 0 4 4 0 0 0 4 4 0 0 DMCoarsen 1 1.0 3.9919e-02 1.1 0.00e+00 0.0 2.0e+03 3.8e+01 2.2e+01 0 0 0 0 1 0 0 0 0 1 0 DMCreateInterp 1 1.0 2.8365e-01 1.0 1.17e+04 1.0 9.6e+02 1.0e+02 2.5e+01 3 0 0 0 1 3 0 0 0 1 3 SFSetGraph 14 1.0 4.7231e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFBcastBegin 84 1.0 1.0454e-01 1.1 0.00e+00 0.0 1.7e+04 2.0e+02 0.0e+00 1 0 3 1 0 1 0 3 1 0 0 SFBcastEnd 84 1.0 6.4907e-0316.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SNESSolve 1 1.0 4.2976e+00 1.0 1.72e+08 1.0 6.9e+05 
6.4e+02 3.7e+03 49100100100 99 49100100100 99 2489 SNESFunctionEval 8 1.0 1.5724e-01 1.0 0.00e+00 0.0 8.2e+03 5.9e+02 0.0e+00 2 0 1 1 0 2 0 1 1 0 0 SNESJacobianEval 14 1.0 3.5013e-01 1.0 0.00e+00 0.0 1.5e+04 1.1e+04 4.2e+01 4 0 2 37 1 4 0 2 37 1 0 SNESLineSearch 7 1.0 8.2637e-02 1.0 1.30e+06 1.0 1.1e+04 5.9e+02 2.8e+01 1 1 2 1 1 1 1 2 1 1 1011 KSPGMRESOrthog 396 1.0 6.6370e-02 1.1 5.02e+06 1.0 0.0e+00 0.0e+00 4.0e+02 1 3 0 0 11 1 3 0 0 11 4650 KSPSetUp 78 1.0 1.8524e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+01 0 0 0 0 1 0 0 0 0 1 0 KSPSolve 7 1.0 3.6880e+00 1.0 1.70e+08 1.0 6.7e+05 4.8e+02 3.6e+03 42 99 97 73 97 42 99 97 73 97 2877 PCGAMGGraph_AGG 14 1.0 1.6327e-01 1.0 5.07e+04 1.2 1.8e+04 6.9e+01 3.6e+02 2 0 3 0 10 2 0 3 0 10 16 PCGAMGCoarse_AGG 14 1.0 3.9530e-01 1.0 7.99e+05 1.0 5.7e+04 7.0e+02 3.0e+02 4 0 8 9 8 4 0 8 9 8 129 PCGAMGProl_AGG 14 1.0 4.0973e-01 1.0 0.00e+00 0.0 2.4e+04 2.5e+02 5.5e+02 5 0 4 1 15 5 0 4 1 15 0 PCGAMGPOpt_AGG 14 1.0 7.0727e-01 1.0 3.03e+06 1.3 5.6e+04 2.6e+02 6.6e+02 8 1 8 3 18 8 1 8 3 18 223 GAMG: createProl 14 1.0 1.6950e+00 1.0 3.87e+06 1.2 1.6e+05 4.0e+02 1.9e+03 19 2 22 14 50 19 2 22 14 50 125 Graph 28 1.0 1.3609e-01 1.0 5.07e+04 1.2 1.8e+04 6.9e+01 3.6e+02 2 0 3 0 10 2 0 3 0 10 20 MIS/Agg 14 1.0 1.4187e-01 1.0 0.00e+00 0.0 1.7e+04 2.0e+02 5.6e+01 2 0 3 1 2 2 0 3 1 2 0 SA: col data 14 1.0 2.1919e-02 1.0 0.00e+00 0.0 1.8e+04 2.2e+02 3.5e+02 0 0 3 1 9 0 0 3 1 9 0 SA: frmProl0 14 1.0 3.8215e-01 1.0 0.00e+00 0.0 5.8e+03 3.3e+02 1.4e+02 4 0 1 0 4 4 0 1 0 4 0 SA: smooth 14 1.0 1.1343e-01 1.0 3.85e+05 1.3 1.9e+04 2.8e+02 2.8e+02 1 0 3 1 8 1 0 3 1 8 172 GAMG: partLevel 14 1.0 3.3553e-01 1.0 2.01e+06 1.6 3.7e+04 5.2e+02 9.8e+02 4 1 5 4 26 4 1 5 4 26 282 repartition 14 1.0 3.1958e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01 0 0 0 0 2 0 0 0 0 2 0 Invert-Sort 14 1.0 1.8250e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 2 0 0 0 0 2 0 Move A 14 1.0 9.0899e-02 1.0 0.00e+00 0.0 2.7e+03 5.6e+02 2.5e+02 1 0 0 0 7 1 0 0 0 7 0 Move P 14 1.0 
2.5607e-02 1.0 0.00e+00 0.0 2.1e+03 4.0e+01 2.5e+02 0 0 0 0 7 0 0 0 0 7 0 PCSetUp 14 1.0 2.7166e+00 1.0 5.98e+06 1.3 2.1e+05 6.5e+02 3.0e+03 30 3 30 31 82 31 3 30 31 82 115 PCSetUpOnBlocks 53 1.0 6.8124e-02 1.6 1.23e+05 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 2 PCApply 53 1.0 9.9841e-01 1.0 1.55e+08 1.0 4.4e+05 3.9e+02 4.4e+02 11 91 64 39 12 11 91 64 39 12 9712 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Toy Hydrostatic Ice 1 1 832 0. Vector 1036 1036 3301336 0. Vector Scatter 127 127 172712 0. Matrix 509 509 8060064 0. Matrix Coarsen 14 14 9464 0. Distributed Mesh 4 4 21744 0. Index Set 415 415 424856 0. IS L to G Mapping 4 4 16720 0. Star Forest Bipartite Graph 22 22 19728 0. Discrete System 4 4 3720 0. SNES 1 1 1488 0. SNESLineSearch 1 1 1040 0. DMSNES 2 2 1504 0. Krylov Solver 24 24 894048 0. DMKSP interface 2 2 1392 0. Preconditioner 21 21 21848 0. PetscRandom 28 28 19208 0. Viewer 1 0 0 0. 
========================================================================================================================
Average time to get PetscTime(): 5.00679e-07
Average time for MPI_Barrier(): 6.77109e-06
Average time for zero size MPI_Send(): 6.78003e-06
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 1
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8
Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt
-----------------------------------------
Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04
Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64
Using PETSc directory: /global/homes/j/jychang/Software/petsc
Using PETSc arch: arch-cori-c-opt
-----------------------------------------
Using C compiler: cc ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl
-----------------------------------------
Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 7, total linear iterations = 45
|X|_2 11109.6 -3.37374e-12 <= u <= 24.5864 -3.11696 <= v <= 3.11696 4.08846e-15 <= c <= 24.5864
Surface statistics: u in [1.222055e+01, 2.458639e+01] mean 2.020187e+01
Global eta range 2.76911e+10 to 9.2273e+12 converged range 2.76911e+10 to 4.45572e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 1.436e+01 seconds
Degrees-of-freedom: 870400
FLOPS: 8.480e+10
L1 misses: 2.010e+09
Intensity: 4.218e+01
Rate: 6.061e+04
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./ex48cori on a arch-cori-c-opt named nid12477 with 64 processors, by jychang Tue Apr 4 18:19:21 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           1.473e+01      1.00021   1.473e+01
Objects:              2.294e+03      1.00614   2.280e+03
Flop:                 1.330e+09      1.00381   1.325e+09  8.480e+10
Flop/sec:             9.028e+07      1.00385   8.997e+07  5.758e+09
MPI Messages:         1.761e+04      1.27402   1.425e+04  9.123e+05
MPI Message Lengths:  2.223e+07      1.04556   1.500e+03  1.369e+09
MPI Reductions:       3.988e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                          e.g., VecAXPY() for real vectors of length N --> 2N flop
                          and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages:  ----- Time ------   ----- Flop -----   --- Messages ---   -- Message Lengths --   -- Reductions --
                       Avg      %Total     Avg      %Total   counts    %Total     Avg       %Total     counts    %Total
 0:  Main Stage:    1.4717e+01   99.9%  8.4798e+10  100.0%  9.123e+05  100.0%  1.500e+03    100.0%   3.987e+03  100.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage BuildTwoSided 14 1.0 8.3925e-02 1.1 0.00e+00 0.0 1.8e+03 8.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecDot 7 1.0 3.3703e-03 4.3 1.90e+05 1.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 3615 VecMDot 465 1.0 9.4529e-02 1.8 1.70e+07 1.0 0.0e+00 0.0e+00 4.6e+02 1 1 0 0 12 1 1 0 0 12 11442 VecNorm 529 1.0 1.6931e-01 1.2 4.29e+06 1.0 0.0e+00 0.0e+00 5.3e+02 1 0 0 0 13 1 0 0 0 13 1615 VecScale 514 1.0 5.0766e-03 1.2 1.94e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 24359 VecCopy 271 1.0 3.7184e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1228 1.0 3.4874e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 49 1.0 1.1013e-02 1.5 4.15e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2401 VecAYPX 1664 1.0 1.8009e-02 1.1 8.17e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 28929 VecAXPBYCZ 832 1.0 1.2977e-02 1.1 1.63e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 80294 VecWAXPY 7 1.0 3.4213e-04 1.1 9.52e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 17808 VecMAXPY 514 1.0 2.7991e-02 1.1 2.05e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 46536 VecAssemblyBegin 112 1.0 6.6179e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 7 0 0 0 0 7 0 VecAssemblyEnd 112 1.0 4.1127e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 
0 0 0 0 0 0 VecPointwiseMult 168 1.0 1.0634e-02 1.3 3.88e+04 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 204 VecScatterBegin 2322 1.0 1.1115e-01 1.1 0.00e+00 0.0 7.7e+05 9.1e+02 0.0e+00 1 0 85 52 0 1 0 85 52 0 0 VecScatterEnd 2322 1.0 2.2482e-01 1.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecSetRandom 14 1.0 3.9992e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecReduceArith 14 1.0 2.6488e-04 1.1 3.81e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 92004 VecReduceComm 7 1.0 4.0891e-03 7.7 0.00e+00 0.0 0.0e+00 0.0e+00 7.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 514 1.0 7.4366e-02 1.3 5.83e+06 1.0 0.0e+00 0.0e+00 5.1e+02 0 0 0 0 13 0 0 0 0 13 4989 MatMult 1720 1.0 1.0676e+00 1.0 6.63e+08 1.0 6.6e+05 9.9e+02 0.0e+00 7 50 72 47 0 7 50 72 47 0 39611 MatMultAdd 208 1.0 7.8553e-02 2.3 5.51e+06 1.0 2.8e+04 3.7e+02 0.0e+00 1 0 3 1 0 1 0 3 1 0 4448 MatMultTranspose 224 1.0 1.5899e-01 2.9 6.32e+06 1.0 3.1e+04 3.8e+02 0.0e+00 1 0 3 1 0 1 0 3 1 0 2525 MatSolve 52 0.0 1.4913e-03 0.0 9.20e+04 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 62 MatSOR 1556 1.0 1.6883e+00 1.0 5.82e+08 1.0 0.0e+00 0.0e+00 0.0e+00 11 44 0 0 0 11 44 0 0 0 22019 MatLUFactorSym 7 1.0 2.0966e-0213.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatLUFactorNum 7 1.0 2.4120e-02 5.0 1.23e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5 MatScale 42 1.0 3.9835e-02 1.1 8.02e+04 1.3 3.7e+03 1.2e+02 0.0e+00 0 0 0 0 0 0 0 0 0 0 104 MatResidual 208 1.0 1.3544e-01 1.1 8.12e+07 1.0 8.1e+04 9.8e+02 0.0e+00 1 6 9 6 0 1 6 9 6 0 38261 MatAssemblyBegin 369 1.0 4.1530e-01 1.3 0.00e+00 0.0 2.2e+04 2.8e+04 3.4e+02 3 0 2 44 8 3 0 2 44 8 0 MatAssemblyEnd 369 1.0 5.9109e-01 1.0 0.00e+00 0.0 4.3e+04 7.6e+01 8.8e+02 4 0 5 0 22 4 0 5 0 22 0 MatGetRow 11130 1.3 1.6117e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRowIJ 7 0.0 1.0450e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCreateSubMat 28 1.0 1.1240e-01 1.0 
0.00e+00 0.0 4.7e+03 3.4e+02 4.8e+02 1 0 1 0 12 1 0 1 0 12 0 MatGetOrdering 7 0.0 4.1057e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCoarsen 14 1.0 2.3110e-01 1.0 0.00e+00 0.0 1.7e+04 2.0e+02 5.6e+01 2 0 2 0 1 2 0 2 0 1 0 MatZeroEntries 35 1.0 3.1219e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAXPY 14 1.0 2.5521e-02 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01 0 0 0 0 1 0 0 0 0 1 0 MatMatMult 14 1.0 1.1578e-01 1.1 3.55e+05 1.3 1.9e+04 2.8e+02 2.2e+02 1 0 2 0 6 1 0 2 0 6 156 MatMatMultSym 14 1.0 1.0316e-01 1.1 0.00e+00 0.0 1.5e+04 2.6e+02 2.0e+02 1 0 2 0 5 1 0 2 0 5 0 MatMatMultNum 14 1.0 5.2454e-03 1.0 3.55e+05 1.3 3.7e+03 3.9e+02 2.8e+01 0 0 0 0 1 0 0 0 0 1 3446 MatPtAP 14 1.0 1.4122e-01 1.0 2.01e+06 1.6 3.1e+04 5.7e+02 2.4e+02 1 0 3 1 6 1 0 3 1 6 670 MatPtAPSymbolic 14 1.0 1.0606e-01 1.0 0.00e+00 0.0 1.7e+04 8.1e+02 9.8e+01 1 0 2 1 2 1 0 2 1 2 0 MatPtAPNumeric 14 1.0 3.5112e-02 1.0 2.01e+06 1.6 1.4e+04 2.8e+02 1.4e+02 0 0 2 0 4 0 0 2 0 4 2696 MatTrnMatMult 7 1.0 2.0047e-01 1.0 7.99e+05 1.0 2.2e+04 1.5e+03 1.3e+02 1 0 2 2 3 1 0 2 2 3 255 MatTrnMatMultSym 7 1.0 1.2940e-01 1.1 0.00e+00 0.0 1.8e+04 9.6e+02 1.2e+02 1 0 2 1 3 1 0 2 1 3 0 MatTrnMatMultNum 7 1.0 2.9624e-02 1.0 7.99e+05 1.0 3.6e+03 4.4e+03 1.4e+01 0 0 0 1 0 0 0 0 1 0 1725 MatGetLocalMat 56 1.0 2.0188e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetBrAoCol 42 1.0 4.0787e-02 1.2 0.00e+00 0.0 2.6e+04 6.8e+02 0.0e+00 0 0 3 1 0 0 0 3 1 0 0 DMCoarsen 2 1.0 7.6291e-02 1.0 0.00e+00 0.0 4.1e+03 7.5e+01 4.4e+01 1 0 0 0 1 1 0 0 0 1 0 DMCreateInterp 2 1.0 3.0714e-01 1.0 1.02e+05 1.0 1.9e+03 2.3e+02 5.0e+01 2 0 0 0 1 2 0 0 0 1 21 SFSetGraph 14 1.0 4.0817e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFBcastBegin 84 1.0 1.8828e-01 1.0 0.00e+00 0.0 1.7e+04 2.0e+02 0.0e+00 1 0 2 0 0 1 0 2 0 0 0 SFBcastEnd 84 1.0 4.4432e-03 8.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SNESSolve 1 1.0 1.0420e+01 1.0 1.33e+09 1.0 
9.1e+05 1.5e+03 3.9e+03 71100100100 99 71100100100 99 8138 SNESFunctionEval 8 1.0 6.5219e-01 1.0 0.00e+00 0.0 8.2e+03 2.1e+03 0.0e+00 4 0 1 1 0 4 0 1 1 0 0 SNESJacobianEval 21 1.0 2.4299e+00 1.0 0.00e+00 0.0 2.3e+04 2.6e+04 7.0e+01 16 0 3 44 2 16 0 3 44 2 0 SNESLineSearch 7 1.0 5.7857e-01 1.0 1.05e+07 1.0 1.1e+04 2.1e+03 2.8e+01 4 1 1 2 1 4 1 1 2 1 1162 KSPGMRESOrthog 465 1.0 1.3661e-01 1.4 3.40e+07 1.0 0.0e+00 0.0e+00 4.6e+02 1 3 0 0 12 1 3 0 0 12 15835 KSPSetUp 86 1.0 3.6672e-02 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 1 0 0 0 0 1 0 KSPSolve 7 1.0 7.4996e+00 1.0 1.32e+09 1.0 8.9e+05 1.0e+03 3.9e+03 51 99 98 66 97 51 99 98 66 97 11217 PCGAMGGraph_AGG 14 1.0 2.8959e-01 1.0 5.07e+04 1.2 1.8e+04 6.9e+01 3.6e+02 2 0 2 0 9 2 0 2 0 9 9 PCGAMGCoarse_AGG 14 1.0 5.9780e-01 1.0 7.99e+05 1.0 5.7e+04 7.0e+02 3.0e+02 4 0 6 3 8 4 0 6 3 8 85 PCGAMGProl_AGG 14 1.0 7.4159e-01 1.0 0.00e+00 0.0 2.4e+04 2.5e+02 5.5e+02 5 0 3 0 14 5 0 3 0 14 0 PCGAMGPOpt_AGG 14 1.0 8.7707e-01 1.0 3.03e+06 1.3 5.6e+04 2.6e+02 6.6e+02 6 0 6 1 16 6 0 6 1 17 180 GAMG: createProl 14 1.0 2.5156e+00 1.0 3.87e+06 1.2 1.6e+05 4.0e+02 1.9e+03 17 0 17 4 47 17 0 17 4 47 84 Graph 28 1.0 2.6128e-01 1.0 5.07e+04 1.2 1.8e+04 6.9e+01 3.6e+02 2 0 2 0 9 2 0 2 0 9 10 MIS/Agg 14 1.0 2.6040e-01 1.0 0.00e+00 0.0 1.7e+04 2.0e+02 5.6e+01 2 0 2 0 1 2 0 2 0 1 0 SA: col data 14 1.0 3.9800e-02 1.0 0.00e+00 0.0 1.8e+04 2.2e+02 3.5e+02 0 0 2 0 9 0 0 2 0 9 0 SA: frmProl0 14 1.0 6.9142e-01 1.0 0.00e+00 0.0 5.8e+03 3.3e+02 1.4e+02 5 0 1 0 4 5 0 1 0 4 0 SA: smooth 14 1.0 1.8576e-01 1.1 3.85e+05 1.3 1.9e+04 2.8e+02 2.8e+02 1 0 2 0 7 1 0 2 0 7 105 GAMG: partLevel 14 1.0 3.9370e-01 1.0 2.01e+06 1.6 3.7e+04 5.2e+02 9.8e+02 3 0 4 1 25 3 0 4 1 25 240 repartition 14 1.0 4.3548e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.4e+01 0 0 0 0 2 0 0 0 0 2 0 Invert-Sort 14 1.0 4.2467e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 5.6e+01 0 0 0 0 1 0 0 0 0 1 0 Move A 14 1.0 9.9339e-02 1.0 0.00e+00 0.0 2.7e+03 5.6e+02 2.5e+02 1 0 0 0 6 1 0 0 0 6 0 Move P 14 1.0 
2.5126e-02 1.0 0.00e+00 0.0 2.1e+03 4.0e+01 2.5e+02 0 0 0 0 6 0 0 0 0 6 0 PCSetUp 14 1.0 4.1871e+00 1.0 6.72e+06 1.3 2.2e+05 1.1e+03 3.1e+03 28 0 24 18 79 28 0 24 18 79 86 PCSetUpOnBlocks 52 1.0 7.7224e-02 1.5 1.23e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2 PCApply 52 1.0 3.2490e+00 1.0 1.24e+09 1.0 6.5e+05 9.3e+02 6.2e+02 22 93 71 44 15 22 93 71 44 16 24331 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Toy Hydrostatic Ice 1 1 832 0. Vector 1075 1075 8005232 0. Vector Scatter 132 132 293952 0. Matrix 518 518 16877840 0. Matrix Coarsen 14 14 9464 0. Distributed Mesh 6 6 32576 0. Index Set 425 425 528856 0. IS L to G Mapping 6 6 87840 0. Star Forest Bipartite Graph 26 26 23152 0. Discrete System 6 6 5576 0. SNES 1 1 1488 0. SNESLineSearch 1 1 1040 0. DMSNES 3 3 2296 0. Krylov Solver 26 26 925984 0. DMKSP interface 3 3 2088 0. Preconditioner 22 22 22888 0. PetscRandom 28 28 19208 0. Viewer 1 0 0 0. 
========================================================================================================================
Average time to get PetscTime(): 5.00679e-07
Average time for MPI_Barrier(): 7.82013e-06
Average time for zero size MPI_Send(): 0.000110529
#PETSc Option Table entries:
-M 40
-N 40
-P 5
-da_refine 2
-log_view
-mg_coarse_pc_type gamg
-mg_levels_0_pc_type gamg
-mg_levels_1_sub_pc_type cholesky
-pc_type mg
-thi_mat_type baij
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8
Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt
-----------------------------------------
Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04
Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64
Using PETSc directory: /global/homes/j/jychang/Software/petsc
Using PETSc arch: arch-cori-c-opt
-----------------------------------------
Using C compiler: cc ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include
-----------------------------------------
Using C linker: cc
Using Fortran linker: ftn
Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl
-----------------------------------------
Level 3 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 320 x 320 x 33 (3379200), size (m) 31.25 x 31.25 x 31.25
Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5
Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125.
Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250.
Solution statistics after solve: Full
CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 8, total linear iterations = 52
|X|_2 31108.6 -9.12379e-18 <= u <= 24.6134 -3.12097 <= v <= 3.12097 4.04048e-21 <= c <= 24.6134
Surface statistics: u in [1.223876e+01, 2.461340e+01] mean 2.022494e+01
Global eta range 2.6772e+10 to 9.2273e+12 converged range 2.6772e+10 to 6.96406e+12
Global beta2 range 1e+100 to 0. converged range 1e+100 to 0.
Wall-clock time: 6.581e+01 seconds
Degrees-of-freedom: 6758400
FLOPS: 7.904e+11
L1 misses: 1.742e+10
Intensity: 4.537e+01
Rate: 1.027e+05
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./ex48cori on a arch-cori-c-opt named nid12477 with 64 processors, by jychang Tue Apr 4 18:20:33 2017
Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500

                         Max       Max/Min        Avg      Total
Time (sec):           6.609e+01      1.00006   6.609e+01
Objects:              2.667e+03      1.00604   2.651e+03
Flop:                 1.236e+10      1.00047   1.235e+10  7.904e+11
Flop/sec:             1.870e+08      1.00049   1.869e+08  1.196e+10
MPI Messages:         2.432e+04      1.21798   2.046e+04  1.310e+06
MPI Message Lengths:  9.076e+07      1.01239   4.388e+03  5.747e+09
MPI Reductions:       4.830e+03      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                          e.g., VecAXPY() for real vectors of length N --> 2N flop
                          and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages:  ----- Time ------   ----- Flop -----   --- Messages ---   -- Message Lengths --   -- Reductions --
                       Avg      %Total     Avg      %Total   counts    %Total     Avg       %Total     counts    %Total
 0:  Main Stage:    6.6082e+01  100.0%  7.9042e+11  100.0%  1.310e+06  100.0%  4.388e+03    100.0%   4.829e+03  100.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage BuildTwoSided 16 1.0 1.4446e-01 1.1 0.00e+00 0.0 2.1e+03 8.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecDot 8 1.0 1.1624e-02 4.2 1.69e+06 1.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 0 0 0 0 0 0 9303 VecMDot 612 1.0 8.1869e-01 2.3 1.49e+08 1.0 0.0e+00 0.0e+00 6.1e+02 1 1 0 0 13 1 1 0 0 13 11644 VecNorm 693 1.0 4.6368e-01 1.4 3.77e+07 1.0 0.0e+00 0.0e+00 6.9e+02 1 0 0 0 14 1 0 0 0 14 5196 VecScale 676 1.0 9.6162e-02 1.1 1.70e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 11333 VecCopy 380 1.0 3.6643e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1630 1.0 7.8395e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 64 1.0 4.5922e-02 1.4 3.64e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5064 VecAYPX 2400 1.0 1.5746e-01 1.2 7.28e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 29571 VecAXPBYCZ 1200 1.0 1.1229e-01 1.3 1.46e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 82931 VecWAXPY 8 1.0 3.3190e-03 1.1 8.45e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 16290 VecMAXPY 676 1.0 1.9178e-01 1.0 1.80e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 59863 VecAssemblyBegin 128 1.0 1.2105e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.4e+02 0 0 0 0 7 0 0 0 0 7 0 VecAssemblyEnd 128 1.0 5.0521e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 
0 0 0 0 0 0 VecPointwiseMult 200 1.0 7.9799e-03 1.4 1.53e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1184 VecScatterBegin 3249 1.0 3.8720e-01 1.2 0.00e+00 0.0 1.1e+06 2.6e+03 0.0e+00 1 0 87 52 0 1 0 87 52 0 0 VecScatterEnd 3249 1.0 1.4296e+00 3.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecSetRandom 16 1.0 4.2772e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecReduceArith 16 1.0 1.1934e-02 1.4 3.38e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 18122 VecReduceComm 8 1.0 1.7081e-02 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 676 1.0 3.6172e-01 1.2 5.11e+07 1.0 0.0e+00 0.0e+00 6.8e+02 1 0 0 0 14 1 0 0 0 14 9039 MatMult 2420 1.0 9.2205e+00 1.1 6.12e+09 1.0 9.8e+05 2.8e+03 0.0e+00 13 50 75 48 0 13 50 75 48 0 42494 MatMultAdd 300 1.0 2.7077e-01 1.5 4.87e+07 1.0 4.3e+04 1.0e+03 0.0e+00 0 0 3 1 0 0 0 3 1 0 11496 MatMultTranspose 327 1.0 5.8079e-01 2.2 5.60e+07 1.0 4.9e+04 1.0e+03 0.0e+00 1 0 4 1 0 1 0 4 1 0 6160 MatSolve 60 0.0 1.2194e-02 0.0 1.06e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 9 MatSOR 2240 1.0 1.4959e+01 1.0 5.51e+09 1.0 0.0e+00 0.0e+00 0.0e+00 22 45 0 0 0 22 45 0 0 0 23571 MatLUFactorSym 8 1.0 6.1549e-02 5.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatLUFactorNum 8 1.0 2.3015e-02 2.1 1.41e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 6 MatScale 48 1.0 5.6979e-02 1.2 9.17e+04 1.3 4.2e+03 1.2e+02 0.0e+00 0 0 0 0 0 0 0 0 0 0 83 MatResidual 300 1.0 1.1370e+00 1.1 7.53e+08 1.0 1.2e+05 2.8e+03 0.0e+00 2 6 9 6 0 2 6 9 6 0 42373 MatAssemblyBegin 431 1.0 1.8237e+00 3.2 0.00e+00 0.0 2.9e+04 9.2e+04 4.2e+02 3 0 2 47 9 3 0 2 47 9 0 MatAssemblyEnd 431 1.0 1.1601e+00 1.0 0.00e+00 0.0 5.0e+04 1.3e+02 1.0e+03 2 0 4 0 21 2 0 4 0 21 0 MatGetRow 12720 1.3 2.5819e-02 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRowIJ 8 0.0 2.5271e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCreateSubMat 32 1.0 2.0623e-01 1.0 
0.00e+00 0.0 5.4e+03 3.4e+02 5.4e+02 0 0 0 0 11 0 0 0 0 11 0 MatGetOrdering 8 0.0 1.3389e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCoarsen 16 1.0 4.5810e-01 1.0 0.00e+00 0.0 2.0e+04 2.0e+02 6.4e+01 1 0 2 0 1 1 0 2 0 1 0 MatZeroEntries 48 1.0 1.7515e-01 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAXPY 16 1.0 4.0947e-02 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01 0 0 0 0 1 0 0 0 0 1 0 MatMatMult 16 1.0 1.0302e-01 1.2 4.06e+05 1.3 2.1e+04 2.8e+02 2.6e+02 0 0 2 0 5 0 0 2 0 5 201 MatMatMultSym 16 1.0 9.6708e-02 1.2 0.00e+00 0.0 1.7e+04 2.6e+02 2.2e+02 0 0 1 0 5 0 0 1 0 5 0 MatMatMultNum 16 1.0 5.9156e-03 1.0 4.06e+05 1.3 4.2e+03 3.9e+02 3.2e+01 0 0 0 0 1 0 0 0 0 1 3492 MatPtAP 16 1.0 2.2897e-01 1.0 2.30e+06 1.6 3.6e+04 5.7e+02 2.7e+02 0 0 3 0 6 0 0 3 0 6 473 MatPtAPSymbolic 16 1.0 1.7466e-01 1.1 0.00e+00 0.0 1.9e+04 8.1e+02 1.1e+02 0 0 1 0 2 0 0 1 0 2 0 MatPtAPNumeric 16 1.0 5.4217e-02 1.0 2.30e+06 1.6 1.6e+04 2.8e+02 1.6e+02 0 0 1 0 3 0 0 1 0 3 1996 MatTrnMatMult 8 1.0 4.4060e-01 1.0 9.13e+05 1.0 2.5e+04 1.5e+03 1.5e+02 1 0 2 1 3 1 0 2 1 3 133 MatTrnMatMultSym 8 1.0 2.4225e-01 1.1 0.00e+00 0.0 2.0e+04 9.6e+02 1.4e+02 0 0 2 0 3 0 0 2 0 3 0 MatTrnMatMultNum 8 1.0 3.9546e-02 1.0 9.13e+05 1.0 4.1e+03 4.4e+03 1.6e+01 0 0 0 0 0 0 0 0 0 0 1477 MatGetLocalMat 64 1.0 3.6692e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetBrAoCol 48 1.0 3.5152e-02 1.7 0.00e+00 0.0 3.0e+04 6.8e+02 0.0e+00 0 0 2 0 0 0 0 2 0 0 0 DMCoarsen 3 1.0 3.1582e-01 1.0 0.00e+00 0.0 6.1e+03 1.8e+02 6.6e+01 0 0 0 0 1 0 0 0 0 1 0 DMCreateInterp 3 1.0 3.5840e-01 1.0 8.07e+05 1.0 2.9e+03 5.9e+02 7.5e+01 1 0 0 0 2 1 0 0 0 2 144 SFSetGraph 16 1.0 4.1389e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFBcastBegin 96 1.0 3.8057e-01 1.0 0.00e+00 0.0 2.0e+04 2.0e+02 0.0e+00 1 0 2 0 0 1 0 2 0 0 0 SFBcastEnd 96 1.0 8.6324e-03 4.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SNESSolve 1 1.0 5.9930e+01 1.0 1.24e+10 1.0 
1.3e+06 4.4e+03 4.8e+03 91100100100 99 91100100100 99 13189 SNESFunctionEval 9 1.0 5.0673e+00 1.0 0.00e+00 0.0 9.2e+03 8.1e+03 0.0e+00 8 0 1 1 0 8 0 1 1 0 0 SNESJacobianEval 32 1.0 2.0361e+01 1.0 0.00e+00 0.0 3.5e+04 7.8e+04 1.1e+02 31 0 3 47 2 31 0 3 47 2 0 SNESLineSearch 8 1.0 4.7538e+00 1.0 9.64e+07 1.0 1.2e+04 8.1e+03 3.2e+01 7 1 1 2 1 7 1 1 2 1 1297 KSPGMRESOrthog 612 1.0 1.0330e+00 1.8 2.98e+08 1.0 0.0e+00 0.0e+00 6.1e+02 1 2 0 0 13 1 2 0 0 13 18458 KSPSetUp 107 1.0 1.9878e-01 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 7.6e+01 0 0 0 0 2 0 0 0 0 2 0 KSPSolve 8 1.0 3.6696e+01 1.0 1.23e+10 1.0 1.3e+06 2.8e+03 4.7e+03 55 99 98 63 98 55 99 98 63 98 21371 PCGAMGGraph_AGG 16 1.0 8.1084e-01 1.0 5.79e+04 1.2 2.1e+04 6.9e+01 4.2e+02 1 0 2 0 9 1 0 2 0 9 4 PCGAMGCoarse_AGG 16 1.0 1.2953e+00 1.0 9.13e+05 1.0 6.5e+04 7.0e+02 3.4e+02 2 0 5 1 7 2 0 5 1 7 45 PCGAMGProl_AGG 16 1.0 6.5544e-01 1.0 0.00e+00 0.0 2.8e+04 2.5e+02 6.2e+02 1 0 2 0 13 1 0 2 0 13 0 PCGAMGPOpt_AGG 16 1.0 1.3947e+00 1.0 3.46e+06 1.3 6.4e+04 2.6e+02 7.5e+02 2 0 5 0 16 2 0 5 0 16 129 GAMG: createProl 16 1.0 4.1571e+00 1.0 4.43e+06 1.2 1.8e+05 4.0e+02 2.1e+03 6 0 14 1 44 6 0 14 1 44 58 Graph 32 1.0 7.6665e-01 1.0 5.79e+04 1.2 2.1e+04 6.9e+01 4.2e+02 1 0 2 0 9 1 0 2 0 9 4 MIS/Agg 16 1.0 5.2536e-01 1.0 0.00e+00 0.0 2.0e+04 2.0e+02 6.4e+01 1 0 2 0 1 1 0 2 0 1 0 SA: col data 16 1.0 3.8185e-02 1.0 0.00e+00 0.0 2.1e+04 2.2e+02 4.0e+02 0 0 2 0 8 0 0 2 0 8 0 SA: frmProl0 16 1.0 5.8698e-01 1.0 0.00e+00 0.0 6.7e+03 3.3e+02 1.6e+02 1 0 1 0 3 1 0 1 0 3 0 SA: smooth 16 1.0 2.0391e-01 1.1 4.40e+05 1.3 2.1e+04 2.8e+02 3.2e+02 0 0 2 0 7 0 0 2 0 7 110 GAMG: partLevel 16 1.0 6.8757e-01 1.0 2.30e+06 1.6 4.3e+04 5.2e+02 1.1e+03 1 0 3 0 23 1 0 3 0 23 157 repartition 16 1.0 9.9953e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.6e+01 0 0 0 0 2 0 0 0 0 2 0 Invert-Sort 16 1.0 5.5922e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01 0 0 0 0 1 0 0 0 0 1 0 Move A 16 1.0 1.9150e-01 1.0 0.00e+00 0.0 3.1e+03 5.6e+02 2.9e+02 0 0 0 0 6 0 0 0 0 6 0 Move P 16 1.0 
2.8941e-02 1.0 0.00e+00 0.0 2.4e+03 4.0e+01 2.9e+02 0 0 0 0 6 0 0 0 0 6 0 PCSetUp 16 1.0 9.7850e+00 1.0 1.41e+07 1.1 2.6e+05 3.0e+03 3.7e+03 15 0 20 14 76 15 0 20 14 76 84 PCSetUpOnBlocks 60 1.0 2.5405e-01 1.3 1.41e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 PCApply 60 1.0 2.5596e+01 1.0 1.16e+10 1.0 1.0e+06 2.6e+03 9.0e+02 38 94 76 45 19 38 94 76 45 19 28902 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Toy Hydrostatic Ice 1 1 832 0. Vector 1252 1252 44157448 0. Vector Scatter 154 154 1191056 0. Matrix 598 598 87165056 0. Matrix Coarsen 16 16 10816 0. Distributed Mesh 8 8 43408 0. Index Set 492 492 1170320 0. IS L to G Mapping 8 8 569072 0. Star Forest Bipartite Graph 32 32 28400 0. Discrete System 8 8 7432 0. SNES 1 1 1488 0. SNESLineSearch 1 1 1040 0. DMSNES 4 4 3088 0. Krylov Solver 30 30 1077024 0. DMKSP interface 4 4 2784 0. Preconditioner 25 25 25896 0. PetscRandom 32 32 21952 0. Viewer 1 0 0 0. 
======================================================================================================================== Average time to get PetscTime(): 5.96046e-07 Average time for MPI_Barrier(): 7.20024e-06 Average time for zero size MPI_Send(): 0.000111561 #PETSc Option Table entries: -M 40 -N 40 -P 5 -da_refine 3 -log_view -mg_coarse_pc_type gamg -mg_levels_0_pc_type gamg -mg_levels_1_sub_pc_type cholesky -pc_type mg -thi_mat_type baij #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8 Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt ----------------------------------------- Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04 Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64 Using PETSc directory: /global/homes/j/jychang/Software/petsc Using PETSc arch: arch-cori-c-opt ----------------------------------------- Using C compiler: cc ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include ----------------------------------------- Using C linker: cc Using Fortran linker: ftn Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib 
-L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl ----------------------------------------- Level 4 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 640 x 640 x 65 (26624000), size (m) 15.625 x 15.625 x 15.625 Level 3 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 320 x 320 x 33 (3379200), size (m) 31.25 x 31.25 x 31.25 Level 2 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 160 x 160 x 17 (435200), size (m) 62.5 x 62.5 x 62.5 Level 1 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 80 x 80 x 9 (57600), size (m) 125. x 125. x 125. Level 0 domain size (m) 1e+04 x 1e+04 x 1e+03, num elements 40 x 40 x 5 (8000), size (m) 250. x 250. x 250. Solution statistics after solve: Full CONVERGED_FNORM_RELATIVE: Number of SNES iterations = 8, total linear iterations = 52 |X|_2 87518.2 -1.30201e-16 <= u <= 24.6202 -3.12198 <= v <= 3.12198 1.30557e-20 <= c <= 24.6202 Surface statistics: u in [1.224323e+01, 2.462018e+01] mean 2.023071e+01 Global eta range 2.63231e+10 to 9.2273e+12 converged range 2.63231e+10 to 8.50855e+12 Global beta2 range 1e+100 to 0. converged range 1e+100 to 0. Wall-clock time: 6.399e+02 seconds Degrees-of-freedom: 53248000 FLOPS: 6.366e+12 L1 misses: 1.465e+11 Intensity: 4.346e+01 Rate: 8.321e+04 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. 
Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ./ex48cori on a arch-cori-c-opt named nid12477 with 64 processors, by jychang Tue Apr 4 18:31:23 2017 Using Petsc Development GIT revision: v3.7.5-3418-ge372536 GIT Date: 2017-03-30 13:35:15 -0500 Max Max/Min Avg Total Time (sec): 6.404e+02 1.00001 6.404e+02 Objects: 2.745e+03 1.00586 2.729e+03 Flop: 9.948e+10 1.00006 9.947e+10 6.366e+12 Flop/sec: 1.553e+08 1.00006 1.553e+08 9.941e+09 MPI Messages: 2.842e+04 1.18082 2.457e+04 1.572e+06 MPI Message Lengths: 3.486e+08 1.00320 1.415e+04 2.225e+10 MPI Reductions: 5.139e+03 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flop and VecAXPY() for complex vectors of length N --> 8N flop Summary of Stages: ----- Time ------ ----- Flop ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 6.4040e+02 100.0% 6.3662e+12 100.0% 1.572e+06 100.0% 1.415e+04 100.0% 5.138e+03 100.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flop: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). 
%T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage BuildTwoSided 16 1.0 1.6144e-01 1.1 0.00e+00 0.0 2.1e+03 8.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecDot 8 1.0 9.8296e-02 2.4 1.33e+07 1.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 0 0 0 0 0 0 8667 VecMDot 692 1.0 1.0580e+01 6.6 1.17e+09 1.0 0.0e+00 0.0e+00 6.9e+02 1 1 0 0 13 1 1 0 0 13 7079 VecNorm 781 1.0 3.6949e+00 2.3 2.96e+08 1.0 0.0e+00 0.0e+00 7.8e+02 0 0 0 0 15 0 0 0 0 15 5126 VecScale 764 1.0 1.9312e+00 1.1 1.34e+08 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 4435 VecCopy 448 1.0 1.0568e+00 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1850 1.0 1.1037e+00 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 72 1.0 1.9138e-01 1.2 2.86e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 9554 VecAYPX 2880 1.0 4.2828e+00 1.5 5.72e+08 1.0 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 1 1 0 0 0 8547 VecAXPBYCZ 1440 1.0 2.5842e+00 2.1 1.14e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 28330 VecWAXPY 8 1.0 7.3841e-02 1.1 6.66e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5769 VecMAXPY 764 1.0 6.6720e+00 1.1 1.41e+09 1.0 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 1 1 0 0 0 13519 VecAssemblyBegin 128 1.0 1.4438e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.4e+02 0 0 0 0 7 0 0 0 0 7 0 VecAssemblyEnd 128 1.0 5.6171e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 
0 0 0 0 VecPointwiseMult 208 1.0 2.4564e-02 1.7 9.98e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2586 VecScatterBegin 3826 1.0 2.2829e+00 1.6 0.00e+00 0.0 1.4e+06 8.2e+03 0.0e+00 0 0 89 51 0 0 0 89 51 0 0 VecScatterEnd 3826 1.0 2.4707e+01 8.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 2 0 0 0 0 2 0 0 0 0 0 VecSetRandom 16 1.0 4.2892e-04 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecReduceArith 16 1.0 3.0433e-02 1.5 2.66e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 55989 VecReduceComm 8 1.0 1.1865e-0122.9 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 764 1.0 4.7302e+00 1.5 4.02e+08 1.0 0.0e+00 0.0e+00 7.6e+02 1 0 0 0 15 1 0 0 0 15 5432 MatMult 2860 1.0 1.4578e+02 1.1 4.92e+10 1.0 1.2e+06 8.8e+03 0.0e+00 21 49 77 48 0 21 49 77 48 0 21579 MatMultAdd 360 1.0 3.3118e+00 1.5 3.84e+08 1.0 5.5e+04 3.0e+03 0.0e+00 0 0 3 1 0 0 0 3 1 0 7418 MatMultTranspose 396 1.0 6.5461e+00 2.5 4.41e+08 1.0 6.2e+04 3.1e+03 0.0e+00 1 0 4 1 0 1 0 4 1 0 4316 MatSolve 60 0.0 8.3424e-02 0.0 1.06e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1 MatSOR 2688 1.0 2.3942e+02 1.1 4.47e+10 1.0 0.0e+00 0.0e+00 0.0e+00 36 45 0 0 0 36 45 0 0 0 11946 MatLUFactorSym 8 1.0 1.9460e-01 2.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatLUFactorNum 8 1.0 5.1702e-02 3.6 1.41e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3 MatScale 48 1.0 7.2104e-02 1.3 9.17e+04 1.3 4.2e+03 1.2e+02 0.0e+00 0 0 0 0 0 0 0 0 0 0 66 MatResidual 360 1.0 1.8100e+01 1.2 6.04e+09 1.0 1.5e+05 8.5e+03 0.0e+00 3 6 10 6 0 3 6 10 6 0 21373 MatAssemblyBegin 441 1.0 8.2658e+00 2.6 0.00e+00 0.0 3.4e+04 3.2e+05 4.6e+02 1 0 2 48 9 1 0 2 48 9 0 MatAssemblyEnd 441 1.0 3.5791e+00 1.0 0.00e+00 0.0 5.1e+04 3.7e+02 1.0e+03 1 0 3 0 20 1 0 3 0 20 0 MatGetRow 12720 1.3 2.6373e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetRowIJ 8 0.0 3.1515e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCreateSubMat 32 1.0 2.2857e-01 1.0 
0.00e+00 0.0 5.4e+03 3.4e+02 5.4e+02 0 0 0 0 11 0 0 0 0 11 0 MatGetOrdering 8 0.0 3.7957e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCoarsen 16 1.0 5.0139e-01 1.0 0.00e+00 0.0 2.0e+04 2.0e+02 6.4e+01 0 0 1 0 1 0 0 1 0 1 0 MatZeroEntries 56 1.0 2.8899e+00 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAXPY 16 1.0 5.1729e-02 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01 0 0 0 0 1 0 0 0 0 1 0 MatMatMult 16 1.0 1.2365e-01 1.2 4.06e+05 1.3 2.1e+04 2.8e+02 2.6e+02 0 0 1 0 5 0 0 1 0 5 167 MatMatMultSym 16 1.0 1.1723e-01 1.2 0.00e+00 0.0 1.7e+04 2.6e+02 2.2e+02 0 0 1 0 4 0 0 1 0 4 0 MatMatMultNum 16 1.0 5.9431e-03 1.0 4.06e+05 1.3 4.2e+03 3.9e+02 3.2e+01 0 0 0 0 1 0 0 0 0 1 3476 MatPtAP 16 1.0 2.7400e-01 1.0 2.30e+06 1.6 3.6e+04 5.7e+02 2.7e+02 0 0 2 0 5 0 0 2 0 5 395 MatPtAPSymbolic 16 1.0 2.1314e-01 1.1 0.00e+00 0.0 1.9e+04 8.1e+02 1.1e+02 0 0 1 0 2 0 0 1 0 2 0 MatPtAPNumeric 16 1.0 6.0712e-02 1.0 2.30e+06 1.6 1.6e+04 2.8e+02 1.6e+02 0 0 1 0 3 0 0 1 0 3 1782 MatTrnMatMult 8 1.0 4.9175e-01 1.0 9.13e+05 1.0 2.5e+04 1.5e+03 1.5e+02 0 0 2 0 3 0 0 2 0 3 119 MatTrnMatMultSym 8 1.0 2.6476e-01 1.1 0.00e+00 0.0 2.0e+04 9.6e+02 1.4e+02 0 0 1 0 3 0 0 1 0 3 0 MatTrnMatMultNum 8 1.0 4.0972e-02 1.0 9.13e+05 1.0 4.1e+03 4.4e+03 1.6e+01 0 0 0 0 0 0 0 0 0 0 1426 MatGetLocalMat 64 1.0 4.0983e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetBrAoCol 48 1.0 4.3412e-02 1.8 0.00e+00 0.0 3.0e+04 6.8e+02 0.0e+00 0 0 2 0 0 0 0 2 0 0 0 DMCoarsen 4 1.0 5.2356e-01 1.0 0.00e+00 0.0 8.2e+03 4.8e+02 8.8e+01 0 0 1 0 2 0 0 1 0 2 0 DMCreateInterp 4 1.0 1.2483e+00 1.0 6.39e+06 1.0 3.8e+03 1.7e+03 1.0e+02 0 0 0 0 2 0 0 0 0 2 328 SFSetGraph 16 1.0 4.1509e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFBcastBegin 96 1.0 4.0962e-01 1.1 0.00e+00 0.0 2.0e+04 2.0e+02 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 SFBcastEnd 96 1.0 9.7880e-03 5.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SNESSolve 1 1.0 6.1583e+02 1.0 9.95e+10 1.0 
1.6e+06 1.4e+04 5.1e+03 96100100100 99 96100100100 99 10338 SNESFunctionEval 9 1.0 3.8730e+01 1.0 0.00e+00 0.0 9.2e+03 3.2e+04 0.0e+00 6 0 1 1 0 6 0 1 1 0 0 SNESJacobianEval 40 1.0 1.5628e+02 1.0 0.00e+00 0.0 4.4e+04 2.5e+05 1.4e+02 24 0 3 49 3 24 0 3 49 3 0 SNESLineSearch 8 1.0 3.6784e+01 1.0 7.72e+08 1.0 1.2e+04 3.2e+04 3.2e+01 6 1 1 2 1 6 1 1 2 1 1343 KSPGMRESOrthog 692 1.0 1.6206e+01 2.2 2.34e+09 1.0 0.0e+00 0.0e+00 6.9e+02 2 2 0 0 13 2 2 0 0 13 9243 KSPSetUp 116 1.0 1.9615e+00 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 9.2e+01 0 0 0 0 2 0 0 0 0 2 0 KSPSolve 8 1.0 4.3837e+02 1.0 9.87e+10 1.0 1.5e+06 8.8e+03 5.0e+03 68 99 98 61 98 68 99 98 61 98 14409 PCGAMGGraph_AGG 16 1.0 1.0600e+00 1.0 5.79e+04 1.2 2.1e+04 6.9e+01 4.2e+02 0 0 1 0 8 0 0 1 0 8 3 PCGAMGCoarse_AGG 16 1.0 1.4782e+00 1.0 9.13e+05 1.0 6.5e+04 7.0e+02 3.4e+02 0 0 4 0 7 0 0 4 0 7 40 PCGAMGProl_AGG 16 1.0 9.3608e-01 1.0 0.00e+00 0.0 2.8e+04 2.5e+02 6.2e+02 0 0 2 0 12 0 0 2 0 12 0 PCGAMGPOpt_AGG 16 1.0 2.0733e+00 1.0 3.46e+06 1.3 6.4e+04 2.6e+02 7.5e+02 0 0 4 0 15 0 0 4 0 15 87 GAMG: createProl 16 1.0 5.5551e+00 1.0 4.43e+06 1.2 1.8e+05 4.0e+02 2.1e+03 1 0 11 0 42 1 0 11 0 42 43 Graph 32 1.0 1.0173e+00 1.0 5.79e+04 1.2 2.1e+04 6.9e+01 4.2e+02 0 0 1 0 8 0 0 1 0 8 3 MIS/Agg 16 1.0 5.7764e-01 1.0 0.00e+00 0.0 2.0e+04 2.0e+02 6.4e+01 0 0 1 0 1 0 0 1 0 1 0 SA: col data 16 1.0 4.0720e-02 1.0 0.00e+00 0.0 2.1e+04 2.2e+02 4.0e+02 0 0 1 0 8 0 0 1 0 8 0 SA: frmProl0 16 1.0 8.5838e-01 1.0 0.00e+00 0.0 6.7e+03 3.3e+02 1.6e+02 0 0 0 0 3 0 0 0 0 3 0 SA: smooth 16 1.0 2.5695e-01 1.1 4.40e+05 1.3 2.1e+04 2.8e+02 3.2e+02 0 0 1 0 6 0 0 1 0 6 87 GAMG: partLevel 16 1.0 8.0943e-01 1.0 2.30e+06 1.6 4.3e+04 5.2e+02 1.1e+03 0 0 3 0 22 0 0 3 0 22 134 repartition 16 1.0 1.2132e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.6e+01 0 0 0 0 2 0 0 0 0 2 0 Invert-Sort 16 1.0 6.2821e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01 0 0 0 0 1 0 0 0 0 1 0 Move A 16 1.0 2.2548e-01 1.0 0.00e+00 0.0 3.1e+03 5.6e+02 2.9e+02 0 0 0 0 6 0 0 0 0 6 0 Move P 16 1.0 
2.8295e-02 1.0 0.00e+00 0.0 2.4e+03 4.0e+01 2.9e+02 0 0 0 0 6 0 0 0 0 6 0 PCSetUp 16 1.0 3.4525e+01 1.0 6.52e+07 1.0 2.8e+05 1.0e+04 3.8e+03 5 0 18 13 74 5 0 18 13 74 119 PCSetUpOnBlocks 60 1.0 9.5716e-01 1.1 1.41e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCApply 60 1.0 3.8705e+02 1.0 9.32e+10 1.0 1.2e+06 8.0e+03 1.1e+03 60 94 79 45 21 60 94 79 45 21 15407 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Toy Hydrostatic Ice 1 1 832 0. Vector 1291 1291 324723200 0. Vector Scatter 159 159 7955496 0. Matrix 607 607 637604560 0. Matrix Coarsen 16 16 10816 0. Distributed Mesh 10 10 54240 0. Index Set 502 502 5108752 0. IS L to G Mapping 10 10 4120768 0. Star Forest Bipartite Graph 36 36 31824 0. Discrete System 10 10 9288 0. SNES 1 1 1488 0. SNESLineSearch 1 1 1040 0. DMSNES 5 5 3880 0. Krylov Solver 32 32 1108960 0. DMKSP interface 5 5 3480 0. Preconditioner 26 26 26936 0. PetscRandom 32 32 21952 0. Viewer 1 0 0 0. 
======================================================================================================================== Average time to get PetscTime(): 5.00679e-07 Average time for MPI_Barrier(): 7.20024e-06 Average time for zero size MPI_Send(): 0.000109892 #PETSc Option Table entries: -M 40 -N 40 -P 5 -da_refine 4 -log_view -mg_coarse_pc_type gamg -mg_levels_0_pc_type gamg -mg_levels_1_sub_pc_type cholesky -pc_type mg -thi_mat_type baij #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 8 Configure options: --with-64-bit-indices=1 --with-cc=cc --with-clib-autodetect=0 --with-cxx=CC --with-cxxlib-autodetect=0 --with-debugging=0 --with-fc=ftn --with-fortranlib-autodetect=0 --with-memalign=64 --with-mpiexec=srun COPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" CXXOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" FOPTFLAGS="-g -O3 -fp-model fast -xMIC-AVX512" PETSC_ARCH=arch-cori-c-opt ----------------------------------------- Libraries compiled on Mon Apr 3 16:17:18 2017 on cori04 Machine characteristics: Linux-3.12.60-52.63.1.12215.0.PTF.1017941-default-x86_64-with-SuSE-12-x86_64 Using PETSc directory: /global/homes/j/jychang/Software/petsc Using PETSc arch: arch-cori-c-opt ----------------------------------------- Using C compiler: cc ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: ftn ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/include -I/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/include ----------------------------------------- Using C linker: cc Using Fortran linker: ftn Using libraries: -Wl,-rpath,/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib 
-L/global/homes/j/jychang/Software/petsc/arch-cori-c-opt/lib -lpetsc -ldl -----------------------------------------
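Note on the derived figures in the -da_refine 4 run above. The "Rate" and "Intensity" values printed after the wall-clock line are not explained in the log itself. The sketch below assumes that Rate is degrees of freedom per second of wall-clock time and that Intensity is flop per L1 miss; both readings are inferences that happen to reproduce the printed numbers, not something the log states. It also recomputes the KSPSolve Mflop/s entry under the further assumption that the flop is evenly balanced (Ratio 1.0), so that the sum over the 64 processes is roughly 64 times the per-process maximum. The function and variable names are illustrative only.

```python
# Values copied from the -da_refine 4 run printed above.
DOFS = 53248000          # "Degrees-of-freedom"
WALL_TIME_S = 6.399e+02  # "Wall-clock time"
FLOPS = 6.366e+12        # "FLOPS"
L1_MISSES = 1.465e+11    # "L1 misses"


def derived_metrics(dofs, wall_time_s, flops, l1_misses):
    """Return (rate, intensity) under the assumed definitions:
    rate = DOF per second, intensity = flop per L1 miss."""
    rate = dofs / wall_time_s
    intensity = flops / l1_misses
    return rate, intensity


def event_mflops(max_flop_per_rank, n_ranks, max_time_s):
    """Approximate an event's 'Total Mflop/s' column.

    Assumption: flop is balanced across ranks (Ratio 1.0), so the sum
    over all ranks is about n_ranks * max_flop_per_rank.
    """
    return 1e-6 * max_flop_per_rank * n_ranks / max_time_s


rate, intensity = derived_metrics(DOFS, WALL_TIME_S, FLOPS, L1_MISSES)

# KSPSolve row: max flop 9.87e+10, max time 4.3837e+02 s, 64 processes.
ksp_mflops = event_mflops(9.87e10, 64, 4.3837e+02)

print(f"rate       = {rate:.3e} DOF/s   (log prints 8.321e+04)")
print(f"intensity  = {intensity:.3e} flop/miss (log prints 4.346e+01)")
print(f"KSPSolve   = {ksp_mflops:.0f} Mflop/s  (log prints 14409)")
```

The KSPSolve check only matches the logged 14409 when the scale factor is 1e-6, which suggests the "10e-6" in the legend should be read as 1e-6. Small differences in the last digit come from the inputs being rounded to four significant figures in the log.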