**************************************************************************************************************************************************************** *** WIDEN YOUR WINDOW TO 160 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** **************************************************************************************************************************************************************** ------------------------------------------------------------------ PETSc Performance Summary: ------------------------------------------------------------------ /home/berger/BoussDevMultiLevel5Eqn/examples/SGNTestInit/xgeoclaw on a arch-mpi-opt named juniper.cims.nyu.edu with 2 processors, by berger Fri Sep 16 17:41:34 2022 Using Petsc Development GIT revision: v3.17.4-1192-g531d53b GIT Date: 2022-09-01 17:15:07 -0400 Max Max/Min Avg Total Time (sec): 2.021e+02 1.000 2.021e+02 Objects: 2.252e+05 1.199 2.065e+05 Flops: 2.671e+10 1.195 2.454e+10 4.907e+10 Flops/sec: 1.322e+08 1.195 1.214e+08 2.429e+08 MPI Msg Count: 9.134e+04 1.002 9.123e+04 1.825e+05 MPI Msg Len (bytes): 8.759e+09 1.011 9.552e+04 1.743e+10 MPI Reductions: 1.868e+05 1.000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total 0: Main Stage: 2.0206e+02 100.0% 4.9073e+10 100.0% 1.825e+05 100.0% 9.552e+04 100.0% 1.867e+05 100.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flop: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent AvgLen: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage ---- Total Max Ratio Max Ratio Max Ratio Mess AvgLen Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage BuildTwoSided 27490 1.0 1.0284e+01 1.1 0.00e+00 0.0 2.1e+04 4.0e+00 2.7e+04 5 0 11 0 15 5 0 11 0 15 0 BuildTwoSidedF 16230 1.0 9.5499e+00 1.1 0.00e+00 0.0 1.3e+04 3.3e+05 1.6e+04 5 0 7 24 9 5 0 7 24 9 0 MatMult 78482 1.5 1.4375e+01 1.1 1.45e+10 1.3 8.0e+04 1.1e+05 8.2e+02 7 53 44 49 0 7 53 44 49 0 1806 MatMultAdd 14097 1.5 1.2616e+00 1.2 6.01e+08 1.1 1.2e+04 4.5e+03 0.0e+00 1 2 7 0 0 1 2 7 0 0 926 MatMultTranspose 14097 1.5 1.5192e+00 1.2 6.01e+08 1.0 1.5e+04 3.9e+03 1.2e+03 1 2 8 0 1 1 2 8 0 1 773 MatSolve 4777 0.0 1.2574e-02 0.0 5.77e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 459 MatLUFactorSym 613 1.6 1.6562e-02 5.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatLUFactorNum 613 1.6 9.9657e-0316.1 6.73e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 675 MatConvert 3586 1.5 1.7801e+00 1.2 0.00e+00 0.0 6.9e+03 2.5e+04 2.3e+03 1 0 4 1 1 1 0 4 1 1 0 MatScale 5379 1.5 1.1240e+00 1.1 3.91e+08 1.1 5.2e+03 5.1e+04 1.2e+03 1 2 3 2 1 1 2 3 2 1 672 MatResidual 14097 1.5 2.5553e+00 1.1 2.48e+09 1.2 1.4e+04 1.0e+05 0.0e+00 1 9 8 8 0 1 9 8 8 0 1754 MatAssemblyBegin 35904 1.2 2.1923e+01 1.3 0.00e+00 0.0 1.3e+04 3.3e+05 1.4e+04 10 0 7 24 7 10 0 7 24 7 0 MatAssemblyEnd 35904 1.2 4.0444e+01 1.0 3.41e+071158.2 0.0e+00 0.0e+00 4.5e+04 20 0 0 0 24 20 0 0 0 24 1 MatGetRowIJ 648 0.0 2.3928e-03 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCreateSubMat 758 1.0 2.0990e-01 1.0 0.00e+00 0.0 1.9e+03 1.8e+03 1.1e+04 0 0 1 0 6 0 0 1 0 6 0 MatGetOrdering 613 0.0 1.7006e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatCoarsen 1793 1.5 3.8518e+01 1.0 5.40e+08 1.1 2.9e+04 7.6e+04 4.7e+04 19 2 16 13 25 19 2 16 13 25 26 MatZeroEntries 1542 1.0 6.1590e-0233.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAXPY 3586 1.5 1.7284e+01 1.1 1.45e+08 1.1 0.0e+00 0.0e+00 7.0e+03 8 1 0 0 4 8 1 0 0 4 16 MatTranspose 6670 1.2 3.5198e+01 1.0 0.00e+00 0.0 1.3e+04 2.4e+05 1.9e+04 17 0 7 18 10 17 0 7 18 10 0 MatMatMultSym 5490 1.2 9.1986e+00 1.1 0.00e+00 0.0 7.1e+03 6.2e+04 1.4e+04 4 0 4 3 8 4 0 4 3 8 0 MatMatMultNum 5490 1.2 3.0940e+00 1.1 9.04e+08 1.0 2.1e+03 1.0e+05 1.5e+03 1 4 1 1 1 1 4 1 1 1 576 MatPtAPSymbolic 2406 1.6 1.9428e+01 1.0 0.00e+00 0.0 1.3e+04 1.1e+05 1.1e+04 9 0 7 8 6 9 0 7 8 6 0 MatPtAPNumeric 2406 1.6 1.4324e+01 1.1 1.83e+09 1.1 3.6e+03 2.4e+05 7.7e+03 7 7 2 5 4 7 7 2 5 4 248 MatGetLocalMat 4626 1.0 1.3448e+00 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 MatGetBrAoCol 4626 1.0 6.4488e+00 1.0 0.00e+00 0.0 1.7e+04 1.1e+05 3.8e+02 3 0 9 11 0 3 0 9 11 0 0 MatGetSymTransR 864 0.0 4.4594e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatSetPreallCOO 48 0.0 1.6761e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatSetValuesCOO 613 0.0 1.0095e+00 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 KSPSetUp 4908 1.6 1.3330e-01 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 2.3e+03 0 0 0 0 1 0 0 0 0 1 0 KSPSolve 613 1.6 1.4492e+02 1.2 2.66e+10 1.2 1.8e+05 9.1e+04 1.8e+05 66 98 98 94 94 66 98 98 94 94 332 KSPGMRESOrthog 22094 1.5 2.0621e+00 1.0 4.46e+09 1.1 0.0e+00 0.0e+00 1.4e+04 1 17 0 0 8 1 17 0 0 8 4064 PCSetUp_GAMG+ 613 1.6 1.2628e+02 1.1 9.55e+09 1.1 9.5e+04 9.8e+04 1.7e+05 61 36 52 53 93 61 36 52 53 93 142 PCGAMGCreateG 1793 1.5 4.8060e+01 1.0 4.47e+08 1.1 1.7e+04 1.9e+05 2.0e+04 23 2 10 19 11 23 2 10 19 11 18 PCGAMGFilter 1793 1.5 6.1589e-04 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 GAMG Coarsen 1793 1.5 4.0180e+01 1.1 5.40e+08 1.1 2.9e+04 7.6e+04 4.7e+04 19 2 16 13 25 19 2 16 13 25 25 GAMG MIS/Agg 1793 1.5 3.8537e+01 1.0 5.40e+08 1.1 2.9e+04 7.6e+04 4.7e+04 19 2 16 13 25 19 2 16 13 25 26 PCGAMGProl 1793 1.5 6.1596e+00 1.0 0.00e+00 0.0 5.2e+03 7.6e+04 1.9e+04 3 0 3 2 10 3 0 3 2 10 0 GAMG Prol-col 1793 1.5 1.1039e+00 1.0 0.00e+00 0.0 3.5e+03 6.3e+04 8.1e+03 1 0 2 1 4 1 0 2 1 4 0 GAMG Prol-lift 1793 1.5 4.8790e+00 1.1 0.00e+00 0.0 1.7e+03 1.0e+05 5.8e+03 2 0 1 1 3 2 0 1 1 3 0 PCGAMGOptProl 1793 1.5 1.5209e+01 1.1 7.24e+09 1.2 2.6e+04 8.7e+04 4.7e+04 7 27 14 13 25 7 27 14 13 25 882 GAMG smooth 1793 1.5 9.2698e+00 1.1 4.05e+08 1.2 6.9e+03 7.6e+04 1.5e+04 4 2 4 3 8 4 2 4 3 8 80 PCGAMGCreateL 1793 1.5 1.6082e+01 1.1 1.32e+09 1.0 1.7e+04 6.5e+04 3.4e+04 8 5 9 6 18 8 5 9 6 18 160 GAMG PtAP 1793 1.5 1.5774e+01 1.1 1.32e+09 1.0 1.2e+04 8.8e+04 1.4e+04 7 5 7 6 7 7 5 7 6 7 164 GAMG Reduce 379 1.0 3.0623e-01 1.0 0.00e+00 0.0 4.4e+03 8.2e+02 2.0e+04 0 0 2 0 11 0 0 2 0 11 0 PCGAMG Gal l00 613 1.6 1.3900e+01 1.1 1.28e+09 1.3 5.4e+03 2.0e+05 4.5e+03 7 5 3 6 2 7 5 3 6 2 162 PCGAMG Opt l00 613 1.6 6.4129e+00 1.1 3.12e+08 1.4 3.0e+03 1.7e+05 3.8e+03 3 1 2 3 2 3 1 2 3 2 82 PCGAMG Gal l01 613 1.6 1.5097e+00 1.2 2.30e+08 7.2 3.9e+03 8.8e+02 4.5e+03 1 1 2 0 2 1 1 2 0 2 173 PCGAMG Opt l01 613 1.6 4.3841e-01 1.1 2.57e+07 7.7 2.0e+03 3.3e+02 3.8e+03 0 0 1 0 2 0 0 1 0 2 66 PCGAMG Gal l02 541 1.4 3.5510e-01 1.1 5.63e+07 6.5 3.1e+03 2.6e+02 4.5e+03 0 0 2 0 2 0 0 2 0 2 183 PCGAMG Opt l02 541 1.4 1.4577e-01 1.1 6.98e+06 5.4 2.0e+03 1.3e+02 3.8e+03 0 0 1 0 2 0 0 1 0 2 57 PCGAMG Gal l03 26 1.0 8.0912e-03 1.0 4.56e+05 0.0 0.0e+00 0.0e+00 3.1e+02 0 0 0 0 0 0 0 0 0 0 56 PCGAMG Opt l03 26 1.0 4.9567e-03 1.0 7.30e+04 0.0 0.0e+00 0.0e+00 2.6e+02 0 0 0 0 0 0 0 0 0 0 15 PCSetUp 1826 2.4 1.2788e+02 1.1 9.56e+09 1.1 9.5e+04 1.0e+05 1.8e+05 61 36 52 56 94 61 36 52 56 94 140 PCSetUpOnBlocks 4777 1.6 5.3615e-02 3.8 6.73e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 125 PCApply 3032 4.9 1.4428e+0210.8 2.66e+10 2.4 1.3e+05 8.4e+04 9.4e+04 39 76 73 64 50 39 76 73 64 50 260 VecMDot 22094 1.5 1.2151e+00 1.1 2.23e+09 1.1 0.0e+00 0.0e+00 1.4e+04 1 9 0 0 8 1 9 0 0 8 3449 VecNorm 24500 1.5 3.4822e-01 1.3 5.10e+08 1.1 0.0e+00 0.0e+00 1.6e+04 0 2 0 0 8 0 2 0 0 8 2746 VecScale 24500 1.5 1.4646e-01 1.2 2.55e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 0 1 0 0 0 3264 VecCopy 44697 1.5 2.1185e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 57254 1.4 1.6440e-01 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 2406 1.6 3.2757e-02 1.2 5.38e+07 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 3064 VecAYPX 84582 1.5 9.5569e-01 1.1 8.61e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 3 0 0 0 0 3 0 0 0 1713 VecAXPBYCZ 28194 1.5 6.0327e-01 1.1 1.08e+09 1.1 0.0e+00 0.0e+00 0.0e+00 0 4 0 0 0 0 4 0 0 0 3393 VecMAXPY 24500 1.5 1.1422e+00 1.2 2.69e+09 1.1 0.0e+00 0.0e+00 0.0e+00 1 10 0 0 0 1 10 0 0 0 4418 VecAssemblyBegin 2705 1.0 6.2464e-02 2.8 0.00e+00 0.0 0.0e+00 0.0e+00 2.3e+03 0 0 0 0 1 0 0 0 0 1 0 VecAssemblyEnd 2705 1.0 2.5163e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecPointwiseMult 76111 1.5 1.0028e+00 1.1 5.79e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 0 2 0 0 0 1098 VecScatterBegin 73975 1.0 1.4727e+00 1.1 0.00e+00 0.0 1.2e+05 7.9e+04 4.7e+03 1 0 64 53 3 1 0 64 53 3 0 VecScatterEnd 73975 1.0 2.8098e+00 1.1 6.57e+0635.7 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 2 VecNormalize 24500 1.5 5.1895e-01 1.2 7.66e+08 1.1 0.0e+00 0.0e+00 1.6e+04 0 3 0 0 8 0 3 0 0 8 2764 DMConvert 17807 0.0 7.9498e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 DMGlobalToLocal 11260 0.0 2.1893e+00 0.0 0.00e+00 0.0 1.4e+04 1.8e+04 5.6e+03 1 0 8 1 3 1 0 8 1 3 0 DMLocalToGlobal 5432 0.0 6.4000e-02 0.0 0.00e+00 0.0 4.1e+03 3.9e+04 0.0e+00 0 0 2 1 0 0 0 2 1 0 0 DMLocatePoints 5432 0.0 6.3669e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 DMCoarsen 2326 0.0 1.4489e-02 0.0 0.00e+00 0.0 1.3e+03 3.5e+04 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 DMRefine 2326 0.0 3.8828e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 DMAdaptInterp 81733 0.0 5.1170e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 DMPlexBuFrCeLi 81733 0.0 3.3868e-02 0.0 6.57e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 194 SFSetGraph 17807 0.0 5.3620e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFSetUp 11260 0.0 2.4624e+00 0.0 0.00e+00 0.0 1.4e+04 1.8e+04 5.6e+03 1 0 8 1 3 1 0 8 1 3 0 SFBcastBegin 5432 0.0 6.0916e-02 0.0 0.00e+00 0.0 4.1e+03 3.9e+04 0.0e+00 0 0 2 1 0 0 0 2 1 0 0 SFBcastEnd 5432 0.0 7.0248e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFReduceBegin 2326 0.0 1.6638e-02 0.0 0.00e+00 0.0 1.3e+03 3.5e+04 0.0e+00 0 0 1 0 0 0 0 1 0 0 0 SFReduceEnd 2326 0.0 3.5606e-02 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFPack 81733 0.0 5.3409e-01 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 SFUnpack 81733 0.0 2.5624e-02 0.0 1.84e+05 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 7 ------------------------------------------------------------------------------------------------------------------------ Object Type Creations Destructions. Reports information only for process 0. --- Event Stage 0: Main Stage Container 11151 11129 Matrix 63027 62930 Matrix Coarsen 1793 1793 Krylov Solver 2131 2110 Preconditioner 2131 2110 Vector 84122 83982 Viewer 3 0 PetscRandom 1793 1793 Index Set 31276 31259 Distributed Mesh 1987 1975 Star Forest Graph 21781 21741 Discrete System 1987 1975 Weak Form 1987 1975 ======================================================================================================================== Average time to get PetscTime(): 2.49e-08 Average time for MPI_Barrier(): 2.1514e-06 Average time for zero size MPI_Send(): 7.6605e-06 #PETSc Option Table entries: -fp_trap off -ksp_type preonly -log_view -mpi_ksp_max_it 100 -mpi_ksp_monitor -mpi_ksp_rtol 1.e-7 -mpi_ksp_type gmres -mpi_linear_solver_server -mpi_linear_solver_server_view -mpi_pc_gamg_sym_graph true -mpi_pc_gamg_symmetrize_graph true -mpi_pc_type gamg -pc_type mpi #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4 Configure options: --download-mpich --with-debugging=0 --with-fc=gfortran -----------------------------------------