OpenBLAS MKL Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total 0: Main Stage: 1.7832e+03 72.7% 3.5465e+10 0.3% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 1: Stage 1 of Code: 1.5764e+02 6.4% 2.6307e+12 22.1% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 2: Stage 2 of Code: 1.3532e+02 5.5% 2.5381e+12 21.3% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 3: Stage 3 of Code: 2.2701e+02 9.3% 3.9071e+12 32.8% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 4: Stage 4 of Code: 1.4944e+02 6.1% 2.7805e+12 23.4% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flop: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent AvgLen: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage ---- Total Max Ratio Max Ratio Max Ratio Mess AvgLen Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 1: Stage 1 of Code VecMDot 164 1.0 2.2231e+01 1.0 2.17e+11 1.0 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 14 8 0 0 0 9772 VecNorm 166 1.0 1.0829e+00 1.0 2.67e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 2461 VecScale 1652 1.0 1.1797e+00 1.0 4.87e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 4130 VecCopy 5 1.0 2.8322e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1157 1.0 1.4018e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0 VecAXPY 496 1.0 1.1631e-01 1.0 3.20e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 27471 VecMAXPY 330 1.0 4.9450e+01 1.0 2.25e+11 1.0 0.0e+00 0.0e+00 0.0e+00 2 2 0 0 0 31 9 0 0 0 4543 VecNormalize 166 1.0 1.1499e+00 1.0 4.00e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 3477 MatMult 495 1.0 5.5593e+01 1.0 2.19e+12 1.0 0.0e+00 0.0e+00 0.0e+00 2 18 0 0 0 35 83 0 0 0 39338 MatMultAdd 3795 1.0 2.2066e+01 1.0 8.97e+11 1.0 0.0e+00 0.0e+00 0.0e+00 1 8 0 0 0 14 34 0 0 0 40650 MatMultTranspose 165 1.0 8.2456e-02 1.0 3.96e+03 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMultTrAdd 165 1.0 1.4583e+00 1.0 3.18e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 2180 MatSolve 165 1.0 2.6645e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 17 0 0 0 0 0 PCApply 165 1.0 2.7211e+01 1.0 1.09e+10 1.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 17 0 0 0 0 399 KSPSolve 1 1.0 1.5488e+02 1.0 2.62e+12 1.0 0.0e+00 0.0e+00 0.0e+00 6 22 0 0 0 98 99 0 0 0 16900 KSPGMRESOrthog 164 1.0 7.0004e+01 1.0 4.34e+11 1.0 0.0e+00 0.0e+00 0.0e+00 3 4 0 0 0 44 17 0 0 0 6207 INTEL MKL Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flop ------ --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total Count %Total Avg %Total Count %Total 0: Main Stage: 1.9440e+03 76.5% 3.5465e+10 0.3% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 1: Stage 1 of Code: 1.1936e+02 4.7% 2.6307e+12 20.7% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 2: Stage 2 of Code: 1.1342e+02 4.5% 2.5566e+12 20.1% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 3: Stage 3 of Code: 2.2451e+02 8.8% 4.4771e+12 35.2% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 4: Stage 4 of Code: 1.3836e+02 5.4% 3.0090e+12 23.7% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flop: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent AvgLen: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flop in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flop --- Global --- --- Stage ---- Total Max Ratio Max Ratio Max Ratio Mess AvgLen Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 1: Stage 1 of Code VecMDot 164 1.0 4.2087e+00 1.0 2.17e+11 1.0 0.0e+00 0.0e+00 0.0e+00 0 2 0 0 0 4 8 0 0 0 51619 VecNorm 166 1.0 1.0858e+00 1.0 2.67e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 2455 VecScale 1652 1.0 1.9282e-01 1.0 4.87e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 25267 VecCopy 5 1.0 2.9560e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 1157 1.0 1.2266e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0 VecAXPY 496 1.0 9.6929e-02 1.0 3.20e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 32963 VecMAXPY 330 1.0 4.6746e+01 1.0 2.25e+11 1.0 0.0e+00 0.0e+00 0.0e+00 2 2 0 0 0 39 9 0 0 0 4806 VecNormalize 166 1.0 1.1572e+00 1.0 4.00e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 3455 MatMult 495 1.0 4.3662e+01 1.0 2.19e+12 1.0 0.0e+00 0.0e+00 0.0e+00 2 17 0 0 0 37 83 0 0 0 50087 MatMultAdd 3795 1.0 1.7547e+01 1.0 8.97e+11 1.0 0.0e+00 0.0e+00 0.0e+00 1 7 0 0 0 15 34 0 0 0 51119 MatMultTranspose 165 1.0 5.4579e-02 1.0 3.96e+03 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatMultTrAdd 165 1.0 1.3524e+00 1.0 3.18e+09 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 2351 MatSolve 165 1.0 2.0859e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 17 0 0 0 0 0 PCApply 165 1.0 2.1287e+01 1.0 1.09e+10 1.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 18 0 0 0 0 510 KSPSolve 1 1.0 1.1649e+02 1.0 2.62e+12 1.0 0.0e+00 0.0e+00 0.0e+00 5 21 0 0 0 98 99 0 0 0 22470 KSPGMRESOrthog 164 1.0 4.9377e+01 1.0 4.34e+11 1.0 0.0e+00 0.0e+00 0.0e+00 2 3 0 0 0 41 17 0 0 0 8800