<html><head><style type="text/css"><!-- DIV {margin:0px;} --></style></head><body><div style="font-family:arial, helvetica, sans-serif;font-size:12pt"><div><div>Hi,</div><div>I am trying to benchmark the performance of my code on 8 processors and am trying to find where most of the time is used. When I look at the breakdown of the stages time required, the total add up to ~7s however, the main stage time is ~350s. I am not being able to find out the stage which is taking so much extra time. Could you please suggest something ?</div><div><br></div><div>Thanks. </div><div><br></div><div>Time (sec): 3.508e+02 1.00000 3.508e+02</div><div>Objects: 3.310e+02 1.00000 3.310e+02</div><div>Flops: 1.279e+08 1.03856 1.267e+08
1.014e+09</div><div>Flops/sec: 3.646e+05 1.03856 3.612e+05 2.890e+06</div><div>Memory: 2.817e+07 1.06078 2.221e+08</div><div>MPI Messages: 3.150e+02 1.26506 2.985e+02 2.388e+03</div><div>MPI Message Lengths: 4.011e+06 1.83678 1.191e+04 2.843e+07</div><div>MPI Reductions: 6.970e+02 1.00000</div><div><br></div><div><br></div><div><br></div><div><div>VecMDot 51 1.0 6.0108e-02 2.4 5.65e+06 1.0 0.0e+00 0.0e+00 5.1e+01 0 4 0 0 7 0 4 0 0 11 752</div><div>VecNorm 67 1.0
1.1708e-02 1.1 7.41e+05 1.0 0.0e+00 0.0e+00 6.7e+01 0 1 0 0 10 0 1 0 0 14 507</div><div>VecScale 75 1.0 1.2923e-03 1.0 3.69e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2282</div><div>VecCopy 20 1.0 6.3189e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>VecSet 66 1.0 5.0619e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>VecAXPY 51 1.0 8.9883e-04 1.1 2.95e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0
0 0 0 0 0 0 2625</div><div>VecWAXPY 30 1.0 2.0204e-03 1.2 1.09e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 430</div><div>VecMAXPY 54 1.0 8.1123e-03 1.0 6.28e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 5 0 0 0 0 5 0 0 0 6192</div><div>VecAssemblyBegin 3 1.0 4.9893e-04 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.0e+00 0 0 0 0 1 0 0 0 0 2 0</div><div>VecAssemblyEnd 3 1.0 2.0169e-05 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>VecPointwiseMult 9 1.0
1.7315e-04 1.0 1.84e+04 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 852</div><div>VecScatterBegin 118 1.0 7.5530e-03 1.5 0.00e+00 0.0 2.2e+03 6.0e+03 0.0e+00 0 0 93 47 0 0 0 93 47 0 0</div><div>VecScatterEnd 118 1.0 3.5505e-02 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>VecNormalize 54 1.0 7.4800e-03 1.4 9.95e+05 1.0 0.0e+00 0.0e+00 5.4e+01 0 1 0 0 8 0 1 0 0 11 1065</div><div>MatMult 66 1.0 6.4065e-01 1.1 5.67e+07 1.1 9.7e+02 1.1e+04 0.0e+00 0 44 41 38 0 0 44 41 38 0 694</div><div>MatSolve
81 1.0 3.9077e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 35 0 0 0 0 35 0 0 0 901</div><div>MatLUFactorSym 8 1.0 1.0749e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>MatLUFactorNum 9 1.0 4.7825e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 11 0 0 0 0 11 0 0 0 2288</div><div>MatILUFactorSym 1 1.0 1.5453e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 0 0 0 0 1 0</div></div><div><div>MatConvert 2 1.0 3.7113e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01 0 0 0 0 1 0 0
0 0 2 0</div><div>MatScale 16 1.0 3.7416e-05 1.2 4.90e+03 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1047</div><div>MatAssemblyBegin 59 1.0 3.1884e+00484.1 0.00e+00 0.0 4.2e+01 3.6e+05 6.0e+00 0 0 2 53 1 0 0 2 53 1 0</div><div>MatAssemblyEnd 59 1.0 1.2491e+00 1.0 0.00e+00 0.0 7.6e+01 1.5e+03 2.1e+01 0 0 3 0 3 0 0 3 0 4 0</div><div>MatGetValues 792 1.0 1.7098e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>MatGetRowIJ 1 1.0 5.9657e-06 2.0 0.00e+00 0.0
0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>MatGetOrdering 1 1.0 2.0580e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0</div><div>MatZeroEntries 1 1.0 5.6988e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>MatTranspose 16 1.0 6.8201e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01 0 0 0 0 5 0 0 0 0 7 0</div><div>MatMatMult 32 1.0 1.3172e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01 0 0 0 0 9 0 0 0 0 14
0</div><div>MatMatSolve 8 1.0 1.3842e-03 1.1 3.60e+02 1.0 0.0e+00 0.0e+00 3.2e+01 0 0 0 0 5 0 0 0 0 7 2</div><div>KSPGMRESOrthog 51 1.0 6.8248e-02 2.1 1.13e+07 1.0 0.0e+00 0.0e+00 5.1e+01 0 9 0 0 7 0 9 0 0 11 1325</div><div>KSPSetup 4 1.0 2.2410e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0</div><div>KSPSolve 8 1.0 1.0945e+00 1.0 1.21e+08 1.0 8.8e+02 1.1e+04 1.9e+02 0 95 37 34 27 0 95 37 34 40 877</div><div>PCSetUp 3 1.0 6.8884e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 1.1e+01 0 11
0 0 2 0 11 0 0 2 1588</div><div>PCSetUpOnBlocks 3 1.0 6.7468e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 7.0e+00 0 11 0 0 1 0 11 0 0 1 1622</div><div>PCApply 57 1.0 4.0561e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 5.7e+01 0 35 0 0 8 0 35 0 0 12 868</div><div>------------------------------------------------------------------------------------------------------------------------</div><div><br></div></div></div><div style="position:fixed"></div>
</div></body></html>