[petsc-users] profiling the total time
khalid ashraf
khalid_eee at yahoo.com
Thu Jun 16 04:14:36 CDT 2011
Hi,
I am trying to benchmark the performance of my code on 8 processors and am
trying to find where most of the time is used. When I look at the breakdown of
the stages time required, the total add up to ~7s however, the main stage time
is ~350s. I am not being able to find out the stage which is taking so much
extra time. Could you please suggest something ?
Thanks.
Time (sec): 3.508e+02 1.00000 3.508e+02
Objects: 3.310e+02 1.00000 3.310e+02
Flops: 1.279e+08 1.03856 1.267e+08 1.014e+09
Flops/sec: 3.646e+05 1.03856 3.612e+05 2.890e+06
Memory: 2.817e+07 1.06078 2.221e+08
MPI Messages: 3.150e+02 1.26506 2.985e+02 2.388e+03
MPI Message Lengths: 4.011e+06 1.83678 1.191e+04 2.843e+07
MPI Reductions: 6.970e+02 1.00000
VecMDot 51 1.0 6.0108e-02 2.4 5.65e+06 1.0 0.0e+00 0.0e+00 5.1e+01
0 4 0 0 7 0 4 0 0 11 752
VecNorm 67 1.0 1.1708e-02 1.1 7.41e+05 1.0 0.0e+00 0.0e+00 6.7e+01
0 1 0 0 10 0 1 0 0 14 507
VecScale 75 1.0 1.2923e-03 1.0 3.69e+05 1.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 2282
VecCopy 20 1.0 6.3189e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
VecSet 66 1.0 5.0619e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
VecAXPY 51 1.0 8.9883e-04 1.1 2.95e+05 1.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 2625
VecWAXPY 30 1.0 2.0204e-03 1.2 1.09e+05 1.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 430
VecMAXPY 54 1.0 8.1123e-03 1.0 6.28e+06 1.0 0.0e+00 0.0e+00 0.0e+00
0 5 0 0 0 0 5 0 0 0 6192
VecAssemblyBegin 3 1.0 4.9893e-04 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.0e+00
0 0 0 0 1 0 0 0 0 2 0
VecAssemblyEnd 3 1.0 2.0169e-05 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
VecPointwiseMult 9 1.0 1.7315e-04 1.0 1.84e+04 1.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 852
VecScatterBegin 118 1.0 7.5530e-03 1.5 0.00e+00 0.0 2.2e+03 6.0e+03 0.0e+00
0 0 93 47 0 0 0 93 47 0 0
VecScatterEnd 118 1.0 3.5505e-02 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
VecNormalize 54 1.0 7.4800e-03 1.4 9.95e+05 1.0 0.0e+00 0.0e+00 5.4e+01
0 1 0 0 8 0 1 0 0 11 1065
MatMult 66 1.0 6.4065e-01 1.1 5.67e+07 1.1 9.7e+02 1.1e+04 0.0e+00
0 44 41 38 0 0 44 41 38 0 694
MatSolve 81 1.0 3.9077e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 0.0e+00
0 35 0 0 0 0 35 0 0 0 901
MatLUFactorSym 8 1.0 1.0749e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
MatLUFactorNum 9 1.0 4.7825e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 0.0e+00
0 11 0 0 0 0 11 0 0 0 2288
MatILUFactorSym 1 1.0 1.5453e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00
0 0 0 0 0 0 0 0 0 1 0
MatConvert 2 1.0 3.7113e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01
0 0 0 0 1 0 0 0 0 2 0
MatScale 16 1.0 3.7416e-05 1.2 4.90e+03 1.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 1047
MatAssemblyBegin 59 1.0 3.1884e+00484.1 0.00e+00 0.0 4.2e+01 3.6e+05
6.0e+00 0 0 2 53 1 0 0 2 53 1 0
MatAssemblyEnd 59 1.0 1.2491e+00 1.0 0.00e+00 0.0 7.6e+01 1.5e+03 2.1e+01
0 0 3 0 3 0 0 3 0 4 0
MatGetValues 792 1.0 1.7098e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
MatGetRowIJ 1 1.0 5.9657e-06 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 1.0 2.0580e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00
0 0 0 0 1 0 0 0 0 1 0
MatZeroEntries 1 1.0 5.6988e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
MatTranspose 16 1.0 6.8201e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01
0 0 0 0 5 0 0 0 0 7 0
MatMatMult 32 1.0 1.3172e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01
0 0 0 0 9 0 0 0 0 14 0
MatMatSolve 8 1.0 1.3842e-03 1.1 3.60e+02 1.0 0.0e+00 0.0e+00 3.2e+01
0 0 0 0 5 0 0 0 0 7 2
KSPGMRESOrthog 51 1.0 6.8248e-02 2.1 1.13e+07 1.0 0.0e+00 0.0e+00 5.1e+01
0 9 0 0 7 0 9 0 0 11 1325
KSPSetup 4 1.0 2.2410e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00
0 0 0 0 0 0 0 0 0 0 0
KSPSolve 8 1.0 1.0945e+00 1.0 1.21e+08 1.0 8.8e+02 1.1e+04 1.9e+02
0 95 37 34 27 0 95 37 34 40 877
PCSetUp 3 1.0 6.8884e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 1.1e+01
0 11 0 0 2 0 11 0 0 2 1588
PCSetUpOnBlocks 3 1.0 6.7468e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 7.0e+00
0 11 0 0 1 0 11 0 0 1 1622
PCApply 57 1.0 4.0561e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 5.7e+01
0 35 0 0 8 0 35 0 0 12 868
------------------------------------------------------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20110616/fc514af0/attachment-0001.htm>
More information about the petsc-users
mailing list