[petsc-users] profiling the total time

khalid ashraf khalid_eee at yahoo.com
Thu Jun 16 04:14:36 CDT 2011


Hi,
I am trying to benchmark the performance of my code on 8 processors and am 
trying to find where most of the time is used. When I look at the breakdown of 
the stages time required, the total add up to ~7s however, the main stage time 
is ~350s. I am not being able to find out the stage which is taking so much 
extra time.   Could you please suggest something ?

Thanks.  

Time (sec):           3.508e+02      1.00000   3.508e+02
Objects:              3.310e+02      1.00000   3.310e+02
Flops:                1.279e+08      1.03856   1.267e+08  1.014e+09
Flops/sec:            3.646e+05      1.03856   3.612e+05  2.890e+06
Memory:               2.817e+07      1.06078              2.221e+08
MPI Messages:         3.150e+02      1.26506   2.985e+02  2.388e+03
MPI Message Lengths:  4.011e+06      1.83678   1.191e+04  2.843e+07
MPI Reductions:       6.970e+02      1.00000



VecMDot               51 1.0 6.0108e-02 2.4 5.65e+06 1.0 0.0e+00 0.0e+00 5.1e+01 
 0  4  0  0  7   0  4  0  0 11   752
VecNorm               67 1.0 1.1708e-02 1.1 7.41e+05 1.0 0.0e+00 0.0e+00 6.7e+01 
 0  1  0  0 10   0  1  0  0 14   507
VecScale              75 1.0 1.2923e-03 1.0 3.69e+05 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0  2282
VecCopy               20 1.0 6.3189e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
VecSet                66 1.0 5.0619e-03 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
VecAXPY               51 1.0 8.9883e-04 1.1 2.95e+05 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0  2625
VecWAXPY              30 1.0 2.0204e-03 1.2 1.09e+05 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0   430
VecMAXPY              54 1.0 8.1123e-03 1.0 6.28e+06 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  5  0  0  0   0  5  0  0  0  6192
VecAssemblyBegin       3 1.0 4.9893e-04 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.0e+00 
 0  0  0  0  1   0  0  0  0  2     0
VecAssemblyEnd         3 1.0 2.0169e-05 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
VecPointwiseMult       9 1.0 1.7315e-04 1.0 1.84e+04 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0   852
VecScatterBegin      118 1.0 7.5530e-03 1.5 0.00e+00 0.0 2.2e+03 6.0e+03 0.0e+00 
 0  0 93 47  0   0  0 93 47  0     0
VecScatterEnd        118 1.0 3.5505e-02 3.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
VecNormalize          54 1.0 7.4800e-03 1.4 9.95e+05 1.0 0.0e+00 0.0e+00 5.4e+01 
 0  1  0  0  8   0  1  0  0 11  1065
MatMult               66 1.0 6.4065e-01 1.1 5.67e+07 1.1 9.7e+02 1.1e+04 0.0e+00 
 0 44 41 38  0   0 44 41 38  0   694
MatSolve              81 1.0 3.9077e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 0.0e+00 
 0 35  0  0  0   0 35  0  0  0   901
MatLUFactorSym         8 1.0 1.0749e-05 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
MatLUFactorNum         9 1.0 4.7825e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 0.0e+00 
 0 11  0  0  0   0 11  0  0  0  2288
MatILUFactorSym        1 1.0 1.5453e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 
 0  0  0  0  0   0  0  0  0  1     0
MatConvert             2 1.0 3.7113e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01 
 0  0  0  0  1   0  0  0  0  2     0
MatScale              16 1.0 3.7416e-05 1.2 4.90e+03 1.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0  1047
MatAssemblyBegin      59 1.0 3.1884e+00484.1 0.00e+00 0.0 4.2e+01 3.6e+05 
6.0e+00  0  0  2 53  1   0  0  2 53  1     0
MatAssemblyEnd        59 1.0 1.2491e+00 1.0 0.00e+00 0.0 7.6e+01 1.5e+03 2.1e+01 
 0  0  3  0  3   0  0  3  0  4     0
MatGetValues         792 1.0 1.7098e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
MatGetRowIJ            1 1.0 5.9657e-06 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 2.0580e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 
 0  0  0  0  1   0  0  0  0  1     0
MatZeroEntries         1 1.0 5.6988e-04 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
MatTranspose          16 1.0 6.8201e-04 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.2e+01 
 0  0  0  0  5   0  0  0  0  7     0
MatMatMult            32 1.0 1.3172e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 6.4e+01 
 0  0  0  0  9   0  0  0  0 14     0
MatMatSolve            8 1.0 1.3842e-03 1.1 3.60e+02 1.0 0.0e+00 0.0e+00 3.2e+01 
 0  0  0  0  5   0  0  0  0  7     2
KSPGMRESOrthog        51 1.0 6.8248e-02 2.1 1.13e+07 1.0 0.0e+00 0.0e+00 5.1e+01 
 0  9  0  0  7   0  9  0  0 11  1325
KSPSetup               4 1.0 2.2410e-03 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 
 0  0  0  0  0   0  0  0  0  0     0
KSPSolve               8 1.0 1.0945e+00 1.0 1.21e+08 1.0 8.8e+02 1.1e+04 1.9e+02 
 0 95 37 34 27   0 95 37 34 40   877
PCSetUp                3 1.0 6.8884e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 1.1e+01 
 0 11  0  0  2   0 11  0  0  2  1588
PCSetUpOnBlocks        3 1.0 6.7468e-02 1.0 1.38e+07 1.0 0.0e+00 0.0e+00 7.0e+00 
 0 11  0  0  1   0 11  0  0  1  1622
PCApply               57 1.0 4.0561e-01 1.0 4.40e+07 1.0 0.0e+00 0.0e+00 5.7e+01 
 0 35  0  0  8   0 35  0  0 12   868
------------------------------------------------------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20110616/fc514af0/attachment-0001.htm>


More information about the petsc-users mailing list