[petsc-users] Enquiry regarding log summary results

TAY wee-beng zonexo at gmail.com
Fri Oct 5 10:08:35 CDT 2012


On 4/10/2012 11:44 PM, Satish Balay wrote:
> On Thu, 4 Oct 2012, Matthew Knepley wrote:
>
>> the Mom-Z solve is bad because the assembly of the operator is
>> screwed up. You are communicating too many values. So, jsut go into
>> your code and count how many off process entries you set.
> running with '-info' should give debug info regarding this
> communication [stack size used for messages, and the number of mallocs
> needed etc..]
>
> Satish

Ok, I found my problem. I was using:

PetscLogStage  stage

and the time for the poisson eqn got combined into that of the 
momentum_z solve. Hence the confusion in results.

I have attached the new results. The time is taken up mainly by GAMG. I 
am using:

-poisson_pc_gamg_agg_nsmooths 1
-poisson_pc_type gamg

What are the possible ways to improve the efficiency or reduce the time?

If I use HYPRE's geometric multigrid, performance is somewhat better, 
such that momentum_x/y/z accounts for a total of 15% computing time.

 From the log_summary, for the momentum eqns, it seems that MatAssembly 
is the main problem, is that so?
-------------- next part --------------
  0.250000000000000       0.400000000000000       0.000000000000000E+000
  0.000000000000000E+000  0.000000000000000E+000  0.500000000000000     
   1.00000000000000       0.300000000000000                0     -400000
 AB,AA,BB   -3.99000009451993        4.20550010475563     
   4.07500009855721     
 size_x,size_y,size_z           75         404         629
 Warning - length difference between element and cell
 max_element_length,min_element_length,min_delta
  0.125984138460047       3.703647122769601E-004  1.000000000000000E-002
 maximum ngh_surfaces and ngh_vertics are           72          26
 minimum ngh_surfaces and ngh_vertics are            1           1
 IIB_I_cell_no_uvw_total1       14652       15150       15000       61272
       60750       62100
    1      0.00300000      0.33018343      0.41442094      1.19457187  0.49100420E-02 -0.13216209E+00  0.19023281E+08
    2      0.00214949      0.08042523      0.51210792      1.22010993 -0.13701147E-03 -0.10344701E+00  0.19018108E+08
    3      0.00203615      0.02146905      0.54566355      1.23244869  0.10544591E-05 -0.90478005E-01  0.19017334E+08
    4      0.00200855      0.01354072      0.56450141      1.24354136 -0.62836065E-05  0.20489605E-01  0.19017160E+08
    5      0.00199787      0.01065016      0.58015529      1.25235552 -0.13971205E-03  0.11199829E+00  0.19017088E+08
 escape_time reached, so abort
 fx_p,fy_p,fz_p,fx_v,fy_v,fz_v,cd_cl_cs_explicit1,cd_cl_cs_implicit1
 -2.636323997944099E-007  -2.23618995423546      -0.161689384600393     
  2.513651961966575E-007 -3.449431535817790E-003  1.492238456281484E-002
 -1.226720359775232E-008  -2.23963938577127      -0.146767000037579     
  7.158703928325935E-009  -2.42687397020185      -7.458519403454597E-002
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./a.out on a petsc-3.3-dev_shared_rel named n12-05 with 48 processors, by wtay Fri Oct  5 16:19:16 2012
Using Petsc Development HG revision: 9883b54053eca13dd473a4711adfd309d1436b6e  HG Date: Sun Sep 30 22:42:36 2012 -0500

                         Max       Max/Min        Avg      Total 
Time (sec):           1.840e+03      1.00338   1.837e+03
Objects:              5.680e+02      1.00000   5.680e+02
Flops:                1.735e+11      1.26417   1.414e+11  6.787e+12
Flops/sec:            9.426e+07      1.26189   7.696e+07  3.694e+09
MPI Messages:         1.682e+05      3.29645   1.099e+05  5.273e+06
MPI Message Lengths:  8.645e+09      2.00779   7.622e+04  4.019e+11
MPI Reductions:       1.374e+04      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total 
 0:      Main Stage: 2.4790e+02  13.5%  0.0000e+00   0.0%  0.000e+00   0.0%  0.000e+00        0.0%  3.600e+01   0.3% 
 1:         poisson: 1.4545e+03  79.2%  6.4730e+12  95.4%  5.245e+06  99.5%  7.496e+04       98.3%  1.146e+04  83.4% 
 2:      momentum_x: 4.5177e+01   2.5%  1.0421e+11   1.5%  9.400e+03   0.2%  4.179e+02        0.5%  7.480e+02   5.4% 
 3:      momentum_y: 4.4457e+01   2.4%  1.0494e+11   1.5%  9.400e+03   0.2%  4.236e+02        0.6%  7.480e+02   5.4% 
 4:      momentum_z: 4.5353e+01   2.5%  1.0502e+11   1.5%  9.400e+03   0.2%  4.235e+02        0.6%  7.480e+02   5.4% 

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %f - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flops                             --- Global ---  --- Stage ---   Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len Reduct  %T %f %M %L %R  %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage


--- Event Stage 1: poisson

MatMult            43722 1.0 1.0849e+03 1.3 1.14e+11 1.3 4.2e+06 8.9e+04 0.0e+00 52 62 80 93  0  66 65 80 95  0  3897
MatMultAdd          6728 1.0 8.7529e+01 2.9 8.16e+09 1.5 5.1e+05 1.8e+04 0.0e+00  3  4 10  2  0   3  4 10  2  0  3043
MatMultTranspose    6728 1.0 1.4643e+02 4.8 8.16e+09 1.5 5.1e+05 1.8e+04 0.0e+00  3  4 10  2  0   4  4 10  2  0  1819
MatSolve            3364 0.0 2.1961e-02 0.0 4.42e+06 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   201
MatLUFactorSym         1 1.0 5.3883e-05 3.6 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatLUFactorNum         1 1.0 5.2929e-05 5.8 1.04e+04 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   196
MatConvert             4 1.0 1.4656e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatScale              12 1.0 1.7515e-01 2.3 1.58e+07 1.4 4.0e+02 8.3e+04 0.0e+00  0  0  0  0  0   0  0  0  0  0  3186
MatAssemblyBegin      69 1.0 5.6784e+00 1.9 0.00e+00 0.0 1.2e+03 1.4e+04 7.4e+01  0  0  0  0  1   0  0  0  0  1     0
MatAssemblyEnd        69 1.0 1.4912e+00 1.1 0.00e+00 0.0 6.7e+03 1.0e+04 2.0e+02  0  0  0  0  1   0  0  0  0  2     0
MatGetRow         961340 1.1 5.2880e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetRowIJ            1 0.0 1.4782e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 0.0 7.2956e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.2e-02  0  0  0  0  0   0  0  0  0  0     0
MatCoarsen             4 1.0 6.3715e-01 1.2 0.00e+00 0.0 1.9e+04 1.8e+04 2.8e+02  0  0  0  0  2   0  0  0  0  2     0
MatAXPY                4 1.0 4.7550e-02 3.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatMatMult             4 1.0 1.1370e+00 1.0 1.10e+07 1.4 2.8e+03 3.7e+04 9.6e+01  0  0  0  0  1   0  0  0  0  1   352
MatMatMultSym          4 1.0 8.7142e-01 1.0 0.00e+00 0.0 2.4e+03 2.9e+04 8.8e+01  0  0  0  0  1   0  0  0  0  1     0
MatMatMultNum          4 1.0 2.6963e-01 1.1 1.10e+07 1.4 4.0e+02 8.3e+04 8.0e+00  0  0  0  0  0   0  0  0  0  0  1482
MatPtAP                4 1.0 3.5065e+00 1.0 4.77e+08 3.1 5.9e+03 6.9e+04 1.1e+02  0  0  0  0  1   0  0  0  0  1  2441
MatPtAPSymbolic        4 1.0 2.0355e+00 1.1 0.00e+00 0.0 5.3e+03 5.7e+04 1.0e+02  0  0  0  0  1   0  0  0  0  1     0
MatPtAPNumeric         4 1.0 1.4810e+00 1.0 4.77e+08 3.1 6.0e+02 1.7e+05 8.0e+00  0  0  0  0  0   0  0  0  0  0  5778
MatTrnMatMult          4 1.0 8.2675e+00 1.0 5.30e+08 4.6 2.7e+03 4.7e+05 1.2e+02  0  0  0  0  1   1  0  0  0  1  2706
MatGetLocalMat        20 1.0 4.2365e-01 1.8 0.00e+00 0.0 0.0e+00 0.0e+00 2.4e+01  0  0  0  0  0   0  0  0  0  0     0
MatGetBrAoCol         12 1.0 2.7735e-01 2.3 0.00e+00 0.0 2.8e+03 1.2e+05 1.6e+01  0  0  0  0  0   0  0  0  0  0     0
MatGetSymTrans         8 1.0 4.6894e-02 2.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPGMRESOrthog      1722 1.0 1.1443e+00 2.2 1.06e+08 1.1 0.0e+00 0.0e+00 1.7e+03  0  0  0  0 13   0  0  0  0 15  4026
KSPSetUp              11 1.0 1.4655e-01 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  0     0
KSPSolve              50 1.0 1.4493e+03 1.0 1.67e+11 1.3 5.2e+06 7.5e+04 1.1e+04 79 95 99 98 81 100100100100 97  4466
VecDot              1632 1.0 8.7316e+01 6.6 1.38e+09 1.1 0.0e+00 0.0e+00 1.6e+03  2  1  0  0 12   3  1  0  0 14   712
VecDotNorm2          816 1.0 8.7681e+01 5.3 2.77e+09 1.1 0.0e+00 0.0e+00 2.4e+03  3  2  0  0 18   3  2  0  0 21  1419
VecMDot             1722 1.0 1.0294e+00 3.8 5.30e+07 1.1 0.0e+00 0.0e+00 1.7e+03  0  0  0  0 13   0  0  0  0 15  2238
VecNorm             4274 1.0 4.4562e+01 7.8 7.45e+08 1.1 0.0e+00 0.0e+00 4.3e+03  1  0  0  0 31   1  1  0  0 37   751
VecScale           30320 1.0 2.6244e+01 3.3 3.24e+09 1.1 0.0e+00 0.0e+00 0.0e+00  1  2  0  0  0   1  2  0  0  0  5377
VecCopy             8514 1.0 9.8725e+00 3.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet             28819 1.0 9.3239e+00 4.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY            55510 1.0 7.0225e+01 3.0 1.29e+10 1.1 0.0e+00 0.0e+00 0.0e+00  2  8  0  0  0   3  9  0  0  0  8026
VecAYPX            53824 1.0 9.9916e+01 2.7 8.08e+09 1.1 0.0e+00 0.0e+00 0.0e+00  4  5  0  0  0   5  5  0  0  0  3525
VecAXPBYCZ          1632 1.0 3.1461e+01 2.1 2.77e+09 1.1 0.0e+00 0.0e+00 0.0e+00  1  2  0  0  0   2  2  0  0  0  3955
VecWAXPY            1632 1.0 3.2483e+01 2.4 1.38e+09 1.1 0.0e+00 0.0e+00 0.0e+00  1  1  0  0  0   2  1  0  0  0  1915
VecMAXPY            3408 1.0 3.1750e-01 3.2 6.27e+07 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  8575
VecAssemblyBegin     195 1.0 4.5760e+0012.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.8e+02  0  0  0  0  4   0  0  0  0  5     0
VecAssemblyEnd       195 1.0 1.6549e-03 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecPointwiseMult   40412 1.0 1.3043e+02 3.6 4.86e+09 1.1 0.0e+00 0.0e+00 0.0e+00  4  3  0  0  0   4  3  0  0  0  1622
VecScatterBegin    57281 1.0 2.0484e+01 4.3 0.00e+00 0.0 5.2e+06 7.5e+04 0.0e+00  1  0 99 98  0   1  0100100  0     0
VecScatterEnd      57281 1.0 6.3320e+0214.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 18  0  0  0  0  22  0  0  0  0     0
VecSetRandom           4 1.0 4.1965e-02 3.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize        3408 1.0 3.5365e+01116.9 1.61e+07 1.1 0.0e+00 0.0e+00 3.4e+03  0  0  0  0 25   0  0  0  0 30    20
PCSetUp                2 1.0 2.1552e+01 1.0 9.41e+08 1.3 4.2e+04 7.0e+04 1.2e+03  1  1  1  1  8   1  1  1  1 10  1933
PCSetUpOnBlocks     1682 1.0 2.8644e-03 2.1 1.04e+04 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     4
PCApply             1682 1.0 1.2035e+03 1.2 1.50e+11 1.3 5.0e+06 7.0e+04 5.1e+03 62 83 96 88 37  78 87 96 90 44  4672
PCGAMGgraph_AGG        4 1.0 2.6394e+00 1.0 1.10e+07 1.4 1.2e+03 4.2e+04 7.6e+01  0  0  0  0  1   0  0  0  0  1   151
PCGAMGcoarse_AGG       4 1.0 9.4073e+00 1.0 5.30e+08 4.6 2.4e+04 7.9e+04 4.8e+02  1  0  0  0  3   1  0  0  0  4  2378
PCGAMGProl_AGG         4 1.0 2.3864e+00 1.0 0.00e+00 0.0 3.2e+03 4.6e+04 1.1e+02  0  0  0  0  1   0  0  0  0  1     0
PCGAMGPOpt_AGG         4 1.0 3.6797e+00 1.0 2.53e+08 1.2 6.8e+03 6.4e+04 2.1e+02  0  0  0  0  2   0  0  0  0  2  2809

--- Event Stage 2: momentum_x

MatMult               98 1.0 2.9512e+00 1.1 5.31e+08 1.2 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   6 23 98 99  0  8071
MatSolve             147 1.0 3.5473e+00 1.1 7.79e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 33  0  0  0  9840
MatLUFactorNum        49 1.0 4.3144e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   9 19  0  0  0  4482
MatILUFactorSym        1 1.0 8.6374e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatAssemblyBegin      49 1.0 6.6507e+0030.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   7  0  0  0 13     0
MatAssemblyEnd        49 1.0 2.4696e+00 1.8 0.00e+00 0.0 1.9e+02 5.9e+04 8.0e+00  0  0  0  0  0   5  0  2  1  1     0
MatGetRowIJ            1 1.0 6.9141e-06 7.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 9.7661e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSetUp              98 1.0 3.3892e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve              49 1.0 1.3279e+01 1.0 2.32e+09 1.1 9.2e+03 2.4e+05 3.5e+02  1  2  0  1  3  29100 98 99 47  7848
VecDot                98 1.0 5.0782e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  7356
VecDotNorm2           49 1.0 5.2031e-01 2.1 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02  0  0  0  0  1   1  7  0  0 20 14359
VecNorm               98 1.0 7.8282e-01 6.4 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  4772
VecCopy               98 1.0 3.3748e-01 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecSet               295 1.0 6.4768e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecAXPBYCZ            98 1.0 6.4953e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  7  0  0  0 11502
VecWAXPY              98 1.0 5.8485e-01 1.2 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  4  0  0  0  6387
VecAssemblyBegin      98 1.0 1.3877e+00100.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02  0  0  0  0  2   1  0  0  0 39     0
VecAssemblyEnd        98 1.0 1.2057e-03 5.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin       98 1.0 6.0991e-02 2.7 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   0  0 98 99  0     0
VecScatterEnd         98 1.0 3.4077e-01 2.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
PCSetUp               98 1.0 4.4126e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   9 19  0  0  1  4382
PCSetUpOnBlocks       49 1.0 4.4121e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0   9 19  0  0  0  4382
PCApply              147 1.0 3.7470e+00 1.1 7.79e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 33  0  0  0  9316

--- Event Stage 3: momentum_y

MatMult               98 1.0 2.9856e+00 1.1 5.36e+08 1.2 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   6 23 98 99  0  8055
MatSolve             147 1.0 3.5550e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 34  0  0  0  9913
MatLUFactorNum        49 1.0 4.6245e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   9 19  0  0  0  4218
MatILUFactorSym        1 1.0 8.6295e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatAssemblyBegin      49 1.0 4.5507e+00207.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   5  0  0  0 13     0
MatAssemblyEnd        49 1.0 2.4393e+00 1.8 0.00e+00 0.0 1.9e+02 6.0e+04 8.0e+00  0  0  0  0  0   5  0  2  1  1     0
MatGetRowIJ            1 1.0 1.1921e-06 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 9.5730e-03 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSetUp              98 1.0 3.3241e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve              49 1.0 1.3589e+01 1.0 2.34e+09 1.1 9.2e+03 2.4e+05 3.5e+02  1  2  0  1  3  31100 98 99 47  7723
VecDot                98 1.0 5.2115e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  7168
VecDotNorm2           49 1.0 5.1932e-01 2.1 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02  0  0  0  0  1   1  7  0  0 20 14386
VecNorm               98 1.0 8.4822e-01 6.8 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  4404
VecCopy               98 1.0 3.1552e-01 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecSet               295 1.0 6.1609e-01 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecAXPBYCZ            98 1.0 6.5669e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  7  0  0  0 11377
VecWAXPY              98 1.0 5.8188e-01 1.2 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  4  0  0  0  6420
VecAssemblyBegin      98 1.0 1.3747e+0099.2 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02  0  0  0  0  2   1  0  0  0 39     0
VecAssemblyEnd        98 1.0 1.2748e-03 6.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin       98 1.0 6.1957e-02 3.0 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   0  0 98 99  0     0
VecScatterEnd         98 1.0 3.4862e-01 2.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
PCSetUp               98 1.0 4.7222e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0  10 19  0  0  1  4131
PCSetUpOnBlocks       49 1.0 4.7218e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0  10 19  0  0  0  4131
PCApply              147 1.0 3.8046e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 34  0  0  0  9262

--- Event Stage 4: momentum_z

MatMult               98 1.0 3.0181e+00 1.2 5.37e+08 1.2 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   6 23 98 99  0  7974
MatSolve             147 1.0 3.5577e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 34  0  0  0  9913
MatLUFactorNum        49 1.0 4.4279e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   9 19  0  0  0  4413
MatILUFactorSym        1 1.0 8.8243e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatAssemblyBegin      49 1.0 5.2133e+00110.2 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   6  0  0  0 13     0
MatAssemblyEnd        49 1.0 2.4158e+00 1.8 0.00e+00 0.0 1.9e+02 6.0e+04 8.0e+00  0  0  0  0  0   5  0  2  1  1     0
MatGetRowIJ            1 1.0 1.9073e-06 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 9.3498e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSetUp              98 1.0 3.3165e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve              49 1.0 1.3420e+01 1.0 2.34e+09 1.1 9.2e+03 2.4e+05 3.5e+02  1  2  0  1  3  30100 98 99 47  7826
VecDot                98 1.0 5.2100e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  7170
VecDotNorm2           49 1.0 5.3533e-01 2.2 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02  0  0  0  0  1   1  7  0  0 20 13956
VecNorm               98 1.0 7.6278e-01 5.2 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01  0  0  0  0  1   1  4  0  0 13  4897
VecCopy               98 1.0 3.4060e-01 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecSet               295 1.0 6.4979e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecAXPBYCZ            98 1.0 6.4095e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  7  0  0  0 11656
VecWAXPY              98 1.0 5.7849e-01 1.1 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  4  0  0  0  6457
VecAssemblyBegin      98 1.0 1.3208e+0054.2 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02  0  0  0  0  2   1  0  0  0 39     0
VecAssemblyEnd        98 1.0 1.2825e-03 6.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin       98 1.0 6.1303e-02 2.8 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00  0  0  0  1  0   0  0 98 99  0     0
VecScatterEnd         98 1.0 3.7855e-01 2.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
PCSetUp               98 1.0 4.5256e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   9 19  0  0  1  4317
PCSetUpOnBlocks       49 1.0 4.5250e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  0   9 19  0  0  0  4318
PCApply              147 1.0 3.8119e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   8 34  0  0  0  9252
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

              Matrix    12             56    568064932     0
       Krylov Solver     4             13        31728     0
              Vector     8             85    129979032     0
      Vector Scatter     0             12        12720     0
           Index Set     0             16      5102648     0
      Preconditioner     0             13        12540     0
              Viewer     1              0            0     0

--- Event Stage 1: poisson

              Matrix   117             76    368468980     0
      Matrix Coarsen     4              4         2480     0
       Krylov Solver    10              4       120512     0
              Vector   225            175    107847696     0
      Vector Scatter    31             22        23320     0
           Index Set    81             74       737472     0
      Preconditioner    11              4         3456     0
         PetscRandom     4              4         2464     0

--- Event Stage 2: momentum_x

              Matrix     1              0            0     0
       Krylov Solver     1              0            0     0
              Vector    10              1         1504     0
      Vector Scatter     1              0            0     0
           Index Set     5              2       120488     0
      Preconditioner     2              0            0     0

--- Event Stage 3: momentum_y

              Matrix     1              0            0     0
       Krylov Solver     1              0            0     0
              Vector    10              1         1504     0
      Vector Scatter     1              0            0     0
           Index Set     5              2       122104     0
      Preconditioner     2              0            0     0

--- Event Stage 4: momentum_z

              Matrix     1              0            0     0
       Krylov Solver     1              0            0     0
              Vector    10              1         1504     0
      Vector Scatter     1              0            0     0
           Index Set     5              2       122096     0
      Preconditioner     2              0            0     0
========================================================================================================================
Average time to get PetscTime(): 1.90735e-07
Average time for MPI_Barrier(): 4.04358e-05
Average time for zero size MPI_Send(): 2.70406e-05
#PETSc Option Table entries:
-log_summary
-poisson_pc_gamg_agg_nsmooths 1
-poisson_pc_type gamg
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure run at: Mon Oct  1 11:36:09 2012
Configure options: --with-mpi-dir=/opt/openmpi-1.5.3/ --with-blas-lapack-dir=/opt/intelcpro-11.1.059/mkl/lib/em64t/ --with-debugging=0 --download-hypre=1 --prefix=/home/wtay/Lib/petsc-3.3-dev_shared_rel --known-mpi-shared=1 --with-shared-libraries
-----------------------------------------
Libraries compiled on Mon Oct  1 11:36:09 2012 on hpc12 
Machine characteristics: Linux-2.6.32-279.1.1.el6.x86_64-x86_64-with-centos-6.3-Final
Using PETSc directory: /home/wtay/Codes/petsc-dev
Using PETSc arch: petsc-3.3-dev_shared_rel
-----------------------------------------

Using C compiler: /opt/openmpi-1.5.3/bin/mpicc  -fPIC -wd1572 -O3  ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: /opt/openmpi-1.5.3/bin/mpif90  -fPIC -O3   ${FOPTFLAGS} ${FFLAGS} 
-----------------------------------------

Using include paths: -I/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/include -I/home/wtay/Codes/petsc-dev/include -I/home/wtay/Codes/petsc-dev/include -I/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/include -I/opt/openmpi-1.5.3/include
-----------------------------------------

Using C linker: /opt/openmpi-1.5.3/bin/mpicc
Using Fortran linker: /opt/openmpi-1.5.3/bin/mpif90
Using libraries: -Wl,-rpath,/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -L/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -lpetsc -lX11 -Wl,-rpath,/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -L/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -lHYPRE -Wl,-rpath,/opt/openmpi-1.5.3/lib -L/opt/openmpi-1.5.3/lib -Wl,-rpath,/opt/intelcpro-11.1.059/lib/intel64 -L/opt/intelcpro-11.1.059/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -lmpi_cxx -lstdc++ -lpthread -Wl,-rpath,/opt/intelcpro-11.1.059/mkl/lib/em64t -L/opt/intelcpro-11.1.059/mkl/lib/em64t -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lmpi_f90 -lmpi_f77 -lm -lm -lifport -lifcore -lm -lm -lm -lmpi_cxx -lstdc++ -lmpi_cxx -lstdc++ -ldl -lmpi -lnsl -lutil -limf -lsvml -lipgo -ldecimal -lgcc_s -lirc -lpthread -lirc_s -ldl 
-----------------------------------------



More information about the petsc-users mailing list