[petsc-users] Enquiry regarding log summary results
TAY wee-beng
zonexo at gmail.com
Fri Oct 5 10:08:35 CDT 2012
On 4/10/2012 11:44 PM, Satish Balay wrote:
> On Thu, 4 Oct 2012, Matthew Knepley wrote:
>
>> the Mom-Z solve is bad because the assembly of the operator is
>> screwed up. You are communicating too many values. So, jsut go into
>> your code and count how many off process entries you set.
> running with '-info' should give debug info regarding this
> communication [stack size used for messages, and the number of mallocs
> needed etc..]
>
> Satish
Ok, I found my problem. I was using:
PetscLogStage stage
and the time for the poisson eqn got combined into that of the
momentum_z solve. Hence the confusion in results.
I have attached the new results. The time is taken up mainly by GAMG. I
am using:
-poisson_pc_gamg_agg_nsmooths 1
-poisson_pc_type gamg
What are the possible ways to improve the efficiency or reduce the time?
If I use HYPRE's geometric multigrid, performance is somewhat better,
such that momentum_x/y/z accounts for a total of 15% computing time.
From the log_summary, for the momentum eqns, it seems that MatAssembly
is the main problem, is that so?
-------------- next part --------------
0.250000000000000 0.400000000000000 0.000000000000000E+000
0.000000000000000E+000 0.000000000000000E+000 0.500000000000000
1.00000000000000 0.300000000000000 0 -400000
AB,AA,BB -3.99000009451993 4.20550010475563
4.07500009855721
size_x,size_y,size_z 75 404 629
Warning - length difference between element and cell
max_element_length,min_element_length,min_delta
0.125984138460047 3.703647122769601E-004 1.000000000000000E-002
maximum ngh_surfaces and ngh_vertics are 72 26
minimum ngh_surfaces and ngh_vertics are 1 1
IIB_I_cell_no_uvw_total1 14652 15150 15000 61272
60750 62100
1 0.00300000 0.33018343 0.41442094 1.19457187 0.49100420E-02 -0.13216209E+00 0.19023281E+08
2 0.00214949 0.08042523 0.51210792 1.22010993 -0.13701147E-03 -0.10344701E+00 0.19018108E+08
3 0.00203615 0.02146905 0.54566355 1.23244869 0.10544591E-05 -0.90478005E-01 0.19017334E+08
4 0.00200855 0.01354072 0.56450141 1.24354136 -0.62836065E-05 0.20489605E-01 0.19017160E+08
5 0.00199787 0.01065016 0.58015529 1.25235552 -0.13971205E-03 0.11199829E+00 0.19017088E+08
escape_time reached, so abort
fx_p,fy_p,fz_p,fx_v,fy_v,fz_v,cd_cl_cs_explicit1,cd_cl_cs_implicit1
-2.636323997944099E-007 -2.23618995423546 -0.161689384600393
2.513651961966575E-007 -3.449431535817790E-003 1.492238456281484E-002
-1.226720359775232E-008 -2.23963938577127 -0.146767000037579
7.158703928325935E-009 -2.42687397020185 -7.458519403454597E-002
************************************************************************************************************************
*** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document ***
************************************************************************************************************************
---------------------------------------------- PETSc Performance Summary: ----------------------------------------------
./a.out on a petsc-3.3-dev_shared_rel named n12-05 with 48 processors, by wtay Fri Oct 5 16:19:16 2012
Using Petsc Development HG revision: 9883b54053eca13dd473a4711adfd309d1436b6e HG Date: Sun Sep 30 22:42:36 2012 -0500
Max Max/Min Avg Total
Time (sec): 1.840e+03 1.00338 1.837e+03
Objects: 5.680e+02 1.00000 5.680e+02
Flops: 1.735e+11 1.26417 1.414e+11 6.787e+12
Flops/sec: 9.426e+07 1.26189 7.696e+07 3.694e+09
MPI Messages: 1.682e+05 3.29645 1.099e+05 5.273e+06
MPI Message Lengths: 8.645e+09 2.00779 7.622e+04 4.019e+11
MPI Reductions: 1.374e+04 1.00000
Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
e.g., VecAXPY() for real vectors of length N --> 2N flops
and VecAXPY() for complex vectors of length N --> 8N flops
Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions --
Avg %Total Avg %Total counts %Total Avg %Total counts %Total
0: Main Stage: 2.4790e+02 13.5% 0.0000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% 3.600e+01 0.3%
1: poisson: 1.4545e+03 79.2% 6.4730e+12 95.4% 5.245e+06 99.5% 7.496e+04 98.3% 1.146e+04 83.4%
2: momentum_x: 4.5177e+01 2.5% 1.0421e+11 1.5% 9.400e+03 0.2% 4.179e+02 0.5% 7.480e+02 5.4%
3: momentum_y: 4.4457e+01 2.4% 1.0494e+11 1.5% 9.400e+03 0.2% 4.236e+02 0.6% 7.480e+02 5.4%
4: momentum_z: 4.5353e+01 2.5% 1.0502e+11 1.5% 9.400e+03 0.2% 4.235e+02 0.6% 7.480e+02 5.4%
------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
Count: number of times phase was executed
Time and Flops: Max - maximum over all processors
Ratio - ratio of maximum to minimum over all processors
Mess: number of messages sent
Avg. len: average message length
Reduct: number of global reductions
Global: entire computation
Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
%T - percent time in this phase %f - percent flops in this phase
%M - percent messages in this phase %L - percent message lengths in this phase
%R - percent reductions in this phase
Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event Count Time (sec) Flops --- Global --- --- Stage --- Total
Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------
--- Event Stage 0: Main Stage
--- Event Stage 1: poisson
MatMult 43722 1.0 1.0849e+03 1.3 1.14e+11 1.3 4.2e+06 8.9e+04 0.0e+00 52 62 80 93 0 66 65 80 95 0 3897
MatMultAdd 6728 1.0 8.7529e+01 2.9 8.16e+09 1.5 5.1e+05 1.8e+04 0.0e+00 3 4 10 2 0 3 4 10 2 0 3043
MatMultTranspose 6728 1.0 1.4643e+02 4.8 8.16e+09 1.5 5.1e+05 1.8e+04 0.0e+00 3 4 10 2 0 4 4 10 2 0 1819
MatSolve 3364 0.0 2.1961e-02 0.0 4.42e+06 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 201
MatLUFactorSym 1 1.0 5.3883e-05 3.6 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatLUFactorNum 1 1.0 5.2929e-05 5.8 1.04e+04 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 196
MatConvert 4 1.0 1.4656e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatScale 12 1.0 1.7515e-01 2.3 1.58e+07 1.4 4.0e+02 8.3e+04 0.0e+00 0 0 0 0 0 0 0 0 0 0 3186
MatAssemblyBegin 69 1.0 5.6784e+00 1.9 0.00e+00 0.0 1.2e+03 1.4e+04 7.4e+01 0 0 0 0 1 0 0 0 0 1 0
MatAssemblyEnd 69 1.0 1.4912e+00 1.1 0.00e+00 0.0 6.7e+03 1.0e+04 2.0e+02 0 0 0 0 1 0 0 0 0 2 0
MatGetRow 961340 1.1 5.2880e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetRowIJ 1 0.0 1.4782e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 0.0 7.2956e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 4.2e-02 0 0 0 0 0 0 0 0 0 0 0
MatCoarsen 4 1.0 6.3715e-01 1.2 0.00e+00 0.0 1.9e+04 1.8e+04 2.8e+02 0 0 0 0 2 0 0 0 0 2 0
MatAXPY 4 1.0 4.7550e-02 3.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatMatMult 4 1.0 1.1370e+00 1.0 1.10e+07 1.4 2.8e+03 3.7e+04 9.6e+01 0 0 0 0 1 0 0 0 0 1 352
MatMatMultSym 4 1.0 8.7142e-01 1.0 0.00e+00 0.0 2.4e+03 2.9e+04 8.8e+01 0 0 0 0 1 0 0 0 0 1 0
MatMatMultNum 4 1.0 2.6963e-01 1.1 1.10e+07 1.4 4.0e+02 8.3e+04 8.0e+00 0 0 0 0 0 0 0 0 0 0 1482
MatPtAP 4 1.0 3.5065e+00 1.0 4.77e+08 3.1 5.9e+03 6.9e+04 1.1e+02 0 0 0 0 1 0 0 0 0 1 2441
MatPtAPSymbolic 4 1.0 2.0355e+00 1.1 0.00e+00 0.0 5.3e+03 5.7e+04 1.0e+02 0 0 0 0 1 0 0 0 0 1 0
MatPtAPNumeric 4 1.0 1.4810e+00 1.0 4.77e+08 3.1 6.0e+02 1.7e+05 8.0e+00 0 0 0 0 0 0 0 0 0 0 5778
MatTrnMatMult 4 1.0 8.2675e+00 1.0 5.30e+08 4.6 2.7e+03 4.7e+05 1.2e+02 0 0 0 0 1 1 0 0 0 1 2706
MatGetLocalMat 20 1.0 4.2365e-01 1.8 0.00e+00 0.0 0.0e+00 0.0e+00 2.4e+01 0 0 0 0 0 0 0 0 0 0 0
MatGetBrAoCol 12 1.0 2.7735e-01 2.3 0.00e+00 0.0 2.8e+03 1.2e+05 1.6e+01 0 0 0 0 0 0 0 0 0 0 0
MatGetSymTrans 8 1.0 4.6894e-02 2.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPGMRESOrthog 1722 1.0 1.1443e+00 2.2 1.06e+08 1.1 0.0e+00 0.0e+00 1.7e+03 0 0 0 0 13 0 0 0 0 15 4026
KSPSetUp 11 1.0 1.4655e-01 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01 0 0 0 0 0 0 0 0 0 0 0
KSPSolve 50 1.0 1.4493e+03 1.0 1.67e+11 1.3 5.2e+06 7.5e+04 1.1e+04 79 95 99 98 81 100100100100 97 4466
VecDot 1632 1.0 8.7316e+01 6.6 1.38e+09 1.1 0.0e+00 0.0e+00 1.6e+03 2 1 0 0 12 3 1 0 0 14 712
VecDotNorm2 816 1.0 8.7681e+01 5.3 2.77e+09 1.1 0.0e+00 0.0e+00 2.4e+03 3 2 0 0 18 3 2 0 0 21 1419
VecMDot 1722 1.0 1.0294e+00 3.8 5.30e+07 1.1 0.0e+00 0.0e+00 1.7e+03 0 0 0 0 13 0 0 0 0 15 2238
VecNorm 4274 1.0 4.4562e+01 7.8 7.45e+08 1.1 0.0e+00 0.0e+00 4.3e+03 1 0 0 0 31 1 1 0 0 37 751
VecScale 30320 1.0 2.6244e+01 3.3 3.24e+09 1.1 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 1 2 0 0 0 5377
VecCopy 8514 1.0 9.8725e+00 3.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecSet 28819 1.0 9.3239e+00 4.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecAXPY 55510 1.0 7.0225e+01 3.0 1.29e+10 1.1 0.0e+00 0.0e+00 0.0e+00 2 8 0 0 0 3 9 0 0 0 8026
VecAYPX 53824 1.0 9.9916e+01 2.7 8.08e+09 1.1 0.0e+00 0.0e+00 0.0e+00 4 5 0 0 0 5 5 0 0 0 3525
VecAXPBYCZ 1632 1.0 3.1461e+01 2.1 2.77e+09 1.1 0.0e+00 0.0e+00 0.0e+00 1 2 0 0 0 2 2 0 0 0 3955
VecWAXPY 1632 1.0 3.2483e+01 2.4 1.38e+09 1.1 0.0e+00 0.0e+00 0.0e+00 1 1 0 0 0 2 1 0 0 0 1915
VecMAXPY 3408 1.0 3.1750e-01 3.2 6.27e+07 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 8575
VecAssemblyBegin 195 1.0 4.5760e+0012.0 0.00e+00 0.0 0.0e+00 0.0e+00 5.8e+02 0 0 0 0 4 0 0 0 0 5 0
VecAssemblyEnd 195 1.0 1.6549e-03 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecPointwiseMult 40412 1.0 1.3043e+02 3.6 4.86e+09 1.1 0.0e+00 0.0e+00 0.0e+00 4 3 0 0 0 4 3 0 0 0 1622
VecScatterBegin 57281 1.0 2.0484e+01 4.3 0.00e+00 0.0 5.2e+06 7.5e+04 0.0e+00 1 0 99 98 0 1 0100100 0 0
VecScatterEnd 57281 1.0 6.3320e+0214.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 18 0 0 0 0 22 0 0 0 0 0
VecSetRandom 4 1.0 4.1965e-02 3.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecNormalize 3408 1.0 3.5365e+01116.9 1.61e+07 1.1 0.0e+00 0.0e+00 3.4e+03 0 0 0 0 25 0 0 0 0 30 20
PCSetUp 2 1.0 2.1552e+01 1.0 9.41e+08 1.3 4.2e+04 7.0e+04 1.2e+03 1 1 1 1 8 1 1 1 1 10 1933
PCSetUpOnBlocks 1682 1.0 2.8644e-03 2.1 1.04e+04 0.0 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 0 0 0 0 0 0 4
PCApply 1682 1.0 1.2035e+03 1.2 1.50e+11 1.3 5.0e+06 7.0e+04 5.1e+03 62 83 96 88 37 78 87 96 90 44 4672
PCGAMGgraph_AGG 4 1.0 2.6394e+00 1.0 1.10e+07 1.4 1.2e+03 4.2e+04 7.6e+01 0 0 0 0 1 0 0 0 0 1 151
PCGAMGcoarse_AGG 4 1.0 9.4073e+00 1.0 5.30e+08 4.6 2.4e+04 7.9e+04 4.8e+02 1 0 0 0 3 1 0 0 0 4 2378
PCGAMGProl_AGG 4 1.0 2.3864e+00 1.0 0.00e+00 0.0 3.2e+03 4.6e+04 1.1e+02 0 0 0 0 1 0 0 0 0 1 0
PCGAMGPOpt_AGG 4 1.0 3.6797e+00 1.0 2.53e+08 1.2 6.8e+03 6.4e+04 2.1e+02 0 0 0 0 2 0 0 0 0 2 2809
--- Event Stage 2: momentum_x
MatMult 98 1.0 2.9512e+00 1.1 5.31e+08 1.2 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 6 23 98 99 0 8071
MatSolve 147 1.0 3.5473e+00 1.1 7.79e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 33 0 0 0 9840
MatLUFactorNum 49 1.0 4.3144e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 9 19 0 0 0 4482
MatILUFactorSym 1 1.0 8.6374e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatAssemblyBegin 49 1.0 6.6507e+0030.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 7 0 0 0 13 0
MatAssemblyEnd 49 1.0 2.4696e+00 1.8 0.00e+00 0.0 1.9e+02 5.9e+04 8.0e+00 0 0 0 0 0 5 0 2 1 1 0
MatGetRowIJ 1 1.0 6.9141e-06 7.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 1.0 9.7661e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSetUp 98 1.0 3.3892e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSolve 49 1.0 1.3279e+01 1.0 2.32e+09 1.1 9.2e+03 2.4e+05 3.5e+02 1 2 0 1 3 29100 98 99 47 7848
VecDot 98 1.0 5.0782e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 7356
VecDotNorm2 49 1.0 5.2031e-01 2.1 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02 0 0 0 0 1 1 7 0 0 20 14359
VecNorm 98 1.0 7.8282e-01 6.4 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 4772
VecCopy 98 1.0 3.3748e-01 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecSet 295 1.0 6.4768e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecAXPBYCZ 98 1.0 6.4953e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 7 0 0 0 11502
VecWAXPY 98 1.0 5.8485e-01 1.2 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 4 0 0 0 6387
VecAssemblyBegin 98 1.0 1.3877e+00100.0 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 2 1 0 0 0 39 0
VecAssemblyEnd 98 1.0 1.2057e-03 5.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecScatterBegin 98 1.0 6.0991e-02 2.7 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 0 0 98 99 0 0
VecScatterEnd 98 1.0 3.4077e-01 2.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
PCSetUp 98 1.0 4.4126e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 0 9 19 0 0 1 4382
PCSetUpOnBlocks 49 1.0 4.4121e+00 1.2 4.32e+08 1.2 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 9 19 0 0 0 4382
PCApply 147 1.0 3.7470e+00 1.1 7.79e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 33 0 0 0 9316
--- Event Stage 3: momentum_y
MatMult 98 1.0 2.9856e+00 1.1 5.36e+08 1.2 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 6 23 98 99 0 8055
MatSolve 147 1.0 3.5550e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 34 0 0 0 9913
MatLUFactorNum 49 1.0 4.6245e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 9 19 0 0 0 4218
MatILUFactorSym 1 1.0 8.6295e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatAssemblyBegin 49 1.0 4.5507e+00207.0 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 5 0 0 0 13 0
MatAssemblyEnd 49 1.0 2.4393e+00 1.8 0.00e+00 0.0 1.9e+02 6.0e+04 8.0e+00 0 0 0 0 0 5 0 2 1 1 0
MatGetRowIJ 1 1.0 1.1921e-06 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 1.0 9.5730e-03 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSetUp 98 1.0 3.3241e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSolve 49 1.0 1.3589e+01 1.0 2.34e+09 1.1 9.2e+03 2.4e+05 3.5e+02 1 2 0 1 3 31100 98 99 47 7723
VecDot 98 1.0 5.2115e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 7168
VecDotNorm2 49 1.0 5.1932e-01 2.1 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02 0 0 0 0 1 1 7 0 0 20 14386
VecNorm 98 1.0 8.4822e-01 6.8 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 4404
VecCopy 98 1.0 3.1552e-01 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecSet 295 1.0 6.1609e-01 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecAXPBYCZ 98 1.0 6.5669e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 7 0 0 0 11377
VecWAXPY 98 1.0 5.8188e-01 1.2 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 4 0 0 0 6420
VecAssemblyBegin 98 1.0 1.3747e+0099.2 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 2 1 0 0 0 39 0
VecAssemblyEnd 98 1.0 1.2748e-03 6.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecScatterBegin 98 1.0 6.1957e-02 3.0 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 0 0 98 99 0 0
VecScatterEnd 98 1.0 3.4862e-01 2.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
PCSetUp 98 1.0 4.7222e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 0 10 19 0 0 1 4131
PCSetUpOnBlocks 49 1.0 4.7218e+00 1.2 4.36e+08 1.2 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 10 19 0 0 0 4131
PCApply 147 1.0 3.8046e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 34 0 0 0 9262
--- Event Stage 4: momentum_z
MatMult 98 1.0 3.0181e+00 1.2 5.37e+08 1.2 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 6 23 98 99 0 7974
MatSolve 147 1.0 3.5577e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 34 0 0 0 9913
MatLUFactorNum 49 1.0 4.4279e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 9 19 0 0 0 4413
MatILUFactorSym 1 1.0 8.8243e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatAssemblyBegin 49 1.0 5.2133e+00110.2 0.00e+00 0.0 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 6 0 0 0 13 0
MatAssemblyEnd 49 1.0 2.4158e+00 1.8 0.00e+00 0.0 1.9e+02 6.0e+04 8.0e+00 0 0 0 0 0 5 0 2 1 1 0
MatGetRowIJ 1 1.0 1.9073e-06 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
MatGetOrdering 1 1.0 9.3498e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSetUp 98 1.0 3.3165e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
KSPSolve 49 1.0 1.3420e+01 1.0 2.34e+09 1.1 9.2e+03 2.4e+05 3.5e+02 1 2 0 1 3 30100 98 99 47 7826
VecDot 98 1.0 5.2100e-01 2.5 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 7170
VecDotNorm2 49 1.0 5.3533e-01 2.2 1.66e+08 1.1 0.0e+00 0.0e+00 1.5e+02 0 0 0 0 1 1 7 0 0 20 13956
VecNorm 98 1.0 7.6278e-01 5.2 8.31e+07 1.1 0.0e+00 0.0e+00 9.8e+01 0 0 0 0 1 1 4 0 0 13 4897
VecCopy 98 1.0 3.4060e-01 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecSet 295 1.0 6.4979e-01 2.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 0
VecAXPBYCZ 98 1.0 6.4095e-01 1.4 1.66e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 7 0 0 0 11656
VecWAXPY 98 1.0 5.7849e-01 1.1 8.31e+07 1.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 4 0 0 0 6457
VecAssemblyBegin 98 1.0 1.3208e+0054.2 0.00e+00 0.0 0.0e+00 0.0e+00 2.9e+02 0 0 0 0 2 1 0 0 0 39 0
VecAssemblyEnd 98 1.0 1.2825e-03 6.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
VecScatterBegin 98 1.0 6.1303e-02 2.8 0.00e+00 0.0 9.2e+03 2.4e+05 0.0e+00 0 0 0 1 0 0 0 98 99 0 0
VecScatterEnd 98 1.0 3.7855e-01 2.6 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0
PCSetUp 98 1.0 4.5256e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 5.0e+00 0 0 0 0 0 9 19 0 0 1 4317
PCSetUpOnBlocks 49 1.0 4.5250e+00 1.2 4.37e+08 1.2 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 0 9 19 0 0 0 4318
PCApply 147 1.0 3.8119e+00 1.1 7.87e+08 1.1 0.0e+00 0.0e+00 0.0e+00 0 1 0 0 0 8 34 0 0 0 9252
------------------------------------------------------------------------------------------------------------------------
Memory usage is given in bytes:
Object Type Creations Destructions Memory Descendants' Mem.
Reports information only for process 0.
--- Event Stage 0: Main Stage
Matrix 12 56 568064932 0
Krylov Solver 4 13 31728 0
Vector 8 85 129979032 0
Vector Scatter 0 12 12720 0
Index Set 0 16 5102648 0
Preconditioner 0 13 12540 0
Viewer 1 0 0 0
--- Event Stage 1: poisson
Matrix 117 76 368468980 0
Matrix Coarsen 4 4 2480 0
Krylov Solver 10 4 120512 0
Vector 225 175 107847696 0
Vector Scatter 31 22 23320 0
Index Set 81 74 737472 0
Preconditioner 11 4 3456 0
PetscRandom 4 4 2464 0
--- Event Stage 2: momentum_x
Matrix 1 0 0 0
Krylov Solver 1 0 0 0
Vector 10 1 1504 0
Vector Scatter 1 0 0 0
Index Set 5 2 120488 0
Preconditioner 2 0 0 0
--- Event Stage 3: momentum_y
Matrix 1 0 0 0
Krylov Solver 1 0 0 0
Vector 10 1 1504 0
Vector Scatter 1 0 0 0
Index Set 5 2 122104 0
Preconditioner 2 0 0 0
--- Event Stage 4: momentum_z
Matrix 1 0 0 0
Krylov Solver 1 0 0 0
Vector 10 1 1504 0
Vector Scatter 1 0 0 0
Index Set 5 2 122096 0
Preconditioner 2 0 0 0
========================================================================================================================
Average time to get PetscTime(): 1.90735e-07
Average time for MPI_Barrier(): 4.04358e-05
Average time for zero size MPI_Send(): 2.70406e-05
#PETSc Option Table entries:
-log_summary
-poisson_pc_gamg_agg_nsmooths 1
-poisson_pc_type gamg
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure run at: Mon Oct 1 11:36:09 2012
Configure options: --with-mpi-dir=/opt/openmpi-1.5.3/ --with-blas-lapack-dir=/opt/intelcpro-11.1.059/mkl/lib/em64t/ --with-debugging=0 --download-hypre=1 --prefix=/home/wtay/Lib/petsc-3.3-dev_shared_rel --known-mpi-shared=1 --with-shared-libraries
-----------------------------------------
Libraries compiled on Mon Oct 1 11:36:09 2012 on hpc12
Machine characteristics: Linux-2.6.32-279.1.1.el6.x86_64-x86_64-with-centos-6.3-Final
Using PETSc directory: /home/wtay/Codes/petsc-dev
Using PETSc arch: petsc-3.3-dev_shared_rel
-----------------------------------------
Using C compiler: /opt/openmpi-1.5.3/bin/mpicc -fPIC -wd1572 -O3 ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: /opt/openmpi-1.5.3/bin/mpif90 -fPIC -O3 ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/include -I/home/wtay/Codes/petsc-dev/include -I/home/wtay/Codes/petsc-dev/include -I/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/include -I/opt/openmpi-1.5.3/include
-----------------------------------------
Using C linker: /opt/openmpi-1.5.3/bin/mpicc
Using Fortran linker: /opt/openmpi-1.5.3/bin/mpif90
Using libraries: -Wl,-rpath,/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -L/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -lpetsc -lX11 -Wl,-rpath,/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -L/home/wtay/Codes/petsc-dev/petsc-3.3-dev_shared_rel/lib -lHYPRE -Wl,-rpath,/opt/openmpi-1.5.3/lib -L/opt/openmpi-1.5.3/lib -Wl,-rpath,/opt/intelcpro-11.1.059/lib/intel64 -L/opt/intelcpro-11.1.059/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -lmpi_cxx -lstdc++ -lpthread -Wl,-rpath,/opt/intelcpro-11.1.059/mkl/lib/em64t -L/opt/intelcpro-11.1.059/mkl/lib/em64t -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lmpi_f90 -lmpi_f77 -lm -lm -lifport -lifcore -lm -lm -lm -lmpi_cxx -lstdc++ -lmpi_cxx -lstdc++ -ldl -lmpi -lnsl -lutil -limf -lsvml -lipgo -ldecimal -lgcc_s -lirc -lpthread -lirc_s -ldl
-----------------------------------------
More information about the petsc-users
mailing list