[petsc-users] A similar question of "Enquiry regarding log summary results"

Zhenglun (Alan) Wei zhenglun.wei at gmail.com
Fri Oct 5 11:51:15 CDT 2012


Dear folks,
     I hope you're having a nice day.
     I've been reading the thread "Enquiry regarding log summary
results" and tried the same thing with my code by running it with
-log_summary. However, my output looks quite different from Dr. TAY
Wee-beng's. I have attached it here. My code is based on
/src/ksp/ksp/example/tutorial/ex45.c from Dr. Yang and solves the Poisson
equation (twice). "Event Stage 3" and "Event Stage 5" contain only the
"KSPSolve" calls, for example:

     ierr = PetscLogStagePush(stages[2]); CHKERRQ(ierr); // Start Pressure Poisson Equation Solve
     ierr = KSPSolve(ksp,PETSC_NULL,PETSC_NULL); CHKERRQ(ierr);
     ierr = PetscLogStagePop(); CHKERRQ(ierr); // Finish Pressure Poisson Equation Solve

      Therefore, solving the equations takes most of my run time. Within
the solves, VecMDot, MatMult, MatMultAdd and MatMultTranspose take a lot
of time.
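
The claim that the solves dominate can be checked against the "Summary of
Stages" block of the attached log. The following is a minimal C sketch of
that arithmetic. The function and variable names are made up for
illustration, and the three numbers are copied from the attached log (the
stage times are the "Avg" values, the total is the "Time (sec)" Max value,
which equals the Avg in this run).

```c
#include <assert.h>

/* Share of the total wall-clock time spent in one logging stage, in percent.
 * Both arguments are in seconds, as printed in "Summary of Stages". */
double stage_percent(double stage_sec, double total_sec)
{
    assert(total_sec > 0.0);
    return 100.0 * stage_sec / total_sec;
}

/* Values copied from the attached log (96-process run). */
static const double total_time_sec      = 9.287e+03;  /* "Time (sec)" */
static const double pressure_solve_sec  = 4.0469e+03; /* stage 3 */
static const double corrector_solve_sec = 4.4028e+03; /* stage 5 */

/* Combined share of the two KSPSolve stages. */
double solve_stages_percent(void)
{
    return stage_percent(pressure_solve_sec + corrector_solve_sec,
                         total_time_sec);
}
```

With these numbers, stage 3 comes out at about 43.6% and stage 5 at about
47.4%, which agrees with the %Total column of the log, so the two solve
stages together account for roughly 91% of the run.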

------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flops                             --- Global ---  --- Stage ---   Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len Reduct  %T %f %M %L %R  %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 3: Pressure Solve

KSPGMRESOrthog      8643 1.0 2.0476e+0313.1 5.84e+09 1.0 0.0e+00 0.0e+00 8.6e+03 17 13  0  0 17  39 29  0  0 36   270
KSPSetUp               5 1.0 1.0729e-05 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve               5 1.0 4.0481e+03 1.0 2.05e+10 1.0 4.0e+07 1.2e+03 2.4e+04 44 46 47 44 46 100100100100100   477
VecMDot             8643 1.0 2.0424e+0313.5 2.92e+09 1.0 0.0e+00 0.0e+00 8.6e+03 17  7  0  0 17  39 14  0  0 36   135
VecNorm            14503 1.0 8.7184e+02 6.4 8.47e+08 1.0 0.0e+00 0.0e+00 1.5e+04  7  2  0  0 28  15  4  0  0 61    92
VecScale           14503 1.0 3.7836e+00 4.7 4.23e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  2  0  0  0 10598
VecCopy             5767 1.0 2.0257e+00 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet             34670 1.0 5.5035e+00 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY             8741 1.0 2.9888e+00 4.5 5.10e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  3  0  0  0 16172
VecAYPX             2881 1.0 1.7167e+00 7.2 8.41e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4640
VecWAXPY              93 1.0 5.3297e-02 4.8 2.71e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4824
VecMAXPY           14503 1.0 1.5191e+01 3.1 3.42e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0  8  0  0  0   0 17  0  0  0 21342
VecAssemblyBegin       5 1.0 3.7097e+0012.5 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0     0
VecAssemblyEnd         5 1.0 2.9087e-05 5.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin    26022 1.0 1.1726e+01 4.2 0.00e+00 0.0 4.0e+07 1.2e+03 0.0e+00  0  0 47 44  0   0  0100100  0     0
VecScatterEnd      26022 1.0 3.5268e+03 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 19  0  0  0  0  43  0  0  0  0     0
VecNormalize       11524 1.0 8.4214e+02 7.2 1.01e+09 1.0 0.0e+00 0.0e+00 1.2e+04  6  2  0  0 22  15  5  0  0 49   114
MatMult            14498 1.0 3.0545e+0382.7 5.50e+09 1.0 6.4e+06 7.6e+03 0.0e+00  9 12  8 44  0  21 27 16 98  0   169
MatMultAdd          2881 1.0 1.8177e+0218.6 1.09e+09 1.1 3.6e+06 1.5e+01 0.0e+00  1  2  4  0  0   2  5  9  0  0   560
MatMultTranspose    2881 1.0 7.1296e+0274.9 1.09e+09 1.1 3.6e+06 1.5e+01 0.0e+00  5  2  4  0  0  11  5  9  0  0   143
MatSolve           14405 1.0 3.9663e+01 3.3 4.56e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0 10  0  0  0   1 22  0  0  0 10890
MatLUFactorNum         5 1.0 3.9389e-02 1.6 3.05e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  7322
MatILUFactorSym        5 1.0 4.2221e-02 3.2 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetRowIJ            5 1.0 9.7752e-06 5.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         5 1.0 2.9373e-03 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                5 1.0 8.3110e-02 2.0 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  3470
PCSetUpOnBlocks     5762 1.0 9.9790e-02 2.0 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  2890
PCApply             2881 1.0 3.3151e+03 1.0 1.37e+10 1.0 3.8e+07 1.0e+03 1.7e+04 36 31 46 36 33  82 67 97 80 73   390

        BTW, how do I actually read the time that a function takes? For
example, the log shows "2.0424e+0313.5" for VecMDot. Would that be
2.0424*10^313.5 sec?
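
On reading the fused value: it looks like two adjacent columns printed
without a separating space rather than a single number. The table header
lists a Max and a Ratio column under "Time (sec)", and the legend in the
attached log defines Ratio as the ratio of maximum to minimum over all
processors. Under that reading, "2.0424e+0313.5" is a maximum time of
2.0424e+03 sec followed by a max/min ratio of 13.5. Below is a minimal C
sketch of that split. It assumes the time is always printed with a sign
and a two-digit exponent, and the function name split_fused_time_ratio is
hypothetical, not part of PETSc.

```c
#include <assert.h>
#include <ctype.h>
#include <stdlib.h>
#include <string.h>

/* Split a fused -log_summary token such as "2.0424e+0313.5" into the
 * time in seconds (2.0424e+03) and the max/min ratio (13.5).
 * Assumption: the time has the form <mantissa>e<sign><two digits>, so the
 * time field ends exactly two digits after the exponent sign.
 * Returns 0 on success, -1 if the token does not have that shape. */
int split_fused_time_ratio(const char *token, double *time_sec, double *ratio)
{
    assert(token != NULL && time_sec != NULL && ratio != NULL);

    const char *e = strchr(token, 'e');
    if (e == NULL || (e[1] != '+' && e[1] != '-') ||
        !isdigit((unsigned char)e[2]) || !isdigit((unsigned char)e[3])) {
        return -1;
    }

    /* Length of the time field: mantissa plus the four characters "e+NN". */
    size_t time_len = (size_t)(e - token) + 4;
    char buf[32];
    if (time_len >= sizeof buf || token[time_len] == '\0') {
        return -1; /* too long, or nothing is fused after the time */
    }
    memcpy(buf, token, time_len);
    buf[time_len] = '\0';

    *time_sec = strtod(buf, NULL);
    *ratio = strtod(token + time_len, NULL); /* remainder is the ratio */
    return 0;
}
```

Calling split_fused_time_ratio("2.0424e+0313.5", &t, &r) stores about
2042.4 in t and 13.5 in r. The same split applies to the MatMult entry
"3.0545e+0382.7" in the table above.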

thanks,
Alan



-------------- next part --------------
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

./ex45 on a arch-linux2-c-debug named compute-7-12.local with 96 processors, by zlwei Thu Oct  4 18:16:59 2012
Using Petsc Development HG revision: 98bf11863c3be31b7c2af504314a500bc64d88c9  HG Date: Wed Aug 29 13:51:08 2012 -0500

                         Max       Max/Min        Avg      Total 
Time (sec):           9.287e+03      1.00000   9.287e+03
Objects:              2.271e+03      1.04127   2.192e+03
Flops:                4.641e+10      1.08786   4.392e+10  4.216e+12
Flops/sec:            4.998e+06      1.08786   4.729e+06  4.540e+08
MPI Messages:         1.039e+06      1.42613   8.729e+05  8.380e+07
MPI Message Lengths:  1.534e+09      2.02516   1.321e+03  1.107e+11
MPI Reductions:       5.191e+04      1.00174

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total 
 0:      Main Stage: 6.3501e+02   6.8%  4.6734e+08   0.0%  2.344e+04   0.0%  7.888e+01        6.0%  3.200e+01   0.1% 
 1:      DMMG Setup: 3.6959e+00   0.0%  0.0000e+00   0.0%  3.768e+03   0.0%  4.090e-02        0.0%  3.500e+01   0.1% 
 2: Pressure RHS Setup: 7.5716e+01   0.8%  9.2365e+10   2.2%  3.556e+05   0.4%  7.402e+00        0.6%  1.106e+03   2.1% 
 3:  Pressure Solve: 4.0469e+03  43.6%  1.9304e+12  45.8%  3.978e+07  47.5%  5.878e+02       44.5%  2.376e+04  45.8% 
 4: Corrector RHS Setup: 1.2268e+02   1.3%  9.2365e+10   2.2%  3.556e+05   0.4%  7.402e+00        0.6%  1.106e+03   2.1% 
 5: Corrector Solve: 4.4028e+03  47.4%  2.1006e+12  49.8%  4.328e+07  51.7%  6.396e+02       48.4%  2.580e+04  49.7% 

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %f - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flops                             --- Global ---  --- Stage ---   Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len Reduct  %T %f %M %L %R  %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

VecNorm               10 1.0 4.2983e+0037.8 5.84e+05 1.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0 12  0  0 31    13
VecSet                 1 1.0 2.5029e-03 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY               10 1.0 2.1505e-03 3.0 5.84e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0 12  0  0  0 25713
VecScatterBegin       11 1.0 6.8583e-03 6.0 0.00e+00 0.0 5.8e+03 6.9e+03 0.0e+00  0  0  0  0  0   0  0 25  1  0     0
VecScatterEnd         11 1.0 4.6764e+00693.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatMult               10 1.0 3.7633e+00181.3 3.79e+06 1.0 4.4e+03 7.6e+03 0.0e+00  0  0  0  0  0   0 76 19  1  0    95

--- Event Stage 1: DMMG Setup


--- Event Stage 2: Pressure RHS Setup

KSPGMRESOrthog       835 2.0 2.1537e+00 8.0 1.19e+09 3.5 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   1 46  0  0  0 19894
KSPSetUp              25 1.0 5.1173e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.4e+02  0  0  0  0  0   1  0  0  0 12     0
KSPSolve              30 1.0 5.3262e+00 4.9 1.82e+09 2.7 0.0e+00 0.0e+00 1.5e+02  0  2  0  0  0   4 85  0  0 13 14779
VecMDot              835 2.0 1.2693e+00 7.9 5.97e+08 3.5 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   1 23  0  0  0 16877
VecNorm              865 1.9 1.7481e-01 9.8 4.25e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  3  0  0  0 14441
VecScale             865 1.9 2.4498e-02 2.5 2.12e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  1  0  0  0 51524
VecCopy               30 1.0 1.1885e-02 3.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet               250 1.2 9.3647e-02 5.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY               30 1.0 9.2101e-03 4.0 1.58e+06 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 15871
VecMAXPY             865 1.9 1.0598e+00 9.0 6.38e+08 3.3 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0 26  0  0  0 22458
VecAssemblyBegin       5 1.0 6.5455e-01 6.3 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   1  0  0  0  1     0
VecAssemblyEnd         5 1.0 2.6226e-05 3.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize         865 1.9 1.8566e-01 6.0 6.37e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  4  0  0  0 20396
MatMult              835 2.0 1.2689e+00 4.7 2.53e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 15  0  0  0 11213
MatSolve             865 1.9 1.5827e+00 3.7 2.63e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 16  0  0  0  9541
MatLUFactorSym         5 1.0 1.4618e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  1     0
MatLUFactorNum        10 1.0 8.0587e-02 1.9 2.36e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  2  0  0  0 27888
MatILUFactorSym        5 1.0 4.6887e-02 4.6 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatScale               5 1.0 1.9262e-03 2.8 7.91e+05 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 37944
MatAssemblyBegin      65 1.0 4.2835e+00 8.0 0.00e+00 0.0 6.4e+03 1.6e+04 5.0e+01  0  0  0  0  0   3  0  2 17  5     0
MatAssemblyEnd        65 1.0 1.2531e+01 1.1 0.00e+00 0.0 5.8e+04 1.5e+02 1.2e+02  0  0  0  0  0  16  0 16  1 11     0
MatGetRowIJ           10 1.0 1.2584e-03 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetSubMatrice      20 1.0 7.2942e+0027.7 0.00e+00 0.0 1.1e+04 9.1e+03 9.5e+01  0  0  0  0  0   3  0  3 16  9     0
MatGetOrdering        10 1.0 4.9312e-03 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+01  0  0  0  0  0   0  0  0  0  3     0
MatMatMult             5 1.0 2.5914e-02 4.8 7.40e+05 2.4 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  1  2422
MatPtAP                5 1.0 2.2527e+01 1.0 1.85e+08 3.1 7.3e+04 3.3e+03 1.4e+02  0  0  0  0  0  30 12 21 38 12   510
MatPtAPSymbolic        5 1.0 2.1820e+01 1.0 0.00e+00 0.0 6.7e+04 3.5e+03 1.2e+02  0  0  0  0  0  29  0 19 37 11     0
MatPtAPNumeric         5 1.0 1.0236e+00 1.6 1.85e+08 3.1 6.2e+03 7.4e+02 1.0e+01  0  0  0  0  0   1 12  2  1  1 11233
MatGetRedundant        5 1.0 2.4219e+01 1.2 0.00e+00 0.0 1.4e+05 1.1e+03 2.0e+01  0  0  0  0  0  29  0 38 25  2     0
MatGetLocalMat         5 1.0 1.9222e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  1     0
MatGetBrAoCol          5 1.0 1.7008e+0119.2 0.00e+00 0.0 6.6e+03 3.5e+04 1.0e+01  0  0  0  0  0  12  0  2 37  1     0
MatGetSymTrans        10 1.0 9.4867e-03 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                5 1.0 7.3916e+01 1.0 1.92e+09 2.5 3.3e+05 1.8e+03 7.1e+02  1  2  0  1  1  95100 93 96 64  1250
PCApply              865 1.9 1.5850e+00 3.7 2.63e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 16  0  0  0  9527

--- Event Stage 3: Pressure Solve

KSPGMRESOrthog      8643 1.0 2.0476e+0313.1 5.84e+09 1.0 0.0e+00 0.0e+00 8.6e+03 17 13  0  0 17  39 29  0  0 36   270
KSPSetUp               5 1.0 1.0729e-05 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve               5 1.0 4.0481e+03 1.0 2.05e+10 1.0 4.0e+07 1.2e+03 2.4e+04 44 46 47 44 46 100100100100100   477
VecMDot             8643 1.0 2.0424e+0313.5 2.92e+09 1.0 0.0e+00 0.0e+00 8.6e+03 17  7  0  0 17  39 14  0  0 36   135
VecNorm            14503 1.0 8.7184e+02 6.4 8.47e+08 1.0 0.0e+00 0.0e+00 1.5e+04  7  2  0  0 28  15  4  0  0 61    92
VecScale           14503 1.0 3.7836e+00 4.7 4.23e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  2  0  0  0 10598
VecCopy             5767 1.0 2.0257e+00 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet             34670 1.0 5.5035e+00 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY             8741 1.0 2.9888e+00 4.5 5.10e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  3  0  0  0 16172
VecAYPX             2881 1.0 1.7167e+00 7.2 8.41e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4640
VecWAXPY              93 1.0 5.3297e-02 4.8 2.71e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4824
VecMAXPY           14503 1.0 1.5191e+01 3.1 3.42e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0  8  0  0  0   0 17  0  0  0 21342
VecAssemblyBegin       5 1.0 3.7097e+0012.5 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0     0
VecAssemblyEnd         5 1.0 2.9087e-05 5.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin    26022 1.0 1.1726e+01 4.2 0.00e+00 0.0 4.0e+07 1.2e+03 0.0e+00  0  0 47 44  0   0  0100100  0     0
VecScatterEnd      26022 1.0 3.5268e+03 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 19  0  0  0  0  43  0  0  0  0     0
VecNormalize       11524 1.0 8.4214e+02 7.2 1.01e+09 1.0 0.0e+00 0.0e+00 1.2e+04  6  2  0  0 22  15  5  0  0 49   114
MatMult            14498 1.0 3.0545e+0382.7 5.50e+09 1.0 6.4e+06 7.6e+03 0.0e+00  9 12  8 44  0  21 27 16 98  0   169
MatMultAdd          2881 1.0 1.8177e+0218.6 1.09e+09 1.1 3.6e+06 1.5e+01 0.0e+00  1  2  4  0  0   2  5  9  0  0   560
MatMultTranspose    2881 1.0 7.1296e+0274.9 1.09e+09 1.1 3.6e+06 1.5e+01 0.0e+00  5  2  4  0  0  11  5  9  0  0   143
MatSolve           14405 1.0 3.9663e+01 3.3 4.56e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0 10  0  0  0   1 22  0  0  0 10890
MatLUFactorNum         5 1.0 3.9389e-02 1.6 3.05e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  7322
MatILUFactorSym        5 1.0 4.2221e-02 3.2 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetRowIJ            5 1.0 9.7752e-06 5.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         5 1.0 2.9373e-03 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                5 1.0 8.3110e-02 2.0 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  3470
PCSetUpOnBlocks     5762 1.0 9.9790e-02 2.0 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  2890
PCApply             2881 1.0 3.3151e+03 1.0 1.37e+10 1.0 3.8e+07 1.0e+03 1.7e+04 36 31 46 36 33  82 67 97 80 73   390

--- Event Stage 4: Corrector RHS Setup

KSPGMRESOrthog       835 2.0 2.1493e+00 7.9 1.19e+09 3.5 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   1 46  0  0  0 19935
KSPSetUp              25 1.0 5.1442e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 1.4e+02  0  0  0  0  0   0  0  0  0 12     0
KSPSolve              30 1.0 5.3268e+00 4.7 1.82e+09 2.7 0.0e+00 0.0e+00 1.5e+02  0  2  0  0  0   2 85  0  0 13 14777
VecMDot              835 2.0 1.2651e+00 7.8 5.97e+08 3.5 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0 23  0  0  0 16933
VecNorm              865 1.9 1.4096e-01 7.7 4.25e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  3  0  0  0 17909
VecScale             865 1.9 2.4702e-02 2.5 2.12e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  1  0  0  0 51099
VecCopy               30 1.0 1.1083e-02 3.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet               250 1.2 1.6737e-0110.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY               30 1.0 9.1493e-03 3.9 1.58e+06 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 15977
VecMAXPY             865 1.9 1.1516e+00 9.5 6.38e+08 3.3 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0 26  0  0  0 20668
VecAssemblyBegin       5 1.0 6.9304e-01 9.7 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  1     0
VecAssemblyEnd         5 1.0 3.0994e-05 4.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize         865 1.9 1.5194e-01 4.8 6.37e+07 1.8 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  4  0  0  0 24922
MatMult              835 2.0 1.2717e+00 4.5 2.53e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 15  0  0  0 11188
MatSolve             865 1.9 1.5930e+00 3.7 2.63e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 16  0  0  0  9478
MatLUFactorSym         5 1.0 1.3718e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  1     0
MatLUFactorNum        10 1.0 1.5112e-01 3.6 2.36e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  2  0  0  0 14871
MatILUFactorSym        5 1.0 4.7347e-02 4.5 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatScale               5 1.0 2.2573e-03 3.6 7.91e+05 1.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 32378
MatAssemblyBegin      65 1.0 4.3617e+00 8.9 0.00e+00 0.0 6.4e+03 1.6e+04 5.0e+01  0  0  0  0  0   2  0  2 17  5     0
MatAssemblyEnd        65 1.0 9.5838e+00 1.1 0.00e+00 0.0 5.8e+04 1.5e+02 1.2e+02  0  0  0  0  0   7  0 16  1 11     0
MatGetRowIJ           10 1.0 1.9398e-03 2.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetSubMatrice      20 1.0 7.1810e+0026.4 0.00e+00 0.0 1.1e+04 9.1e+03 9.5e+01  0  0  0  0  0   2  0  3 16  9     0
MatGetOrdering        10 1.0 5.8639e-03 1.7 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+01  0  0  0  0  0   0  0  0  0  3     0
MatMatMult             5 1.0 2.5045e-02 4.9 7.40e+05 2.4 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  1  2506
MatPtAP                5 1.0 1.9980e+01 1.0 1.85e+08 3.1 7.3e+04 3.3e+03 1.4e+02  0  0  0  0  0  16 12 21 38 12   575
MatPtAPSymbolic        5 1.0 1.9330e+01 1.0 0.00e+00 0.0 6.7e+04 3.5e+03 1.2e+02  0  0  0  0  0  16  0 19 37 11     0
MatPtAPNumeric         5 1.0 1.0708e+00 1.7 1.85e+08 3.1 6.2e+03 7.4e+02 1.0e+01  0  0  0  0  0   1 12  2  1  1 10737
MatGetRedundant        5 1.0 2.3013e+01 1.1 0.00e+00 0.0 1.4e+05 1.1e+03 2.0e+01  0  0  0  0  0  18  0 38 25  2     0
MatGetLocalMat         5 1.0 1.8890e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  1     0
MatGetBrAoCol          5 1.0 1.7516e+0129.6 0.00e+00 0.0 6.6e+03 3.5e+04 1.0e+01  0  0  0  0  0   7  0  2 37  1     0
MatGetSymTrans        10 1.0 9.4180e-03 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                5 1.0 6.7590e+01 1.0 1.92e+09 2.5 3.3e+05 1.8e+03 7.1e+02  1  2  0  1  1  54100 93 96 64  1367
PCApply              865 1.9 1.5955e+00 3.7 2.63e+08 1.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1 16  0  0  0  9464

--- Event Stage 5: Corrector Solve

KSPGMRESOrthog      9405 1.0 2.2276e+0312.9 6.35e+09 1.0 0.0e+00 0.0e+00 9.4e+03 19 14  0  0 18  39 29  0  0 36   270
KSPSetUp               5 1.0 1.1921e-05 4.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve               5 1.0 4.4038e+03 1.0 2.23e+10 1.0 4.3e+07 1.2e+03 2.6e+04 47 50 52 48 50 100100100100100   477
VecMDot             9405 1.0 2.2221e+0313.3 3.17e+09 1.0 0.0e+00 0.0e+00 9.4e+03 18  7  0  0 18  39 14  0  0 36   135
VecNorm            15782 1.0 9.4888e+02 6.4 9.21e+08 1.0 0.0e+00 0.0e+00 1.6e+04  7  2  0  0 30  15  4  0  0 61    92
VecScale           15782 1.0 4.1324e+00 4.7 4.61e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  2  0  0  0 10559
VecCopy             6275 1.0 2.1976e+00 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet             37727 1.0 5.9847e+00 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY             9512 1.0 3.2965e+00 4.6 5.55e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  3  0  0  0 15955
VecAYPX             3135 1.0 1.8733e+00 7.2 9.15e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4627
VecWAXPY             102 1.0 5.8617e-02 4.8 2.98e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  4811
VecMAXPY           15782 1.0 1.6524e+01 3.1 3.72e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0  8  0  0  0   0 17  0  0  0 21351
VecAssemblyBegin       5 1.0 2.8475e+00 3.9 0.00e+00 0.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0     0
VecAssemblyEnd         5 1.0 4.1962e-05 8.4 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecScatterBegin    28317 1.0 1.3162e+01 4.3 0.00e+00 0.0 4.3e+07 1.2e+03 0.0e+00  0  0 52 48  0   0  0100100  0     0
VecScatterEnd      28317 1.0 3.8321e+03 3.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 21  0  0  0  0  43  0  0  0  0     0
VecNormalize       12540 1.0 9.1638e+02 7.2 1.10e+09 1.0 0.0e+00 0.0e+00 1.3e+04  7  2  0  0 24  15  5  0  0 49   114
MatMult            15777 1.0 3.3229e+0381.6 5.99e+09 1.0 6.9e+06 7.6e+03 0.0e+00 10 13  8 48  0  21 27 16 98  0   169
MatMultAdd          3135 1.0 1.9759e+0218.9 1.19e+09 1.1 3.9e+06 1.5e+01 0.0e+00  1  3  5  0  0   2  5  9  0  0   560
MatMultTranspose    3135 1.0 7.7572e+0277.0 1.19e+09 1.1 3.9e+06 1.5e+01 0.0e+00  5  3  5  0  0  11  5  9  0  0   143
MatSolve           15675 1.0 4.3195e+01 3.2 4.96e+09 1.0 0.0e+00 0.0e+00 0.0e+00  0 11  0  0  0   1 22  0  0  0 10881
MatLUFactorNum         5 1.0 3.9492e-02 1.7 3.05e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  7303
MatILUFactorSym        5 1.0 4.3832e-02 3.3 0.00e+00 0.0 0.0e+00 0.0e+00 5.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetRowIJ            5 1.0 1.0014e-0510.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         5 1.0 1.3327e-02 9.4 0.00e+00 0.0 0.0e+00 0.0e+00 1.0e+01  0  0  0  0  0   0  0  0  0  0     0
PCSetUp                5 1.0 8.3063e-02 1.9 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  3472
PCSetUpOnBlocks     6270 1.0 1.0566e-01 2.0 3.05e+06 1.0 0.0e+00 0.0e+00 1.5e+01  0  0  0  0  0   0  0  0  0  0  2730
PCApply             3135 1.0 3.6073e+03 1.0 1.49e+10 1.0 4.2e+07 1.0e+03 1.9e+04 39 33 50 39 36  82 67 97 80 73   390
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

           Container     0             20        10960     0
       Krylov Solver     0             61       589472     0
              Vector    11           1252    255019784     0
      Vector Scatter     0             94        97384     0
              Matrix     0            160  185546300120     0
    Distributed Mesh     0             22      8606680     0
     Bipartite Graph     0             44        30096     0
           Index Set     0             90      2213200     0
   IS L to G Mapping     0             33      5681412     0
      Preconditioner     0             60        56800     0
              Viewer     1              0            0     0

--- Event Stage 1: DMMG Setup

       Krylov Solver     1              0            0     0
              Vector     5              4         5856     0
      Vector Scatter     4              0            0     0
    Distributed Mesh     2              0            0     0
     Bipartite Graph     4              0            0     0
           Index Set    10             10       264444     0
   IS L to G Mapping     3              0            0     0

--- Event Stage 2: Pressure RHS Setup

           Container    10              0            0     0
       Krylov Solver    30              0            0     0
              Vector   385             55      2416680     0
      Vector Scatter    45              0            0     0
              Matrix   105             30     29966460     0
    Distributed Mesh    10              0            0     0
     Bipartite Graph    20              0            0     0
           Index Set   155            125      2631340     0
   IS L to G Mapping    15              0            0     0
      Preconditioner    30              0            0     0

--- Event Stage 3: Pressure Solve

              Vector   290              0            0     0
              Matrix     5              0            0     0
           Index Set    15              0            0     0

--- Event Stage 4: Corrector RHS Setup

           Container    10              0            0     0
       Krylov Solver    30              0            0     0
              Vector   385             55      2416680     0
      Vector Scatter    45              0            0     0
              Matrix   105             30     29966460     0
    Distributed Mesh    10              0            0     0
     Bipartite Graph    20              0            0     0
           Index Set   155            125      2631340     0
   IS L to G Mapping    15              0            0     0
      Preconditioner    30              0            0     0

--- Event Stage 5: Corrector Solve

              Vector   290              0            0     0
              Matrix     5              0            0     0
           Index Set    15              0            0     0
========================================================================================================================
Average time to get PetscTime(): 2.14577e-07
Average time for MPI_Barrier(): 0.000852394
Average time for zero size MPI_Send(): 0.00120548
#PETSc Option Table entries:
-ksp_rtol 1.0e-7
-ksp_type fgmres
-log_summary
-mg_levels_ksp_max_it 1
-mg_levels_ksp_type gmres
-mg_levels_pc_type bjacobi
-pc_type exotic
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure run at: Tue Sep 18 16:22:39 2012
Configure options: --with-debugging=0 --with-cc=gcc --with-fc=gfortran --download-f-blas-lapack --download-mpich
-----------------------------------------
Libraries compiled on Tue Sep 18 16:22:39 2012 on firefox.bioinfo.ittc.ku.edu 
Machine characteristics: Linux-2.6.18-92.1.13.el5-x86_64-with-redhat-5.2-Final
Using PETSc directory: /bio/work1/zlwei/PETSc/petsc-dev
Using PETSc arch: arch-linux2-c-debug
-----------------------------------------

Using C compiler: /bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/bin/mpicc  -Wall -Wwrite-strings -Wno-strict-aliasing -Wno-unknown-pragmas -O  ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: /bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/bin/mpif90   -Wall -Wno-unused-variable -O  ${FOPTFLAGS} ${FFLAGS} 
-----------------------------------------

Using include paths: -I/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/include -I/bio/work1/zlwei/PETSc/petsc-dev/include -I/bio/work1/zlwei/PETSc/petsc-dev/include -I/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/include
-----------------------------------------

Using C linker: /bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/bin/mpicc
Using Fortran linker: /bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/bin/mpif90
Using libraries: -Wl,-rpath,/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/lib -L/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/lib -lpetsc -lX11 -lpthread -Wl,-rpath,/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/lib -L/bio/work1/zlwei/PETSc/petsc-dev/arch-linux2-c-debug/lib -lflapack -lfblas -lm -L/usr/lib/gcc/x86_64-redhat-linux/4.1.2 -lmpichf90 -lgfortran -lm -lm -ldl -lmpich -lopa -lmpl -lrt -lgcc_s -ldl 
-----------------------------------------


