--- PETSc preconditioner built (in 2.553612e+02)
  0 KSP Residual norm 2.368804472986e-01
  1 KSP Residual norm 5.676430019132e-02
  2 KSP Residual norm 1.898005876002e-02
  3 KSP Residual norm 6.193922902926e-03
  4 KSP Residual norm 2.008448794493e-03
  5 KSP Residual norm 6.390465670228e-04
  6 KSP Residual norm 2.157709394389e-04
  7 KSP Residual norm 7.295973819979e-05
  8 KSP Residual norm 2.358343271482e-05
  9 KSP Residual norm 7.489696222067e-06
 10 KSP Residual norm 2.390946857593e-06
 11 KSP Residual norm 8.068086385141e-07
 12 KSP Residual norm 2.706607789750e-07
 13 KSP Residual norm 8.636910863397e-08
 14 KSP Residual norm 2.761981175858e-08
 15 KSP Residual norm 8.755459874599e-09
 16 KSP Residual norm 2.708848598334e-09
 17 KSP Residual norm 8.968748881930e-10
KSP Object: 2048 MPI processes
  type: gmres
    GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement
    GMRES: happy breakdown tolerance 1e-30
  maximum iterations=200, initial guess is zero
  tolerances:  relative=1e-08, absolute=1e-50, divergence=10000
  left preconditioning
  using PRECONDITIONED norm type for convergence test
PC Object: 2048 MPI processes
  type: ml
    MG: type is MULTIPLICATIVE, levels=3 cycles=v
      Cycles per PCApply=1
      Using Galerkin computed coarse grid matrices
  Coarse grid solver -- level -------------------------------
    KSP Object:    (mg_coarse_)     2048 MPI processes
      type: preonly
      maximum iterations=1, initial guess is zero
      tolerances:  relative=1e-05, absolute=1e-50, divergence=10000
      left preconditioning
      using NONE norm type for convergence test
    PC Object:    (mg_coarse_)     2048 MPI processes
      type: redundant
        Redundant preconditioner: First (color=0) of 2048 PCs follows
        KSP Object:        (mg_coarse_redundant_)         1 MPI processes
          type: preonly
          maximum iterations=10000, initial guess is zero
          tolerances:  relative=1e-05, absolute=1e-50, divergence=10000
          left preconditioning
          using NONE norm type for convergence test
        PC Object:        (mg_coarse_redundant_)         1 MPI processes
          type: lu
            LU: out-of-place factorization
            tolerance for zero pivot 2.22045e-14
            using diagonal shift on blocks to prevent zero pivot
            matrix ordering: nd
            factor fill ratio given 5, needed 4.38507
              Factored matrix follows:
                Matrix Object:                 1 MPI processes
                  type: seqaij
                  rows=2055, cols=2055
                  package used to perform factorization: petsc
                  total: nonzeros=2476965, allocated nonzeros=2476965
                  total number of mallocs used during MatSetValues calls =0
                    using I-node routines: found 1644 nodes, limit used is 5
          linear system matrix = precond matrix:
          Matrix Object:           1 MPI processes
            type: seqaij
            rows=2055, cols=2055
            total: nonzeros=564863, allocated nonzeros=1085040
            total number of mallocs used during MatSetValues calls =0
              not using I-node routines
      linear system matrix = precond matrix:
      Matrix Object:       2048 MPI processes
        type: mpiaij
        rows=2055, cols=2055
        total: nonzeros=564863, allocated nonzeros=564863
        total number of mallocs used during MatSetValues calls =0
          not using I-node (on process 0) routines
  Down solver (pre-smoother) on level 1 -------------------------------
    KSP Object:    (mg_levels_1_)     2048 MPI processes
      type: richardson
        Richardson: damping factor=1
      maximum iterations=2
      tolerances:  relative=1e-05, absolute=1e-50, divergence=10000
      left preconditioning
      using nonzero initial guess
      using NONE norm type for convergence test
    PC Object:    (mg_levels_1_)     2048 MPI processes
      type: sor
        SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1
      linear system matrix = precond matrix:
      Matrix Object:       2048 MPI processes
        type: mpiaij
        rows=30194, cols=30194
        total: nonzeros=3368448, allocated nonzeros=3368448
        total number of mallocs used during MatSetValues calls =0
          not using I-node (on process 0) routines
  Up solver (post-smoother) same as down solver (pre-smoother)
  Down solver (pre-smoother) on level 2 -------------------------------
    KSP Object:    (mg_levels_2_)     2048 MPI processes
      type: richardson
        Richardson: damping factor=1
      maximum iterations=2
      tolerances:  relative=1e-05, absolute=1e-50, divergence=10000
      left preconditioning
      using nonzero initial guess
      using NONE norm type for convergence test
    PC Object:    (mg_levels_2_)     2048 MPI processes
      type: sor
        SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1
      linear system matrix = precond matrix:
      Matrix Object:       2048 MPI processes
        type: mpiaij
        rows=531441, cols=531441
        total: nonzeros=12476324, allocated nonzeros=12476324
        total number of mallocs used during MatSetValues calls =0
          not using I-node (on process 0) routines
  Up solver (post-smoother) same as down solver (pre-smoother)
  linear system matrix = precond matrix:
  Matrix Object:   2048 MPI processes
    type: mpiaij
    rows=531441, cols=531441
    total: nonzeros=12476324, allocated nonzeros=12476324
    total number of mallocs used during MatSetValues calls =0
      not using I-node (on process 0) routines
--- system solved with PETSc (in 1.009018e+01)
--- PETSc error = 7.023397e-06 / 1.955848e-03
--- number of dof: 531441.0, on average, number of neighbors: 22.9, h: 3.5e-02
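For reference, the solver configuration reported by -ksp_view above (GMRES with restart 30 and rtol 1e-8, preconditioned by ML's three-level smoothed-aggregation V-cycle with Richardson/SOR smoothers and a redundant LU coarse solve, all of which PETSc sets up by default under -pc_type ml) can also be assembled programmatically. Below is a minimal sketch, not the FreeFem++ driver used for this run: it assumes a recent PETSc built with ML (--download-ml), substitutes a toy 1-D Laplacian for the 531441-dof system, and uses the two-argument KSPSetOperators signature (the 3.4.2 release shown in the log took an extra MatStructure flag).

/* Minimal sketch: GMRES + ML preconditioner, matching the -ksp_view
 * configuration above. The 1-D Laplacian is a stand-in for the
 * FreeFem++ matrix; tolerances mirror the -eps/-iter application
 * flags from the option table. Assumes PETSc built with ML. */
#include <petscksp.h>

int main(int argc, char **argv)
{
  Mat      A;
  Vec      x, b;
  KSP      ksp;
  PC       pc;
  PetscInt i, n = 100, Istart, Iend;

  PetscInitialize(&argc, &argv, NULL, NULL);

  /* Toy 1-D Laplacian, distributed across the communicator. */
  MatCreate(PETSC_COMM_WORLD, &A);
  MatSetSizes(A, PETSC_DECIDE, PETSC_DECIDE, n, n);
  MatSetFromOptions(A);
  MatSetUp(A);
  MatGetOwnershipRange(A, &Istart, &Iend);
  for (i = Istart; i < Iend; ++i) {
    if (i > 0)   MatSetValue(A, i, i - 1, -1.0, INSERT_VALUES);
    if (i < n-1) MatSetValue(A, i, i + 1, -1.0, INSERT_VALUES);
    MatSetValue(A, i, i, 2.0, INSERT_VALUES);
  }
  MatAssemblyBegin(A, MAT_FINAL_ASSEMBLY);
  MatAssemblyEnd(A, MAT_FINAL_ASSEMBLY);

  MatCreateVecs(A, &x, &b);
  VecSet(b, 1.0);

  KSPCreate(PETSC_COMM_WORLD, &ksp);
  KSPSetOperators(ksp, A, A);
  KSPSetType(ksp, KSPGMRES);
  /* rtol=1e-8, abstol=1e-50, dtol=10000, maxits=200, as in the log. */
  KSPSetTolerances(ksp, 1e-8, 1e-50, 1e4, 200);
  KSPGetPC(ksp, &pc);
  PCSetType(pc, PCML);      /* smoothed-aggregation AMG via ML */
  KSPSetFromOptions(ksp);   /* honors -ksp_monitor, -ksp_view, ... */
  KSPSolve(ksp, b, x);

  KSPDestroy(&ksp); MatDestroy(&A); VecDestroy(&x); VecDestroy(&b);
  PetscFinalize();
  return 0;
}

Launched under MPI with -ksp_monitor -ksp_view on the command line, this sketch produces output of the same shape as the monitor and view sections above.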
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/ccc/scratch/cont003/gen6654/jolivetp/Cive/ff++/src/mpi/FreeFem++-mpi-PETSc on a arch-linux2-c-opt named curie2688 with 2048 processors, by jolivetp Wed Oct  9 14:36:04 2013
Using Petsc Release Version 3.4.2, Jul, 02, 2013

                         Max       Max/Min        Avg      Total
Time (sec):           2.814e+02      1.00027   2.813e+02
Objects:              1.820e+02      1.00000   1.820e+02
Flops:                1.899e+09      1.00378   1.894e+09  3.880e+12
Flops/sec:            6.751e+06      1.00381   6.734e+06  1.379e+10
MPI Messages:         5.451e+04      1.13403   5.142e+04  1.053e+08
MPI Message Lengths:  1.365e+07      2.92421   1.485e+02  1.564e+10
MPI Reductions:       1.780e+02      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 2.7195e+02  96.7%  3.6934e+12  95.2%  2.312e+07  22.0%  1.354e+02       91.2%  1.770e+02  99.4%
 1:        MG Apply: 9.3823e+00   3.3%  1.8619e+11   4.8%  8.219e+07  78.0%  1.304e+01        8.8%  0.000e+00   0.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %f - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flops                             --- Global ---  --- Stage ---   Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   Avg len Reduct  %T %f %M %L %R  %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

MatMult               28 1.0 8.2036e-02113.7 1.02e+0664.1 5.0e+05 2.4e+02 0.0e+00  0  0  0  1  0   0  0  2  1  0  8335
MatLUFactorSym         1 1.0 3.0419e-01 2.2 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00  0  0  0  0  2   0  0  0  0  2     0
MatLUFactorNum         1 1.0 2.6391e+00 1.3 1.80e+09 1.0 0.0e+00 0.0e+00 0.0e+00  1 95  0  0  0   1100  0  0  0 1399068
MatAssemblyBegin       5 1.0 2.7188e-02 3.4 0.00e+00 0.0 0.0e+00 0.0e+00 6.0e+00  0  0  0  0  3   0  0  0  0  3     0
MatAssemblyEnd         5 1.0 1.7389e-01 1.1 0.00e+00 0.0 1.3e+06 7.8e+00 2.4e+01  0  0  1  0 13   0  0  6  0 14     0
MatGetRowIJ            1 1.0 1.3818e-02 4.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatGetOrdering         1 1.0 3.6034e-02 5.5 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  1   0  0  0  0  1     0
MatView                6 1.5 9.9307e-02 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00  0  0  0  0  2   0  0  0  0  2     0
MatGetRedundant        1 1.0 9.1782e-01 1.2 0.00e+00 0.0 1.3e+07 1.1e+03 4.0e+00  0  0 12 89  2   0  0 54 98  2     0
VecDot                 1 1.0 5.3692e-04 8.3 1.62e+03107.8 0.0e+00 0.0e+00 1.0e+00  0  0  0  0  1   0  0  0  0  1  1976
VecMDot               17 1.0 1.2540e+0014.6 2.47e+05107.8 0.0e+00 0.0e+00 1.7e+01  0  0  0  0 10   0  0  0  0 10   129
VecNorm               21 1.0 4.2841e-01 2.3 3.40e+04101.1 0.0e+00 0.0e+00 2.1e+01  0  0  0  0 12   0  0  0  0 12    52
VecScale              19 1.0 2.7823e-04 8.3 1.54e+04101.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 36291
VecCopy                1 1.0 3.6001e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet                 7 1.0 9.4414e-0430.7 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY                2 1.0 7.1526e-06 7.5 3.24e+03101.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 297203
VecMAXPY              18 1.0 1.6499e-0421.0 2.75e+05101.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 1095186
VecScatterBegin       30 1.0 1.5733e-03 9.2 0.00e+00 0.0 8.4e+05 2.4e+02 0.0e+00  0  0  1  1  0   0  0  4  1  0     0
VecScatterEnd         30 1.0 9.7439e-0262.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize          19 1.0 4.2821e-01 2.3 4.61e+04101.1 0.0e+00 0.0e+00 1.9e+01  0  0  0  0 11   0  0  0  0 11    71
KSPGMRESOrthog        17 1.0 1.2541e+0014.6 4.95e+05104.4 0.0e+00 0.0e+00 1.7e+01  0  0  0  0 10   0  0  0  0 10   259
KSPSetUp               5 1.0 7.0540e-02 4.7 0.00e+00 0.0 0.0e+00 0.0e+00 6.0e+00  0  0  0  0  3   0  0  0  0  3     0
KSPSolve               1 1.0 9.9855e+00 1.0 9.60e+07 1.1 8.3e+07 1.8e+01 3.5e+01  4  5 79 10 20   4  5358 10 20 18725
PCSetUp                1 1.0 2.5581e+02 1.0 1.80e+09 1.0 2.3e+07 6.3e+02 1.1e+02 91 95 21 90 61  94100 98 99 61 14435
PCApply               18 1.0 9.5934e+00 1.1 9.48e+07 1.1 8.2e+07 1.7e+01 0.0e+00  3  5 78  9  0   3  5356 10  0 19409
MGSetup Level 0        1 1.0 2.2033e+02 1.0 1.80e+09 1.0 2.1e+07 6.7e+02 2.7e+01 78 95 20 90 15  81100 91 99 15 16758
MGSetup Level 1        1 1.0 3.4892e-02 2.9 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  1   0  0  0  0  1     0
MGSetup Level 2        1 1.0 3.1304e-04 1.5 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00  0  0  0  0  1   0  0  0  0  1     0

--- Event Stage 1: MG Apply

MatMult               72 1.0 8.5709e-0121.7 8.23e+0549.4 1.7e+06 1.1e+02 0.0e+00  0  0  2  1  0   2  0  2 14  0   654
MatMultAdd            36 1.0 4.4831e+00 5.6 3.05e+0476.9 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0  27  0  0  0  0     5
MatSolve              18 1.0 1.2495e-01 1.5 8.91e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  5  0  0  0   1 98  0  0  0 1460975
MatSOR                72 1.0 4.1496e+00 6.4 4.81e+0666.0 5.1e+06 1.1e+02 0.0e+00  1  0  5  4  0  22  2  6 42  0   737
VecScale             108 1.0 2.2933e-0336.0 5.78e+04 4.3 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 31375
VecSet                90 1.0 1.8263e-04 2.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY               36 1.0 1.0910e-0315.6 3.05e+0476.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 18532
VecAYPX               36 1.0 2.0123e-04 8.4 1.52e+0476.9 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0 50239
VecScatterBegin      180 1.0 7.7095e-02 1.9 0.00e+00 0.0 8.2e+07 1.7e+01 0.0e+00  0  0 78  9  0   0  0100100  0     0
VecScatterEnd        180 1.0 8.2083e+00 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  2  0  0  0  0  68  0  0  0  0     0
KSPSolve              90 1.0 8.4090e+00 1.9 9.39e+07 1.1 8.1e+07 1.5e+01 0.0e+00  2  5 76  8  0  70100 98 86  0 22072
PCApply               18 1.0 6.0672e+00 1.9 8.91e+07 1.0 7.5e+07 8.0e+00 0.0e+00  2  5 72  4  0  48 98 92 44  0 30087
MGSmooth Level 0      18 1.0 6.0675e+00 1.9 8.91e+07 1.0 7.5e+07 8.0e+00 0.0e+00  2  5 72  4  0  48 98 92 44  0 30086
MGSmooth Level 1      36 1.0 3.7234e+00 6.8 6.91e+0531.2 3.5e+06 6.0e+01 0.0e+00  1  0  3  1  0  19  0  4 16  0   123
MGResid Level 1       18 1.0 4.2995e-0160.9 1.71e+0535.0 1.2e+06 6.0e+01 0.0e+00  0  0  1  0  0   1  0  1  5  0   282
MGInterp Level 1      36 1.0 4.5696e+00 7.5 1.37e+0338.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0  26  0  0  0  0     0
MGSmooth Level 2      36 1.0 1.1775e+0017.5 4.14e+0694.2 1.5e+06 2.4e+02 0.0e+00  0  0  1  2  0   3  1  2 26  0  2208
MGResid Level 2       18 1.0 9.2353e-02112.2 6.72e+0564.6 5.0e+05 2.4e+02 0.0e+00  0  0  0  1  0   0  0  1  9  0  4863
MGInterp Level 2      36 1.0 1.0967e+0056.4 2.91e+04101.1 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   3  0  0  0  0    17
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

           Container     1              1          572     0
              Matrix    16             16     43114768     0
   Matrix Null Space     1              1          620     0
              Vector    60             60       357528     0
      Vector Scatter     5              5         5380     0
           Index Set    13             13        48276     0
       Krylov Solver     5              5        23296     0
      Preconditioner     5              5         5048     0
              Viewer     2              1          736     0

--- Event Stage 1: MG Apply

              Vector    74             72       327744     0
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 0.000338984
Average time for zero size MPI_Send(): 3.28606e-06
#PETSc Option Table entries:
-eps 1e-8
-iter 200
-ksp_monitor
-ksp_view
-log_summary
-pc_mg_log
-pc_type ml
#End of PETSc Option Table entries
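The phase summary above notes that profiling stages are delimited with PetscLogStagePush() and PetscLogStagePop(); the "MG Apply" stage in this run was registered automatically by -pc_mg_log. Below is a minimal sketch of a user-defined stage; the stage name "My Solve" is illustrative, not from this run. Run with -log_summary (-log_view in recent PETSc releases) to get a per-stage breakdown like the one above.

/* Minimal sketch: a user-defined logging stage. Events executed
 * between the push and the pop are accounted to this stage in the
 * performance summary, exactly as "MG Apply" is above. */
#include <petscsys.h>

int main(int argc, char **argv)
{
  PetscLogStage stage;

  PetscInitialize(&argc, &argv, NULL, NULL);
  PetscLogStageRegister("My Solve", &stage); /* own row in the summary */
  PetscLogStagePush(stage);
  /* ... work to be profiled separately (e.g., a KSPSolve) ... */
  PetscLogStagePop();
  PetscFinalize();
  return 0;
}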