0 KSP Residual norm 2.423086291254e-01 1 KSP Residual norm 5.967557103446e-02 2 KSP Residual norm 2.025458752521e-02 3 KSP Residual norm 6.499208979650e-03 4 KSP Residual norm 2.101735164030e-03 5 KSP Residual norm 6.726224388821e-04 6 KSP Residual norm 2.231916073025e-04 7 KSP Residual norm 7.500267826027e-05 8 KSP Residual norm 2.447305974735e-05 9 KSP Residual norm 7.855426265931e-06 10 KSP Residual norm 2.534824436642e-06 11 KSP Residual norm 8.463544315151e-07 12 KSP Residual norm 2.740122317515e-07 13 KSP Residual norm 8.763734369677e-08 14 KSP Residual norm 2.820893232092e-08 15 KSP Residual norm 8.977876094073e-09 16 KSP Residual norm 2.811010861829e-09 17 KSP Residual norm 8.888694518856e-10 KSP Object: 2048 MPI processes type: gmres GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement GMRES: happy breakdown tolerance 1e-30 maximum iterations=200, initial guess is zero tolerances: relative=1e-08, absolute=1e-50, divergence=10000 left preconditioning using PRECONDITIONED norm type for convergence test PC Object: 2048 MPI processes type: ml MG: type is MULTIPLICATIVE, levels=3 cycles=v Cycles per PCApply=1 Using Galerkin computed coarse grid matrices Coarse grid solver -- level ------------------------------- KSP Object: (mg_coarse_) 2048 MPI processes type: preonly maximum iterations=1, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_) 2048 MPI processes type: redundant Redundant preconditioner: First (color=0) of 2048 PCs follows KSP Object: (mg_coarse_redundant_) 1 MPI processes type: preonly maximum iterations=10000, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_redundant_) 1 MPI processes type: lu LU: out-of-place factorization tolerance for zero pivot 2.22045e-14 using diagonal shift on blocks to prevent zero pivot matrix ordering: nd factor fill ratio given 5, needed 4.37616 Factored matrix follows: Matrix Object: 1 MPI processes type: seqaij rows=2053, cols=2053 package used to perform factorization: petsc total: nonzeros=2464807, allocated nonzeros=2464807 total number of mallocs used during MatSetValues calls =0 using I-node routines: found 1567 nodes, limit used is 5 linear system matrix = precond matrix: Matrix Object: 1 MPI processes type: seqaij rows=2053, cols=2053 total: nonzeros=563235, allocated nonzeros=1094249 total number of mallocs used during MatSetValues calls =0 not using I-node routines linear system matrix = precond matrix: Matrix Object: 2048 MPI processes type: mpiaij rows=2053, cols=2053 total: nonzeros=563235, allocated nonzeros=563235 total number of mallocs used during MatSetValues calls =0 not using I-node (on process 0) routines Down solver (pre-smoother) on level 1 ------------------------------- KSP Object: (mg_levels_1_) 2048 MPI processes type: richardson Richardson: damping factor=1 maximum iterations=2 tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using nonzero initial guess using NONE norm type for convergence test PC Object: (mg_levels_1_) 2048 MPI processes type: sor SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1 linear system matrix = precond matrix: Matrix Object: 2048 MPI processes type: mpiaij rows=30118, cols=30118 total: nonzeros=3364139, allocated nonzeros=3364139 total number of mallocs used during MatSetValues calls =0 not using I-node (on process 0) routines Up solver (post-smoother) same as down solver (pre-smoother) Down solver (pre-smoother) on level 2 ------------------------------- KSP Object: (mg_levels_2_) 2048 MPI processes type: richardson Richardson: damping factor=1 maximum iterations=2 tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using nonzero initial guess using NONE norm type for convergence test PC Object: (mg_levels_2_) 2048 MPI processes type: sor SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1 linear system matrix = precond matrix: Matrix Object: 2048 MPI processes type: mpiaij rows=531441, cols=531441 total: nonzeros=12013842, allocated nonzeros=12013842 total number of mallocs used during MatSetValues calls =0 not using I-node (on process 0) routines Up solver (post-smoother) same as down solver (pre-smoother) linear system matrix = precond matrix: Matrix Object: 2048 MPI processes type: mpiaij rows=531441, cols=531441 total: nonzeros=12013842, allocated nonzeros=12013842 total number of mallocs used during MatSetValues calls =0 not using I-node (on process 0) routines --- system solved with PETSc (in 2.159356e+02) --- PETSc error = 4.226628e-09 / 1.955848e-03 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- /ccc/scratch/cont003/gen6654/jolivetp/Cive/ff++/src/mpi/FreeFem++-mpi-PETSc on a arch-linux2-c-opt named curie2958 with 2048 processors, by jolivetp Tue Oct 8 23:28:05 2013 Using Petsc Release Version 3.4.2, Jul, 02, 2013 Max Max/Min Avg Total Time (sec): 2.314e+02 1.00010 2.314e+02 Objects: 1.800e+02 1.00000 1.800e+02 Flops: 1.878e+09 1.00382 1.873e+09 3.836e+12 Flops/sec: 8.118e+06 1.00383 8.096e+06 1.658e+10 MPI Messages: 5.452e+04 1.13438 5.141e+04 1.053e+08 MPI Message Lengths: 1.364e+07 2.96932 1.480e+02 1.559e+10 MPI Reductions: 1.760e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 2.2788e+02 98.5% 3.6510e+12 95.2% 2.311e+07 22.0% 1.351e+02 91.2% 1.750e+02 99.4% 1: MG Apply: 3.4791e+00 1.5% 1.8518e+11 4.8% 8.218e+07 78.0% 1.296e+01 8.8% 0.000e+00 0.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %f - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage MatMult 28 1.0 3.4134e-03 6.7 1.02e+0664.1 5.0e+05 2.4e+02 0.0e+00 0 0 0 1 0 0 0 2 1 0 192737 MatLUFactorSym 1 1.0 1.5230e-01 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 2 0 0 0 0 2 0 MatLUFactorNum 1 1.0 2.1736e+00 1.1 1.78e+09 1.0 0.0e+00 0.0e+00 0.0e+00 1 95 0 0 0 1100 0 0 0 1679213 MatAssemblyBegin 5 1.0 1.7827e-02 3.8 0.00e+00 0.0 0.0e+00 0.0e+00 6.0e+00 0 0 0 0 3 0 0 0 0 3 0 MatAssemblyEnd 5 1.0 1.7449e-01 1.1 0.00e+00 0.0 1.3e+06 7.8e+00 2.4e+01 0 0 1 0 14 0 0 6 0 14 0 MatGetRowIJ 1 1.0 1.3925e-02 4.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetOrdering 1 1.0 4.2281e-02 6.3 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatView 6 1.5 8.4095e-02 1.4 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 2 0 0 0 0 2 0 MatGetRedundant 1 1.0 6.0411e-01 1.3 0.00e+00 0.0 1.3e+07 1.1e+03 4.0e+00 0 0 12 89 2 0 0 54 98 2 0 VecDot 1 1.0 4.6611e-04 6.3 1.62e+03107.8 0.0e+00 0.0e+00 1.0e+00 0 0 0 0 1 0 0 0 0 1 2276 VecMDot 17 1.0 8.0511e-0221.2 2.47e+05107.8 0.0e+00 0.0e+00 1.7e+01 0 0 0 0 10 0 0 0 0 10 2016 VecNorm 21 1.0 2.7842e-02 7.6 3.40e+04101.1 0.0e+00 0.0e+00 2.1e+01 0 0 0 0 12 0 0 0 0 12 802 VecScale 19 1.0 3.1328e-04 5.9 1.54e+04101.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 32231 VecCopy 1 1.0 7.9155e-05 0.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 7 1.0 1.0607e-0335.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 2 1.0 6.1989e-06 6.5 3.24e+03101.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 342927 VecMAXPY 18 1.0 1.2183e-0412.2 2.75e+05101.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 1483109 VecScatterBegin 30 1.0 1.6823e-03 8.8 0.00e+00 0.0 8.4e+05 2.4e+02 0.0e+00 0 0 1 1 0 0 0 4 1 0 0 VecScatterEnd 30 1.0 1.5826e-0224.5 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecNormalize 19 1.0 2.7723e-02 7.7 4.61e+04101.1 0.0e+00 0.0e+00 1.9e+01 0 0 0 0 11 0 0 0 0 11 1093 KSPGMRESOrthog 17 1.0 8.0619e-0220.5 4.95e+05104.4 0.0e+00 0.0e+00 1.7e+01 0 0 0 0 10 0 0 0 0 10 4030 KSPSetUp 5 1.0 9.0568e-0212.3 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 2 0 0 0 0 2 0 KSPSolve 1 1.0 2.1581e+02 1.0 1.88e+09 1.0 1.1e+08 1.5e+02 1.4e+02 93100100100 81 95105455110 82 17775 PCSetUp 1 1.0 2.1235e+02 1.0 1.78e+09 1.0 2.3e+07 6.3e+02 1.1e+02 92 95 21 90 61 93100 98 99 62 17190 PCApply 18 1.0 3.5604e+00 1.0 9.44e+07 1.1 8.2e+07 1.7e+01 0.0e+00 2 5 78 9 0 2 5356 10 0 52012 MGSetup Level 0 1 1.0 1.8164e+02 1.0 1.78e+09 1.0 2.1e+07 6.7e+02 2.7e+01 78 95 20 90 15 80100 91 99 15 20094 MGSetup Level 1 1 1.0 9.0053e-0212.9 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 1 0 0 0 0 1 0 MGSetup Level 2 1 1.0 3.0112e-04 1.6 0.00e+00 0.0 0.0e+00 0.0e+00 2.0e+00 0 0 0 0 1 0 0 0 0 1 0 --- Event Stage 1: MG Apply MatMult 72 1.0 2.5387e-0111.6 8.23e+0549.3 1.7e+06 1.1e+02 0.0e+00 0 0 2 1 0 4 0 2 14 0 2141 MatMultAdd 36 1.0 6.0475e-0114.1 3.05e+0476.9 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 9 0 0 0 0 33 MatSolve 18 1.0 1.0954e-01 1.4 8.87e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 5 0 0 0 3 98 0 0 0 1658356 MatSOR 72 1.0 5.6074e-01 7.2 4.81e+0665.9 5.0e+06 1.1e+02 0.0e+00 0 0 5 4 0 8 2 6 42 0 5274 VecScale 108 1.0 1.5306e-04 2.6 5.78e+04 5.2 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 465251 VecSet 90 1.0 2.3675e-04 2.8 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 36 1.0 1.4853e-04 2.2 3.05e+0476.9 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 136104 VecAYPX 36 1.0 1.1182e-04 8.1 1.52e+0476.9 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 90397 VecScatterBegin 180 1.0 4.9128e-02 1.2 0.00e+00 0.0 8.2e+07 1.7e+01 0.0e+00 0 0 78 9 0 1 0100100 0 0 VecScatterEnd 180 1.0 3.2941e+00 1.3 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 83 0 0 0 0 0 KSPSolve 90 1.0 3.4018e+00 1.3 9.35e+07 1.1 8.1e+07 1.5e+01 0.0e+00 1 5 76 8 0 87100 98 86 0 54268 PCApply 18 1.0 3.0453e+00 1.2 8.87e+07 1.0 7.5e+07 8.0e+00 0.0e+00 1 5 72 4 0 79 98 92 44 0 59649 MGSmooth Level 0 18 1.0 3.0455e+00 1.2 8.87e+07 1.0 7.5e+07 8.0e+00 0.0e+00 1 5 72 4 0 79 98 92 44 0 59646 MGSmooth Level 1 36 1.0 5.4845e-01 9.5 6.92e+0532.5 3.5e+06 6.0e+01 0.0e+00 0 0 3 1 0 7 0 4 16 0 834 MGResid Level 1 18 1.0 1.4357e-0152.4 1.71e+0534.7 1.2e+06 6.0e+01 0.0e+00 0 0 1 0 0 1 0 1 5 0 844 MGInterp Level 1 36 1.0 7.1015e-0121.5 1.44e+0340.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 10 0 0 0 0 2 MGSmooth Level 2 36 1.0 1.6066e-0124.8 4.14e+0694.2 1.5e+06 2.4e+02 0.0e+00 0 0 1 2 0 1 1 2 26 0 15564 MGResid Level 2 18 1.0 6.7300e-0289.1 6.72e+0564.6 5.0e+05 2.4e+02 0.0e+00 0 0 0 1 0 0 0 1 9 0 6426 MGInterp Level 2 36 1.0 1.9239e-0159.2 2.91e+04101.1 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 1 0 0 0 0 99 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Container 1 1 572 0 Matrix 16 16 43079636 0 Vector 59 58 342912 0 Vector Scatter 5 5 5380 0 Index Set 13 13 48296 0 Krylov Solver 5 5 23296 0 Preconditioner 5 5 5048 0 Viewer 2 1 736 0 --- Event Stage 1: MG Apply Vector 74 72 327744 0 ======================================================================================================================== Average time to get PetscTime(): 9.53674e-08 Average time for MPI_Barrier(): 8.55923e-05 Average time for zero size MPI_Send(): 3.78897e-06 #PETSc Option Table entries: -eps 1e-8 -iter 200 -ksp_monitor -ksp_view -log_summary -pc_mg_log -pc_type ml #End of PETSc Option Table entries