************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

      ##########################################################
      #                                                        #
      #                       WARNING!!!                       #
      #                                                        #
      #   This code was compiled with a debugging option.      #
      #   To get timing results run ./configure                #
      #   using --with-debugging=no, the performance will      #
      #   be generally two or three times faster.              #
      #                                                        #
      ##########################################################

./topopt on a arch-linux-c-debug named CTA-5CG8523WML with 1 processor, by coteant Thu Oct 22 15:50:31 2020
Using Petsc Release Version 3.12.1, unknown

                         Max       Max/Min     Avg       Total
Time (sec):           1.444e+02     1.000   1.444e+02
Objects:              8.597e+03     1.000   8.597e+03
Flop:                 7.407e+10     1.000   7.407e+10  7.407e+10
Flop/sec:             5.129e+08     1.000   5.129e+08  5.129e+08
Memory:               6.140e+07     1.000   6.140e+07  6.140e+07
MPI Messages:         0.000e+00     0.000   0.000e+00  0.000e+00
MPI Message Lengths:  0.000e+00     0.000   0.000e+00  0.000e+00
MPI Reductions:       0.000e+00     0.000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flop
                            and VecAXPY() for complex vectors of length N --> 8N flop

Summary of Stages:   ----- Time ------  ----- Flop ------  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total    Count   %Total     Avg         %Total    Count   %Total
 0:      Main Stage: 5.4196e+01  37.5%  3.8910e+10  52.5%  0.000e+00   0.0%  0.000e+00        0.0%  0.000e+00   0.0%
 1:       Stiff_Adj: 9.0193e+01  62.5%  3.5160e+10  47.5%  0.000e+00   0.0%  0.000e+00        0.0%  0.000e+00   0.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flop: Max - maximum over all processors
                  Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   AvgLen: average message length (bytes)
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %F - percent flop in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flop over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                Count      Time (sec)     Flop                              --- Global ---  --- Stage ----  Total
                   Max Ratio  Max     Ratio   Max  Ratio  Mess   AvgLen  Reduct  %T %F %M %L %R  %T %F %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

DMCoarsen              3 1.0 3.0529e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
DMCreateInterp         3 1.0 5.9271e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
DMCreateInject         3 1.0 1.0066e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
DMCreateMat            2 1.0 8.4187e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecView               13 1.0 2.6295e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecMax                 2 1.0 3.8147e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecDot             32768 1.0 1.2418e+00 1.0 1.88e+09 1.0 0.0e+00 0.0e+00 0.0e+00  1  3  0  0  0   2  5  0  0  0  1510
VecMDot             1207 1.0 2.2223e-02 1.0 4.02e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1811
VecNorm             1394 1.0 1.2946e-02 1.0 1.90e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1467
VecScale            1389 1.0 5.7988e-03 1.0 9.33e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1609
VecCopy              208 1.0 2.1703e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecSet              8839 1.0 3.7422e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAXPY              264 1.0 2.6989e-03 1.0 5.74e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  2126
VecAYPX               75 1.0 2.6047e-03 1.0 8.35e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   321
VecAXPBYCZ             2 1.0 5.4121e-05 1.0 6.55e+04 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1211
VecWAXPY               1 1.0 8.0109e-05 1.0 2.86e+04 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   357
VecMAXPY            1388 1.0 3.8049e-02 1.0 5.52e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1451
VecAssemblyBegin    8221 1.0 1.0631e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecAssemblyEnd      8221 1.0 9.3167e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecPointwiseMult       6 1.0 5.9056e-04 1.0 1.72e+05 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   291
VecScatterBegin       38 1.0 6.7425e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
VecNormalize        1357 1.0 2.2589e-02 1.0 2.53e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1120
MatMult             9553 1.0 3.2824e+01 1.0 3.54e+10 1.0 0.0e+00 0.0e+00 0.0e+00 23 48  0  0  0  61 91  0  0  0  1079
MatMultAdd            75 1.0 8.2624e-03 1.0 5.32e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   644
MatMultTranspose      75 1.0 8.2376e-03 1.0 5.32e+06 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   646
MatSOR              1357 1.0 1.4291e+00 1.0 1.21e+09 1.0 0.0e+00 0.0e+00 0.0e+00  1  2  0  0  0   3  3  0  0  0   848
MatConvert             1 1.0 3.3350e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatScale               6 1.0 5.3896e-02 1.0 2.52e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0   467
MatResidual           75 1.0 1.1283e-01 1.0 1.20e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1068
MatAssemblyBegin      34 1.0 1.2445e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatAssemblyEnd        34 1.0 8.4243e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatZeroEntries         6 1.0 5.4941e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatTranspose           1 1.0 6.6626e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatPtAP               18 1.0 1.1118e+00 1.0 1.41e+08 1.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0   2  0  0  0  0   127
MatPtAPSymbolic        3 1.0 6.6872e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatPtAPNumeric        18 1.0 1.0448e+00 1.0 1.41e+08 1.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0   2  0  0  0  0   135
MatGetSymTrans         3 1.0 4.3201e-04 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSetUp              15 1.0 4.9961e-03 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
KSPSolve               6 1.0 3.0861e+00 1.0 2.69e+09 1.0 0.0e+00 0.0e+00 0.0e+00  2  4  0  0  0   6  7  0  0  0   871
KSPGMRESOrthog      1207 1.0 6.0579e-02 1.0 8.05e+07 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0  1329
PCSetUp               21 1.0 1.1161e+00 1.0 1.41e+08 1.0 0.0e+00 0.0e+00 0.0e+00  1  0  0  0  0   2  0  0  0  0   127
PCApply               25 1.0 2.7163e+00 1.0 2.54e+09 1.0 0.0e+00 0.0e+00 0.0e+00  2  3  0  0  0   5  7  0  0  0   935

--- Event Stage 1: Stiff_Adj

VecSet             16384 1.0 5.9468e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  0  0  0  0     0
VecAXPY             8192 1.0 1.7947e-01 1.0 4.69e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  1  0  0  0   0  1  0  0  0  2612
VecPointwiseMult    8192 1.0 6.3374e-01 1.0 2.34e+08 1.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   1  1  0  0  0   370
MatScale            8192 1.0 7.2235e+01 1.0 3.43e+10 1.0 0.0e+00 0.0e+00 0.0e+00 50 46  0  0  0  80 98  0  0  0   475
MatAssemblyBegin    8192 1.0 1.4532e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatAssemblyEnd      8192 1.0 1.4099e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  0  0  0  0  0   0  0  0  0  0     0
MatZeroEntries      8192 1.0 5.3056e+00 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00  4  0  0  0  0   6  0  0  0  0     0
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type          Creations   Destructions     Memory  Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

    Distributed Mesh    14              4        21664     0.
           Index Set    39             39       480856     0.
   IS L to G Mapping    14              4       144520     0.
   Star Forest Graph    28              8         7520     0.
     Discrete System    14              4         3776     0.
              Vector   240            221     21011616     0.
         Vec Scatter    20              9         6912     0.
              Matrix    16             15     29023160     0.
       Krylov Solver     6              6       199216     0.
      Preconditioner     6              6         6016     0.
              Viewer     8              7         5880     0.

--- Event Stage 1: Stiff_Adj

              Vector  8192           8192   1887830016     0.
========================================================================================================================
Average time to get PetscTime(): 9.29832e-07
#PETSc Option Table entries:
-Emax 68900.0
-info mat
-log_view
-maxItr 1
-maxStress 0.1
-nreg 4
-nx 33
-ny 17
-nz 17
-useFD 0
-volfrac 0.5
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4
Configure options: --with-blas-lib=/usr/local/lib/libblas.a --with-lapack-lib=/usr/local/lib/liblapack.a
-----------------------------------------
Libraries compiled on 2019-11-04 13:59:43 on CTA-5CG8523WML
Machine characteristics: Linux-4.4.0-17763-Microsoft-x86_64-with-debian-10.1
Using PETSc directory: /home/coteant/programs/petsc
Using PETSc arch: arch-linux-c-debug
-----------------------------------------

Using C compiler: mpicc -fPIC -Wall -Wwrite-strings -Wno-strict-aliasing -Wno-unknown-pragmas -fstack-protector -fvisibility=hidden -g3
Using Fortran compiler: mpif90 -fPIC -Wall -ffree-line-length-0 -Wno-unused-dummy-argument -g
-----------------------------------------

Using include paths: -I/home/coteant/programs/petsc/include -I/home/coteant/programs/petsc/arch-linux-c-debug/include
-----------------------------------------

Using C linker: mpicc
Using Fortran linker: mpif90
Using libraries: -Wl,-rpath,/home/coteant/programs/petsc/arch-linux-c-debug/lib -L/home/coteant/programs/petsc/arch-linux-c-debug/lib -lpetsc -Wl,-rpath,/usr/local/lib -L/usr/local/lib -Wl,-rpath,/usr/lib/x86_64-linux-gnu -L/usr/lib/x86_64-linux-gnu -Wl,-rpath,/usr/lib/gcc/x86_64-linux-gnu/8 -L/usr/lib/gcc/x86_64-linux-gnu/8 -Wl,-rpath,/lib/x86_64-linux-gnu -L/lib/x86_64-linux-gnu -llapack -lblas -lpthread -lm -lstdc++ -ldl -lmpichfort -lmpich -lgfortran -lm -lgfortran -lm -lgcc_s -lquadmath -lstdc++ -ldl
-----------------------------------------