450371-logsummary.txt
************************************************************************************************************************
***             WIDEN YOUR WINDOW TO 120 CHARACTERS.  Use 'enscript -r -fCourier9' to print this document            ***
************************************************************************************************************************

---------------------------------------------- PETSc Performance Summary: ----------------------------------------------

/home1/00042/tg457051/Development/mef90-default/VarFracQS/stampede-mef90-O/VarFracQS3D on a stampede- named c464-301.stampede.tacc.utexas.edu with 4096 processors, by tg457051 Sat Mar 23 18:48:40 2013
Using Petsc Release Version 3.2.0, Patch 7, unknown

                         Max       Max/Min        Avg      Total
Time (sec):           3.957e+03      1.00000   3.957e+03
Objects:              2.576e+04      1.00000   2.576e+04
Flops:                1.086e+12      2.11739   7.905e+11  3.238e+15
Flops/sec:            2.745e+08      2.11739   1.998e+08  8.183e+11
MPI Messages:         7.278e+07     10.32486   3.003e+07  1.230e+11
MPI Message Lengths:  4.344e+10      3.13878   8.493e+02  1.045e+14
MPI Reductions:       7.230e+06      1.00000

Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract)
                            e.g., VecAXPY() for real vectors of length N --> 2N flops
                            and VecAXPY() for complex vectors of length N --> 8N flops

Summary of Stages:   ----- Time ------  ----- Flops -----  --- Messages ---  -- Message Lengths --  -- Reductions --
                        Avg     %Total     Avg     %Total   counts   %Total     Avg         %Total   counts   %Total
 0:      Main Stage: 2.5763e+03  65.1%  3.2380e+15 100.0%  1.230e+11 100.0%  8.493e+02      100.0%  7.013e+06  97.0%

------------------------------------------------------------------------------------------------------------------------
See the 'Profiling' chapter of the users' manual for details on interpreting output.
Phase summary info:
   Count: number of times phase was executed
   Time and Flops: Max - maximum over all processors
                   Ratio - ratio of maximum to minimum over all processors
   Mess: number of messages sent
   Avg. len: average message length
   Reduct: number of global reductions
   Global: entire computation
   Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop().
      %T - percent time in this phase         %f - percent flops in this phase
      %M - percent messages in this phase     %L - percent message lengths in this phase
      %R - percent reductions in this phase
   Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors)
------------------------------------------------------------------------------------------------------------------------
Event                  Count         Time (sec)          Flops                                   --- Global ---  --- Stage ---    Total
                           Max Ratio        Max    Ratio      Max Ratio    Mess Avg len  Reduct  %T %f %M %L %R  %T %f %M %L %R Mflop/s
------------------------------------------------------------------------------------------------------------------------

--- Event Stage 0: Main Stage

DMMeshGetGlobalScatter      11   1.0 3.7420e-01      1.4 0.00e+00   0.0 5.6e+05 1.1e+02 7.7e+01   0  0  0  0  0   0  0  0  0  0       0
DMMeshAssembleMatrix   7005050   1.0 1.0010e+02      2.5 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   2  0  0  0  0   2  0  0  0  0       0
DMMeshUpdateOperator   7005050   1.0 1.3925e+01      1.2 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
DistributeMesh               1   1.0 4.3743e+02      1.4 0.00e+00   0.0 3.3e+04 1.6e+04 0.0e+00  11  0  0  0  0  17  0  0  0  0       0
PartitionCreate              1   1.0 2.7071e+02 627999.4 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
PartitionClosure             1   1.0 1.4730e+01 109934.6 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
DistributeCoords             1   1.0 4.2211e+00      1.1 0.00e+00   0.0 1.6e+04 6.8e+03 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
DistributeLabels             1   1.0 5.0949e+01      1.0 0.00e+00   0.0 2.5e+04 3.3e+03 0.0e+00   1  0  0  0  0   2  0  0  0  0       0
CreateOverlap                1   1.0 2.7008e+02      1.0 0.00e+00   0.0 0.0e+00 0.0e+00 5.0e+00   7  0  0  0  0  10  0  0  0  0       0
VecMax                     758   1.0 1.2537e-01      1.8 0.00e+00   0.0 0.0e+00 0.0e+00 7.6e+02   0  0  0  0  0   0  0  0  0  0       0
VecMin                     758   1.0 2.1504e-01      2.3 0.00e+00   0.0 0.0e+00 0.0e+00 7.6e+02   0  0  0  0  0   0  0  0  0  0       0
VecDot                   11216   1.0 1.2606e+01      2.6 1.71e+07   2.6 0.0e+00 0.0e+00 1.1e+04   0  0  0  0  0   0  0  0  0  0    3872
VecTDot                4635002   1.0 7.9165e+02      1.5 2.10e+10   2.6 0.0e+00 0.0e+00 4.6e+06  17  2  0  0 64  27  2  0  0 66   75979
VecNorm                2323675   1.0 4.3012e+02      1.7 1.05e+10   2.6 0.0e+00 0.0e+00 2.3e+06   9  1  0  0 32  13  1  0  0 33   70028
VecScale                  6987   1.0 4.2856e-02      5.1 5.32e+06   2.6 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0  355059
VecCopy                   9371   1.0 2.1868e-02      2.4 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
VecSet                 2325173   1.0 3.2163e+00      2.4 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
VecAXPY                4638133   1.0 1.8223e+01      5.1 2.10e+10   2.6 0.0e+00 0.0e+00 0.0e+00   0  2  0  0  0   0  2  0  0  0 3302163
VecAYPX                2317502   1.0 6.8930e+00      2.6 1.05e+10   2.6 0.0e+00 0.0e+00 0.0e+00   0  1  0  0  0   0  1  0  0  0 4363718
VecWAXPY                  4070   1.0 7.2284e-03      1.8 3.51e+06   2.6 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0 1390141
VecScatterBegin        2335270   1.0 7.1873e+01      7.3 0.00e+00   0.0 1.2e+11 8.2e+02 0.0e+00   1  0 99 96  0   1  0 99 96  0       0
VecScatterEnd          2335270   1.0 1.9943e+02     14.3 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   1  0  0  0  0   2  0  0  0  0       0
MatMult                2319028   1.0 3.8884e+02      2.8 4.70e+11   2.6 1.2e+11 8.2e+02 0.0e+00   6 40 99 95  0   9 40 99 95  0 3319606
MatSolve               2319786   1.0 1.9988e+02      3.3 3.78e+11   3.0 0.0e+00 0.0e+00 0.0e+00   3 33  0  0  0   5 33  0  0  0 5284543
MatLUFactorNum             770   1.0 2.0097e-01      2.1 3.94e+07  47.2 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0  173392
MatILUFactorSym            770   1.0 1.2418e-01      1.3 0.00e+00   0.0 0.0e+00 0.0e+00 7.7e+02   0  0  0  0  0   0  0  0  0  0       0
MatScale                   769   1.0 1.7675e-01      6.0 1.77e+07   2.6 4.0e+07 2.8e+02 0.0e+00   0  0  0  0  0   0  0  0  0  0  274706
MatAssemblyBegin          3826   1.0 9.2762e+01      3.7 0.00e+00   0.0 1.3e+08 3.4e+04 6.1e+03   2  0  0  4  0   3  0  0  4  0       0
MatAssemblyEnd            3826   1.0 9.7335e+00     10.7 0.00e+00   0.0 2.1e+05 1.4e+02 1.6e+01   0  0  0  0  0   0  0  0  0  0       0
MatGetRowIJ                770   1.0 2.2493e-02      1.8 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
MatGetSubMatrice           770   1.0 1.1439e-01      1.4 0.00e+00   0.0 0.0e+00 0.0e+00 3.1e+03   0  0  0  0  0   0  0  0  0  0       0
MatGetOrdering             770   1.0 6.5882e-02      1.3 0.00e+00   0.0 0.0e+00 0.0e+00 1.5e+03   0  0  0  0  0   0  0  0  0  0       0
MatZeroEntries            1527   1.0 9.9822e-02      3.1 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
KSPSetup                  1540   1.0 2.2715e-02    100.2 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
KSPSolve                  1527   1.0 1.4449e+03      1.0 9.08e+11   2.7 1.2e+11 8.2e+02 7.0e+06  36 78 99 95 96  56 78 99 95 99 1749300
PCSetUp                   1540   1.0 8.9031e-01      1.3 3.94e+07  47.2 0.0e+00 0.0e+00 5.4e+03   0  0  0  0  0   0  0  0  0  0   39140
PCSetUpOnBlocks        2321313   1.0 2.2694e+02      2.8 3.78e+11   3.0 0.0e+00 0.0e+00 2.3e+03   4 33  0  0  0   6 33  0  0  0 4654622
PCApply                2319786   1.0 2.2792e+02      2.8 3.78e+11   3.0 0.0e+00 0.0e+00 0.0e+00   4 33  0  0  0   6 33  0  0  0 4634466
TaoAppObjective           4059   1.0 1.3549e+02      1.1 3.13e+10   1.0 1.1e+09 1.8e+02 8.1e+03   3  4  1  0  0   5  4  1  0  0  947379
TaoAppGradient            4059   1.0 1.3549e+02      1.1 3.13e+10   1.0 1.1e+09 1.8e+02 8.1e+03   3  4  1  0  0   5  4  1  0  0  947371
TaoAppHessian              769   1.0 8.8963e+01      1.0 4.16e+09   1.0 5.9e+07 1.4e+04 1.5e+03   2  1  0  1  0   3  1  0  1  0  191592
TaoSolve                   758   1.0 2.4537e+02      1.0 3.67e+10   1.0 2.5e+09 5.6e+02 1.2e+05   6  5  2  1  2  10  5  2  1  2  606359
PointwiseMinMax           8728   1.0 4.3825e-02      3.2 0.00e+00   0.0 0.0e+00 0.0e+00 0.0e+00   0  0  0  0  0   0  0  0  0  0       0
Identify Indices          9151   1.0 3.7516e+00      1.9 0.00e+00   0.0 0.0e+00 0.0e+00 6.1e+03   0  0  0  0  0   0  0  0  0  0       0
------------------------------------------------------------------------------------------------------------------------

Memory usage is given in bytes:

Object Type        Creations Destructions     Memory Descendants' Mem.
Reports information only for process 0.

--- Event Stage 0: Main Stage

TAO Solver                 1            0          0                 0
Viewer                   201          199     141688                 0
Distributed Mesh           2            1       4096                 0
SectionReal               32            2       1232                 0
SectionInt                 8            0          0                 0
Vector                 14764        14717   39153368                 0
Vector Scatter           782          769     796684                 0
Index Set               8417         8407   14119624                 0
Matrix                  1546         1536  188752896                 0
Krylov Solver              4            0          0                 0
Preconditioner             4            0          0                 0
TAO Application            1            0          0                 0
========================================================================================================================
Average time to get PetscTime(): 9.53674e-08
Average time for MPI_Barrier(): 2.79903e-05
Average time for zero size MPI_Send(): 3.42238e-06
#PETSc Option Table entries:
-U_ksp_type cg
-U_pc_type bjacobi
-V_ksp_type cg
-V_pc_type bjacobi
-atnum 1
-bt 0
-btint 10000
-btscope 50
-bttol 0.010000
-bttype 1
-epsilon 0.500000
-irrev 2
-irrevtol 1.000000
-kepsilon 0.000000
-p 450371
-savestrain 0
-savestress 0
#End of PETSc Option Table entries
Compiled without FORTRAN kernels
Compiled with full precision matrices (default)
sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8
Configure run at: Sun Jan 13 10:11:34 2013
Configure options: --COPTFLAGS= --CXXOPTFLAGS= --FOPTFLAGS= --download-boost --download-chaco --download-exodusii --download-metis --download-netcdf --download-parmetis --download-triangle --download-yaml --with-blas-lapack-dir=/opt/apps/intel/13/composer_xe_2013.1.117/mkl/lib/intel64 --with-clanguage=C++ --with-cmake=cmake --with-debugging=no --with-dynamic-loading=0 --with-fortran-datatypes --with-hdf5=/opt/apps/intel13/mvapich2_1_9/phdf5/1.8.9 --with-mpi-compilers --with-mpi-dir=/opt/apps/intel13/mvapich2/1.9 --with-mpiexec=mpirun_rsh --with-pic --with-shared-libraries=1 --with-sieve --with-x=0 PETSC_ARCH=stampede-mef90-O
-----------------------------------------
Libraries compiled on Sun Jan 13 10:11:34 2013 on login3.stampede.tacc.utexas.edu
Machine characteristics: Linux-2.6.32-279.14.1.el6.x86_64-x86_64-with-centos-6.3-Final
Using PETSc directory: /home1/00042/tg457051/Development/petsc-3.2
Using PETSc arch: stampede-mef90-O
-----------------------------------------
Using C compiler: /opt/apps/intel13/mvapich2/1.9/bin/mpicxx -wd1572 -fPIC ${COPTFLAGS} ${CFLAGS}
Using Fortran compiler: /opt/apps/intel13/mvapich2/1.9/bin/mpif90 -fPIC ${FOPTFLAGS} ${FFLAGS}
-----------------------------------------
Using include paths: -I/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/include -I/home1/00042/tg457051/Development/petsc-3.2/include -I/home1/00042/tg457051/Development/petsc-3.2/include -I/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/include -I/home1/00042/tg457051/Development/petsc-3.2/include/sieve -I/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/cbind/include -I/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/forbind/include -I/opt/apps/intel13/mvapich2/1.9/include
-----------------------------------------
Using C linker: /opt/apps/intel13/mvapich2/1.9/bin/mpicxx
Using Fortran linker: /opt/apps/intel13/mvapich2/1.9/bin/mpif90
Using libraries: -Wl,-rpath,/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/lib -L/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/lib -lpetsc -Wl,-rpath,/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/lib -L/home1/00042/tg457051/Development/petsc-3.2/stampede-mef90-O/lib -ltriangle -lparmetis -lmetis -lpthread -lchaco -lyaml -Wl,-rpath,/opt/apps/intel/13/composer_xe_2013.1.117/mkl/lib/intel64 -L/opt/apps/intel/13/composer_xe_2013.1.117/mkl/lib/intel64 -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -lpthread -lexoIIv2for -lexodus -lnetcdf -Wl,-rpath,/opt/apps/limic2/0.5.5/lib -L/opt/apps/limic2/0.5.5/lib -Wl,-rpath,/opt/ofed/lib64 -L/opt/ofed/lib64 -Wl,-rpath,/opt/apps/intel13/mvapich2/1.9/lib -L/opt/apps/intel13/mvapich2/1.9/lib -Wl,-rpath,/opt/apps/intel/13/composer_xe_2013.1.117/compiler/lib/intel64 -L/opt/apps/intel/13/composer_xe_2013.1.117/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.6 -lmpichf90 -lifport -lifcore -lm -lm -lmpichcxx -ldl -lmpich -lopa -lmpl -libmad -lrdmacm -libumad -libverbs -lrt -llimic2 -lpthread -limf -lsvml -lirng -lipgo -ldecimal -lcilkrts -lstdc++ -lgcc_s -lirc -lirc_s -ldl
-----------------------------------------
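
Note on reading the summary figures: the header totals and the Mflop/s legend can be cross-checked against each other. The Python sketch below is not part of the PETSc output; it re-derives a few reported values from numbers copied by hand out of this log. The names `check_summary` and `rel_diff` are hypothetical helpers, and the KSPSolve estimate assumes that the event's share of all flops is the rounded 78% shown in its %f column. Under that assumption the Mflop/s column looks consistent with a scale factor of 1e-6 (flops to Mflops), not the "10e-6" printed in the legend.

```python
# Figures copied by hand from the performance summary in this log.
NUM_PROCS = 4096
TOTAL_FLOPS = 3.238e15              # "Flops" row, Total column
MAX_TIME_SEC = 3.957e3              # "Time (sec)" row, Max column
TOTAL_MSG_LENGTHS = 1.045e14        # "MPI Message Lengths" row, Total column
TOTAL_MSGS = 1.230e11               # "MPI Messages" row, Total column

# Values the log itself reports, used here as the reference.
AVG_FLOPS_REPORTED = 7.905e11       # "Flops" row, Avg column
FLOPS_PER_SEC_REPORTED = 8.183e11   # "Flops/sec" row, Total column
AVG_MSG_LEN_REPORTED = 8.493e2      # "MPI Message Lengths" row, Avg column

# KSPSolve row of the event table.
KSPSOLVE_MAX_TIME_SEC = 1.4449e3
KSPSOLVE_FLOP_SHARE = 0.78          # %f column, rounded to a whole percent
KSPSOLVE_MFLOPS_REPORTED = 1749300


def rel_diff(value, reference):
    """Relative difference between a recomputed value and the logged one."""
    return abs(value - reference) / abs(reference)


def check_summary():
    """Return {name: (recomputed, reported, relative difference)}."""
    recomputed = {
        "avg_flops": (TOTAL_FLOPS / NUM_PROCS, AVG_FLOPS_REPORTED),
        "total_flops_per_sec": (TOTAL_FLOPS / MAX_TIME_SEC,
                                FLOPS_PER_SEC_REPORTED),
        "avg_msg_len": (TOTAL_MSG_LENGTHS / TOTAL_MSGS,
                        AVG_MSG_LEN_REPORTED),
        # Assumption: sum of KSPSolve flops ~= 78% of all flops.
        "kspsolve_mflops": (1e-6 * KSPSOLVE_FLOP_SHARE * TOTAL_FLOPS
                            / KSPSOLVE_MAX_TIME_SEC,
                            KSPSOLVE_MFLOPS_REPORTED),
    }
    return {name: (value, ref, rel_diff(value, ref))
            for name, (value, ref) in recomputed.items()}


if __name__ == "__main__":
    for name, (value, ref, diff) in check_summary().items():
        print(f"{name:22s} recomputed={value:.4e} reported={ref:.4e} "
              f"rel_diff={diff:.1e}")
```

The small residual differences are expected, since the log prints only three or four significant digits and whole-number percentages.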