KSP Object: 1 MPI processes type: cg maximum iterations=10000, initial guess is zero tolerances: relative=1e-08, absolute=1e-50, divergence=10000 left preconditioning using PRECONDITIONED norm type for convergence test PC Object: 1 MPI processes type: jacobi linear system matrix = precond matrix: Matrix Object: 1 MPI processes type: seqaijcusp rows=10240000, cols=10240000 total: nonzeros=51187200, allocated nonzeros=51200000 total number of mallocs used during MatSetValues calls =0 not using I-node routines Total flops over all processors 1.19579e+12 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ex2f on a LINUX_GNU named lamyridge.trialphaenergy.com with 1 processor, by wdn Sun Oct 2 13:45:17 2011 Using Petsc Development HG revision: HG Date: Max Max/Min Avg Total Time (sec): 9.093e+02 1.00000 9.093e+02 Objects: 1.100e+01 1.00000 1.100e+01 Flops: 1.196e+12 1.00000 1.196e+12 1.196e+12 Flops/sec: 1.315e+09 1.00000 1.315e+09 1.315e+09 MPI Messages: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Message Lengths: 0.000e+00 0.00000 0.000e+00 0.000e+00 MPI Reductions: 0.000e+00 0.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 9.0929e+02 100.0% 1.1958e+12 100.0% 0.000e+00 0.0% 0.000e+00 0.0% 0.000e+00 0.0% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %f - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %f %M %L %R %T %f %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage MatMult 5309 1.0 7.7951e+02 1.0 4.89e+11 1.0 0.0e+00 0.0e+00 0.0e+00 86 41 0 0 0 86 41 0 0 0 627 MatAssemblyBegin 1 1.0 9.5367e-07 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatAssemblyEnd 1 1.0 5.5532e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatView 1 1.0 3.7909e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecTDot 10616 1.0 8.6652e+01 1.0 2.17e+11 1.0 0.0e+00 0.0e+00 0.0e+00 10 18 0 0 0 10 18 0 0 0 2509 VecNorm 5310 1.0 3.1223e+00 1.0 1.09e+11 1.0 0.0e+00 0.0e+00 0.0e+00 0 9 0 0 0 0 9 0 0 0 34829 VecCopy 2 1.0 3.4727e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 3 1.0 1.4504e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 10617 1.0 1.7932e+01 1.0 2.17e+11 1.0 0.0e+00 0.0e+00 0.0e+00 2 18 0 0 0 2 18 0 0 0 12126 VecAYPX 5307 1.0 8.9635e+00 1.0 1.09e+11 1.0 0.0e+00 0.0e+00 0.0e+00 1 9 0 0 0 1 9 0 0 0 12126 VecPointwiseMult 5309 1.0 9.0499e+00 1.0 5.44e+10 1.0 0.0e+00 0.0e+00 0.0e+00 1 5 0 0 0 1 5 0 0 0 6007 VecCUSPCopyTo 5310 1.0 7.4815e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 8 0 0 0 0 8 0 0 0 0 0 VecCUSPCopyFrom 10618 1.0 1.4308e+02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 16 0 0 0 0 16 0 0 0 0 0 KSPSetup 1 1.0 1.6928e-05 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 KSPSolve 1 1.0 9.0533e+02 1.0 1.20e+12 1.0 0.0e+00 0.0e+00 0.0e+00100100 0 0 0 100100 0 0 0 1321 PCSetUp 1 1.0 9.5367e-07 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 PCApply 5309 1.0 9.2192e+00 1.0 5.44e+10 1.0 0.0e+00 0.0e+00 0.0e+00 1 5 0 0 0 1 5 0 0 0 5897 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage Matrix 1 1 778242500 0 Vector 7 7 573450112 0 Krylov Solver 1 1 1128 0 Preconditioner 1 1 784 0 Viewer 1 0 0 0 ======================================================================================================================== Average time to get PetscTime(): 0 #PETSc Option Table entries: -get_total_flops -ksp_rtol 1.0e-8 -ksp_type cg -ksp_view -log_summary -m 3200 -mat_type aijcusp -n 3200 -options_left -pc_type jacobi -vec_type cusp #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 Configure run at: Sat Oct 1 08:10:39 2011 Configure options: --PETSC_DIR=/home/wdn/Projects/Petsc/src/branches/master/petsc-dev --PETSC_ARCH=LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE --with-mpi=0 --download-f-blas-lapack=yes --download-txpetscgpu=yes --with-cuda=1 --with-thrust=1 --with-cusp=1 --with-shared-libraries --with-debugging=no --with-64-bit-pointers=1 --with-large-file-io=1 ----------------------------------------- Libraries compiled on Sat Oct 1 08:10:39 2011 on lamyridge.trialphaenergy.com Machine characteristics: Linux-2.6.18-238.12.1.el5-x86_64-with-redhat-5.6-Final Using PETSc directory: /home/wdn/Projects/Petsc/src/branches/master/petsc-dev Using PETSc arch: LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE ----------------------------------------- Using C compiler: gcc -fPIC -D_LARGEFILE64_SOURCE -D_FILE_OFFSET_BITS=64 -Wall -Wwrite-strings -Wno-strict-aliasing -Wno-unknown-pragmas -O ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: gfortran -fPIC -Wall -Wno-unused-variable -O ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/include -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/include -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/include -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/include -I/usr/local/cuda/include -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/include/txpetscgpu/include -I/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/include/mpiuni ----------------------------------------- Using C linker: gcc Using Fortran linker: gfortran Using libraries: -Wl,-rpath,/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/lib -L/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/lib -lpetsc -lX11 -Wl,-rpath,/usr/local/cuda/lib64 -L/usr/local/cuda/lib64 -lcufft -lcublas -lcudart -lpthread -Wl,-rpath,/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/lib -L/home/wdn/Projects/Petsc/src/branches/master/petsc-dev/LINUX_GNU_OPTIMIZE_SERIAL_CUDA_40_LITE/lib -lflapack -lfblas -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.1.2 -L/usr/lib/gcc/x86_64-redhat-linux/4.1.2 -ldl -lgcc_s -lgfortran -lm -lm -lstdc++ -lstdc++ -ldl -lgcc_s -ldl ----------------------------------------- #PETSc Option Table entries: -get_total_flops -ksp_rtol 1.0e-8 -ksp_type cg -ksp_view -log_summary -m 3200 -mat_type aijcusp -n 3200 -options_left -pc_type jacobi -vec_type cusp #End of PETSc Option Table entries There are no unused options. Norm of error 0.5317E-03 iterations 5308 real 909.33 user 908.64 sys 0.54