0 SNES Function norm 1.030411923746e+00 0 KSP Residual norm 2.680672878291e+02 1 KSP Residual norm 1.360552050306e-01 2 KSP Residual norm 1.396643813556e-03 3 KSP Residual norm 1.535114940668e-05 4 KSP Residual norm 2.556317697816e-07 1 SNES Function norm 3.234969897576e-05 0 KSP Residual norm 2.088836838588e+01 1 KSP Residual norm 2.281625249923e-05 2 KSP Residual norm 9.472233496752e-09 2 SNES Function norm 2.483925900537e-07 0 KSP Residual norm 1.711377134200e-01 1 KSP Residual norm 1.298587928589e-07 2 KSP Residual norm 6.987032000239e-11 3 SNES Function norm 2.558824601758e-11 SNES Object: 16 MPI processes type: newtonls maximum iterations=50, maximum function evaluations=10000 tolerances: relative=1e-08, absolute=1e-50, solution=1e-08 total number of linear solver iterations=8 total number of function evaluations=4 norm schedule ALWAYS SNESLineSearch Object: 16 MPI processes type: bt interpolation: cubic alpha=1.000000e-04 maxstep=1.000000e+08, minlambda=1.000000e-12 tolerances: relative=1.000000e-08, absolute=1.000000e-15, lambda=1.000000e-08 maximum iterations=40 KSP Object: 16 MPI processes type: gmres GMRES: restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with no iterative refinement GMRES: happy breakdown tolerance 1e-30 maximum iterations=10000, initial guess is zero tolerances: relative=1e-09, absolute=1e-50, divergence=10000 left preconditioning using PRECONDITIONED norm type for convergence test PC Object: 16 MPI processes type: mg MG: type is MULTIPLICATIVE, levels=3 cycles=v Cycles per PCApply=1 Not using Galerkin computed coarse grid matrices Coarse grid solver -- level ------------------------------- KSP Object: (mg_coarse_) 16 MPI processes type: preonly maximum iterations=1, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_) 16 MPI processes type: redundant Redundant preconditioner: First (color=0) of 16 PCs follows KSP Object: (mg_coarse_redundant_) 1 MPI processes type: preonly maximum iterations=10000, initial guess is zero tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using NONE norm type for convergence test PC Object: (mg_coarse_redundant_) 1 MPI processes type: lu LU: out-of-place factorization tolerance for zero pivot 2.22045e-14 using diagonal shift on blocks to prevent zero pivot [INBLOCKS] matrix ordering: nd factor fill ratio given 5, needed 12.0667 Factored matrix follows: Mat Object: 1 MPI processes type: seqaij rows=410881, cols=410881 package used to perform factorization: petsc total: nonzeros=2.47589e+07, allocated nonzeros=2.47589e+07 total number of mallocs used during MatSetValues calls =0 not using I-node routines linear system matrix = precond matrix: Mat Object: 1 MPI processes type: seqaij rows=410881, cols=410881 total: nonzeros=2.05184e+06, allocated nonzeros=2.05184e+06 total number of mallocs used during MatSetValues calls =0 not using I-node routines linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=410881, cols=410881 total: nonzeros=2.05184e+06, allocated nonzeros=2.05184e+06 total number of mallocs used during MatSetValues calls =0 Down solver (pre-smoother) on level 1 ------------------------------- KSP Object: (mg_levels_1_) 16 MPI processes type: richardson Richardson: damping factor=1 maximum iterations=2 tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using nonzero initial guess using NONE norm type for convergence test PC Object: (mg_levels_1_) 16 MPI processes type: sor SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1 linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=1640961, cols=1640961 total: nonzeros=8.19968e+06, allocated nonzeros=8.19968e+06 total number of mallocs used during MatSetValues calls =0 Up solver (post-smoother) same as down solver (pre-smoother) Down solver (pre-smoother) on level 2 ------------------------------- KSP Object: (mg_levels_2_) 16 MPI processes type: richardson Richardson: damping factor=1 maximum iterations=2 tolerances: relative=1e-05, absolute=1e-50, divergence=10000 left preconditioning using nonzero initial guess using NONE norm type for convergence test PC Object: (mg_levels_2_) 16 MPI processes type: sor SOR: type = local_symmetric, iterations = 1, local iterations = 1, omega = 1 linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=6558721, cols=6558721 total: nonzeros=3.27834e+07, allocated nonzeros=3.27834e+07 total number of mallocs used during MatSetValues calls =0 Up solver (post-smoother) same as down solver (pre-smoother) linear system matrix = precond matrix: Mat Object: 16 MPI processes type: mpiaij rows=6558721, cols=6558721 total: nonzeros=3.27834e+07, allocated nonzeros=3.27834e+07 total number of mallocs used during MatSetValues calls =0 ************************************************************************************************************************ *** WIDEN YOUR WINDOW TO 120 CHARACTERS. Use 'enscript -r -fCourier9' to print this document *** ************************************************************************************************************************ ---------------------------------------------- PETSc Performance Summary: ---------------------------------------------- ./ex5 on a arch-linux2-c-opt named helios91 with 16 processors, by tnicolas Thu Oct 15 13:56:57 2015 Using Petsc Release Version 3.6.0, Jun, 09, 2015 Max Max/Min Avg Total Time (sec): 2.411e+01 1.00033 2.410e+01 Objects: 2.490e+02 1.00000 2.490e+02 Flops: 1.841e+10 1.00010 1.841e+10 2.945e+11 Flops/sec: 7.638e+08 1.00038 7.637e+08 1.222e+10 MPI Messages: 8.910e+02 1.71676 7.085e+02 1.134e+04 MPI Message Lengths: 4.294e+07 1.02414 5.999e+04 6.800e+08 MPI Reductions: 4.050e+02 1.00000 Flop counting convention: 1 flop = 1 real number operation of type (multiply/divide/add/subtract) e.g., VecAXPY() for real vectors of length N --> 2N flops and VecAXPY() for complex vectors of length N --> 8N flops Summary of Stages: ----- Time ------ ----- Flops ----- --- Messages --- -- Message Lengths -- -- Reductions -- Avg %Total Avg %Total counts %Total Avg %Total counts %Total 0: Main Stage: 2.4103e+01 100.0% 2.9450e+11 100.0% 1.134e+04 100.0% 5.999e+04 100.0% 4.040e+02 99.8% ------------------------------------------------------------------------------------------------------------------------ See the 'Profiling' chapter of the users' manual for details on interpreting output. Phase summary info: Count: number of times phase was executed Time and Flops: Max - maximum over all processors Ratio - ratio of maximum to minimum over all processors Mess: number of messages sent Avg. len: average message length (bytes) Reduct: number of global reductions Global: entire computation Stage: stages of a computation. Set stages with PetscLogStagePush() and PetscLogStagePop(). %T - percent time in this phase %F - percent flops in this phase %M - percent messages in this phase %L - percent message lengths in this phase %R - percent reductions in this phase Total Mflop/s: 10e-6 * (sum of flops over all processors)/(max time over all processors) ------------------------------------------------------------------------------------------------------------------------ Event Count Time (sec) Flops --- Global --- --- Stage --- Total Max Ratio Max Ratio Max Ratio Mess Avg len Reduct %T %F %M %L %R %T %F %M %L %R Mflop/s ------------------------------------------------------------------------------------------------------------------------ --- Event Stage 0: Main Stage SNESSolve 1 1.0 2.3738e+01 1.0 1.84e+10 1.0 1.1e+04 6.2e+04 3.6e+02 98100 97100 88 98100 97100 89 12407 SNESFunctionEval 4 1.0 4.6840e-02 1.0 1.81e+07 1.0 1.9e+02 5.1e+03 0.0e+00 0 0 2 0 0 0 0 2 0 0 6161 SNESJacobianEval 9 1.0 4.1105e-01 1.0 0.00e+00 0.0 4.3e+02 3.0e+03 1.8e+01 2 0 4 0 4 2 0 4 0 4 0 SNESLineSearch 3 1.0 8.9235e-02 1.0 3.82e+07 1.0 2.9e+02 5.1e+03 1.2e+01 0 0 3 0 3 0 0 3 0 3 6835 VecDot 3 1.0 4.6122e-03 1.2 2.47e+06 1.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 8532 VecMDot 8 1.0 3.9144e-02 4.4 1.31e+07 1.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 2 0 0 0 0 2 5362 VecNorm 18 1.0 4.2523e-02 1.1 1.48e+07 1.0 0.0e+00 0.0e+00 1.8e+01 0 0 0 0 4 0 0 0 0 4 5553 VecScale 77 1.0 1.4639e-02 2.2 4.63e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5032 VecCopy 9 1.0 1.4708e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecSet 133 1.0 4.0860e-02 1.2 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 VecAXPY 3 1.0 5.7969e-03 1.1 2.47e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 6789 VecAYPX 22 1.0 2.2357e-02 1.5 5.65e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 4034 VecWAXPY 3 1.0 7.6721e-03 1.0 1.23e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 2565 VecMAXPY 11 1.0 3.1315e-02 1.1 1.97e+07 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 10053 VecPointwiseMult 6 1.0 1.0839e-03 1.1 3.87e+05 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 5679 VecScatterBegin 188 1.0 5.5496e-02 1.3 0.00e+00 0.0 9.7e+03 5.8e+04 0.0e+00 0 0 86 83 0 0 0 86 83 0 0 VecScatterEnd 188 1.0 1.6134e-01 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 VecReduceArith 6 1.0 4.9202e-03 1.4 4.93e+06 1.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 15996 VecReduceComm 3 1.0 6.2704e-0439.3 0.00e+00 0.0 0.0e+00 0.0e+00 3.0e+00 0 0 0 0 1 0 0 0 0 1 0 VecNormalize 11 1.0 2.2433e-02 1.5 1.36e+07 1.0 0.0e+00 0.0e+00 1.1e+01 0 0 0 0 3 0 0 0 0 3 9648 MatMult 33 1.0 2.0806e-01 1.2 9.15e+07 1.0 1.6e+03 4.3e+03 0.0e+00 1 0 14 1 0 1 0 14 1 0 7020 MatMultAdd 22 1.0 9.0072e-02 1.7 2.54e+07 1.0 7.3e+02 1.4e+03 0.0e+00 0 0 6 0 0 0 0 6 0 0 4505 MatMultTranspose 30 1.0 8.7991e-02 1.2 3.46e+07 1.0 9.9e+02 1.4e+03 0.0e+00 0 0 9 0 0 0 0 9 0 0 6288 MatSolve 11 1.0 7.6623e-01 1.0 5.40e+08 1.0 0.0e+00 0.0e+00 0.0e+00 3 3 0 0 0 3 3 0 0 0 11280 MatSOR 44 1.0 1.5401e+00 1.2 3.12e+08 1.0 3.2e+03 3.8e+03 8.8e+01 6 2 28 2 22 6 2 28 2 22 3235 MatLUFactorSym 1 1.0 6.2216e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 3 0 0 0 0 3 0 0 0 0 0 MatLUFactorNum 3 1.0 1.9370e+01 1.0 1.73e+10 1.0 0.0e+00 0.0e+00 0.0e+00 80 94 0 0 0 80 94 0 0 0 14304 MatCopy 2 1.0 1.8473e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatConvert 1 1.0 2.2478e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatResidual 22 1.0 1.4124e-01 1.3 5.65e+07 1.0 1.1e+03 3.8e+03 0.0e+00 1 0 9 1 0 1 0 9 1 0 6384 MatAssemblyBegin 15 1.0 1.9669e-02 3.7 0.00e+00 0.0 0.0e+00 0.0e+00 2.8e+01 0 0 0 0 7 0 0 0 0 7 0 MatAssemblyEnd 15 1.0 1.0671e-01 1.0 0.00e+00 0.0 4.2e+02 6.2e+02 4.0e+01 0 0 4 0 10 0 0 4 0 10 0 MatGetRowIJ 1 1.0 1.5703e-02 1.1 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 0 0 0 0 0 0 0 0 0 0 0 MatGetSubMatrice 3 1.0 8.1118e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 2 0 0 0 0 2 0 MatGetOrdering 1 1.0 2.5992e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 1 0 0 0 0 1 0 0 0 0 0 MatView 6 1.5 4.8237e-03 1.9 0.00e+00 0.0 0.0e+00 0.0e+00 4.0e+00 0 0 0 0 1 0 0 0 0 1 0 MatRedundantMat 3 1.0 1.2165e-01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 2 0 0 0 0 2 0 KSPGMRESOrthog 8 1.0 5.9303e-02 2.2 2.63e+07 1.0 0.0e+00 0.0e+00 8.0e+00 0 0 0 0 2 0 0 0 0 2 7078 KSPSetUp 15 1.0 1.6701e-02 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 1.6e+01 0 0 0 0 4 0 0 0 0 4 0 KSPSolve 3 1.0 2.3295e+01 1.0 1.84e+10 1.0 1.1e+04 6.4e+04 3.4e+02 97100 93100 83 97100 93100 83 12613 PCSetUp 3 1.0 2.0743e+01 1.0 1.73e+10 1.0 1.9e+03 6.1e+04 2.0e+02 85 94 16 17 49 85 94 16 17 49 13364 PCApply 11 1.0 2.6292e+00 1.1 9.60e+08 1.0 8.3e+03 6.7e+04 8.8e+01 11 5 73 82 22 11 5 73 82 22 5834 ------------------------------------------------------------------------------------------------------------------------ Memory usage is given in bytes: Object Type Creations Destructions Memory Descendants' Mem. Reports information only for process 0. --- Event Stage 0: Main Stage SNES 1 1 1332 0 SNESLineSearch 1 1 864 0 DMSNES 4 4 2816 0 Vector 124 124 218327280 0 Vector Scatter 15 15 11131936 0 Matrix 20 20 451837424 0 Distributed Mesh 7 7 34336 0 Star Forest Bipartite Graph 14 14 11872 0 Discrete System 7 7 5936 0 Index Set 35 35 11493640 0 IS L to G Mapping 6 6 4340480 0 Krylov Solver 5 5 23280 0 DMKSP interface 3 3 1944 0 Preconditioner 5 5 4912 0 Viewer 2 1 760 0 ======================================================================================================================== Average time to get PetscTime(): 9.53674e-08 Average time for MPI_Barrier(): 1.81198e-06 Average time for zero size MPI_Send(): 5.43892e-06 #PETSc Option Table entries: -da_grid_x 21 -da_grid_y 21 -da_refine 7 -ksp_monitor -ksp_rtol 1e-9 -log_summary -mg_levels_ksp_type richardson -pc_mg_levels 3 -pc_type mg -snes_monitor -snes_view #End of PETSc Option Table entries Compiled without FORTRAN kernels Compiled with full precision matrices (default) sizeof(short) 2 sizeof(int) 4 sizeof(long) 8 sizeof(void*) 8 sizeof(PetscScalar) 8 sizeof(PetscInt) 4 Configure options: --prefix=/csc/softs/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real --with-debugging=0 --with-x=0 --with-cc=mpicc --with-fc=mpif90 --with-cxx=mpicxx --with-fortran --known-mpi-shared-libraries=1 --with-scalar-type=real --with-precision=double --CFLAGS="-g -O3 -mavx -mkl" --CXXFLAGS="-g -O3 -mavx -mkl" --FFLAGS="-g -O3 -mavx -mkl" ----------------------------------------- Libraries compiled on Mon Sep 28 20:22:47 2015 on helios85 Machine characteristics: Linux-2.6.32-573.1.1.el6.Bull.80.x86_64-x86_64-with-redhat-6.4-Santiago Using PETSc directory: /csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0 Using PETSc arch: arch-linux2-c-opt ----------------------------------------- Using C compiler: mpicc -g -O3 -mavx -mkl -fPIC ${COPTFLAGS} ${CFLAGS} Using Fortran compiler: mpif90 -g -O3 -mavx -mkl -fPIC ${FOPTFLAGS} ${FFLAGS} ----------------------------------------- Using include paths: -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/include -I/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/include -I/opt/mpi/bullxmpi/1.2.8.2/include ----------------------------------------- Using C linker: mpicc Using Fortran linker: mpif90 Using libraries: -Wl,-rpath,/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/lib -L/csc/releases/buildlog/anl/petsc-3.6.0/intel-15.0.0.090/bullxmpi-1.2.8.2/real/petsc-3.6.0/arch-linux2-c-opt/lib -lpetsc -lhwloc -lxml2 -lssl -lcrypto -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -lmpi_f90 -lmpi_f77 -lm -lifport -lifcore -lm -lmpi_cxx -ldl -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -lmpi -lnuma -lrt -lnsl -lutil -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -lmkl_intel_lp64 -lmkl_intel_thread -lmkl_core -liomp5 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -limf -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -lsvml -lirng -lipgo -ldecimal -lcilkrts -lstdc++ -lgcc_s -lirc -lpthread -lirc_s -Wl,-rpath,/opt/mpi/bullxmpi/1.2.8.2/lib -L/opt/mpi/bullxmpi/1.2.8.2/lib -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -L/usr/lib/gcc/x86_64-redhat-linux/4.4.7 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/compiler/lib/intel64 -Wl,-rpath,/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -L/opt/intel/composer_xe_2015.0.090/mkl/lib/intel64 -ldl -----------------------------------------