[petsc-users] Debug AOCreateBasic
Rongliang Chen
rongliang.chan at gmail.com
Fri Nov 1 00:27:17 CDT 2013
Hi there,
My code died in AOCreateBasic; the error messages follow below.
Do you have any suggestions for debugging this?
Notes:
1. This case uses an unstructured mesh of size about 30,000,000 and runs
on 96 processors (I also tried 1024 processors and hit the same problem).
2. My code works well for a smaller case (an unstructured mesh of size
about 25,000,000).
3. I also checked the memory usage for this case, and it is very small
because the solution stage has not started yet.
Best,
Rongliang
--------------------------------------------------------------------------
MPI_ABORT was invoked on rank 1 in communicator MPI_COMM_WORLD
with errorcode 59.
NOTE: invoking MPI_ABORT causes Open MPI to kill all MPI processes.
You may or may not see output from other processes, depending on
exactly when Open MPI kills them.
--------------------------------------------------------------------------
[0]PETSC ERROR:
------------------------------------------------------------------------
[0]PETSC ERROR: Caught signal number 15 Terminate: Somet process (or the
batch system) has told this process to end
[0]PETSC ERROR: Try option -start_in_debugger or -on_error_attach_debugger
[0]PETSC ERROR: or see
http://www.mcs.anl.gov/petsc/documentation/faq.html#valgrind
[0]PETSC ERROR: or try http://valgrind.org on GNU/linux and Apple Mac OS X to
find memory corruption errors
[0]PETSC ERROR: likely location of problem given in stack below
[0]PETSC ERROR: --------------------- Stack Frames
------------------------------------
[0]PETSC ERROR: Note: The EXACT line numbers in the stack are not available,
[0]PETSC ERROR: INSTEAD the line number of the start of the function
[0]PETSC ERROR: is given.
[0]PETSC ERROR: [0] AOCreate_Basic line 203
src/vec/is/ao/impls/basic/aobasic.c
[0]PETSC ERROR: [0] AOSetType line 35 src/vec/is/ao/interface/aoreg.c
[0]PETSC ERROR: [0] AOCreateBasicIS line 380
src/vec/is/ao/impls/basic/aobasic.c
[0]PETSC ERROR: [0] AOCreateBasic line 335
src/vec/is/ao/impls/basic/aobasic.c
[0]PETSC ERROR: [0] DataPartitionVertices_Block line 1634
/projects/ronglian/soft/3Dfluid_new/3DWindturbine/WindturbineFor3.4/codes/readbinary3d.c
[0]PETSC ERROR: [0] ReadBinary line 184
/projects/ronglian/soft/3Dfluid_new/3DWindturbine/WindturbineFor3.4/codes/readbinary3d.c
[0]PETSC ERROR: [0] LoadGrid line 720
/projects/ronglian/soft/3Dfluid_new/3DWindturbine/WindturbineFor3.4/codes/loadgrid3d.c
[0]PETSC ERROR: --------------------- Error Message
------------------------------------
[0]PETSC ERROR: Signal received!
[0]PETSC ERROR:
------------------------------------------------------------------------
[0]PETSC ERROR: Petsc Development GIT revision:
ee17fca9fd6ac48e6579ef235144daafbb22b801 GIT Date: 2013-10-23 14:21:20
-0500
[0]PETSC ERROR: See docs/changes/index.html for recent updates.
[0]PETSC ERROR: See docs/faq.html for hints about trouble shooting.
[0]PETSC ERROR: See docs/index.html for manual pages.
[0]PETSC ERROR:
------------------------------------------------------------------------
[0]PETSC ERROR: ./fsi3d on a Janus-debug-64bit named node0880 by
ronglian Thu Oct 31 21:58:20 2013
[0]PETSC ERROR: Libraries linked from
/projects/ronglian/soft/petsc-dev-latest/Janus-debug-64bit/lib
[0]PETSC ERROR: Configure run at Thu Oct 24 21:24:31 2013
[0]PETSC ERROR: Configure options --known-level1-dcache-size=32768
--known-level1-dcache-linesize=64 --known-level1-dcache-assoc=8
--known-memcmp-ok=1 --known-sizeof-char=1 --known-sizeof-void-p=8
--known-sizeof-short=2 --known-sizeof-int=4 --known-sizeof-long=8
--known-sizeof-long-long=8 --known-sizeof-float=4
--known-sizeof-double=8 --known-sizeof-size_t=8 --known-bits-per-byte=8
--known-sizeof-MPI_Comm=8 --known-sizeof-MPI_Fint=4
--known-mpi-long-double=1 --known-mpi-c-double-complex=0
--download-blacs=1 --download-f-blas-lapack=1 --download-metis=1
--download-parmetis=1 --download-scalapack=1 --download-superlu_dist=1
--known-mpi-shared-libraries=0 --with-64-bit-indices --with-batch=1
--with-mpi-shared=1 --download-exodusii=1 --download-hdf5=1
--download-netcdf=1 --known-64-bit-blas-indices --with-debugging=1
COPTFLAGS="-O0 -g"
[0]PETSC ERROR:
------------------------------------------------------------------------
[0]PETSC ERROR: User provided function() line 0 in unknown file
[1]PETSC ERROR:
------------------------------------------------------------------------
[1]PETSC ERROR: Caught signal number 15 Terminate: Somet process (or the
batch system) has told this process to end
[1]PETSC ERROR: Try option -start_in_debugger or -on_error_attach_debugger
[1]PETSC ERROR: or see
http://www.mcs.anl.gov/petsc/documentation/faq.html#valgrind
[1]PETSC ERROR: or try http://valgrind.org on GNU/linux and Apple Mac OS X to
find memory corruption errors
[1]PETSC ERROR: likely location of problem given in stack below