[mpich-discuss] Internal memory allocation error?
Brian Harker
brian.harker at gmail.com
Fri Oct 17 16:19:01 CDT 2008
Hello list-
I have a Fortran 90 program that loops over the pixels in an image in
parallel. There are 211K total pixels in the field of view, and the
code always crashes around the 160,000th pixel, give or take a hundred
or so, with the following message:
Fatal error in MPI_Recv: Other MPI error, error stack:
MPI_Recv(186).............................: MPI_Recv(buf=0x82210d0,
count=2, MPI_INTEGER, src=0, tag=MPI_ANY_TAG, MPI_COMM_WORLD,
status=0x82210e0) failed
MPIDI_CH3i_Progress_wait(214).............: an error occurred while
handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(436):
MPIDI_EagerContigIsend(567)...............: failure occurred while
allocating memory for a request object
[cli_2]: aborting job:
I have no allocatable (dynamically allocated) variables in the code,
so does the error mean that there is not enough buffer memory for all
the communication at this step? I increased MP_BUFFER_MEM from the
default 64M to 128M, and the error did not change. Is it possible that
I'm simply trying to do too much at once on my dual-core processor? I
wouldn't think so, since I'm only running the code with 6 processes,
and I don't believe this is a data problem.
Any ideas would be appreciated, and I can post any other information
anyone wants. Thanks!
--
Cheers,
Brian
brian.harker at gmail.com
"In science, there is only physics; all the rest is stamp-collecting."
-Ernest Rutherford