[mpich-discuss] Error

Jayesh Krishna jayesh at mcs.anl.gov
Wed Jul 1 12:45:40 CDT 2009


Hi,
 "signal 11 (Segmentation fault) " . Check the memory accesses in your
program. I would suggest reading through the code carefully and using
compiler options (like bound checking) to find out the bug. This does not
look like an MPICH2 bug (After debugging your code thoroughly if you find
that it is indeed an MPICH2 bug please provide us more details).
 
Regards,
Jayesh

  _____  

From: xiao zhang [mailto:littledddna at gmail.com] 
Sent: Wednesday, July 01, 2009 12:11 PM
To: Jayesh Krishna
Subject: Re: [mpich-discuss] Error


Thank you very much for your reply, Jayesh.
Actually, when I run it with very small array, it also gave me error like
the following:


[cli_36]: aborting job:
Fatal error in MPI_Wait: Internal MPI error!, error stack:
MPI_Wait(140): MPI_Wait(request=0xe4e7460, status0x872cc6c) failed
(unknown)(): Internal MPI error!
mpiexec: Warning: tasks 0-1,3-6,8-9,12,14,16-20,22,24,26-28,32-35,37-39
died with signal 15 (Terminated).
mpiexec: Warning: tasks 2,7,10-11,13,15,21,23,25,29-31 died with signal 11
(Segmentation fault).
mpiexec: Warning: task 36 exited with status 1.


And I am totally lost......

                 Xiao











On Wed, Jul 1, 2009 at 12:00 PM, Jayesh Krishna <jayesh at mcs.anl.gov>
wrote:


Hi,
  "Subscript out of range for array itab_w
(/rstor/xzhang/olam/3.0/modules/mem_sflux.f90: 270) " Are you accessing an
array element which is out of range (Most of the fortran compilers allow
you to check "array and string bounds")?
 
Regards,
Jayesh


  _____  

From: mpich-discuss-bounces at mcs.anl.gov
[mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of
littledddna at gmail.com
Sent: Wednesday, July 01, 2009 9:58 AM
To: mpich-discuss at mcs.anl.gov
Subject: [mpich-discuss] Error


HI~ everyone ~ I met this error when I use MPICH version2 

The error is like:

Fatal error in MPI_Send: Other MPI error, error stack:
MPI_Send(173)...............................: MPI_Send(buf=0x8a550d0,
count=2, MPI_INTEGER, dest=10, tag=10, MPI_COMM_WORLD) failed
MPIDI_CH3_Progress_wait(212)................: an error occurred while
handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(772)..:
MPIDI_CH3_Sockconn_handle_connect_event(589): [ch3:sock] failed to
connnect to remote process
MPIDU_Socki_handle_connect(791).............: connection failure
(set=0,sock=1,errno=111:Connection refused)
0: Subscript out of range for array itab_w
(/rstor/xzhang/olam/3.0/modules/mem_sflux.f90: 270)
subscript=-1, lower bound=1, upper bound=7887, dimension=1
[cli_23]: aborting job:
Fatal error in MPI_Send: Other MPI error, error stack:
MPI_Send(173)...............................: MPI_Send(buf=0x8a550d0,
count=2, MPI_INTEGER, dest=26, tag=10, MPI_COMM_WORLD) failed
MPIDI_CH3_Progress_wait(212)................: an error occurred while
handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(772)..:
MPIDI_CH3_Sockconn_handle_connect_event(589): [ch3:sock] failed to
connnect to remote process
MPIDU_Socki_handle_connect(791).............: connection failure
(set=0,sock=4,errno=111:Connection refused)
0: Subscript out of range for array itab_w
(/rstor/xzhang/olam/3.0/modules/mem_sflux.f90: 270)
subscript=-1, lower bound=1, upper bound=7689, dimension=1
[cli_27]: aborting job:


What does this error mean? I totally got no clues....
Thank you very much


-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20090701/2c38ead3/attachment.htm>


More information about the mpich-discuss mailing list