[mpich-discuss] File I/O causing collective abort of all ranks

Rajeev Thakur thakur at mcs.anl.gov
Tue Sep 23 13:06:47 CDT 2008


It sure is odd. Can you run with just one process to see if it still fails.
(mpiexec -n 1 a.out) If it does, try a simple sequential program that just
does I/O and run it as a.out to see if there is a problem with doing I/O
from that machine.

Rajeev
 

> -----Original Message-----
> From: owner-mpich-discuss at mcs.anl.gov 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Brian Harker
> Sent: Tuesday, September 23, 2008 12:14 PM
> To: mpich-discuss at mcs.anl.gov
> Subject: [mpich-discuss] File I/O causing collective abort of 
> all ranks
> 
> Hello list-
> 
> I have a problem with process 0 being able to open a file for writing
> and subsequently write to it.  The pertinent section of code looks as
> follows:
> 
> ========================================
> if ( proc_id == 0 ) then
> 
>   open( unit = 1, file = "fubar.dat", status="new" )
>   do i = 1, ny
>     write(1,*) ( array(i,j), i = 1, nx )
>   end do
>   close(1)
> 
> end if
> ========================================
> 
> When this part of the code is reached, the program seems to hang for a
> long time while trying to open the file, then spits out the following
> error message:
> 
> rank 0 in job 11  $HOSTNAME_#####  caused collective abort of 
> all ranks
>    exit status of rank 0: killed by signal 9
> 
> I am confused about this error, because it is seemingly isolated to
> this particular write-to-file by process 0.  During execution, my
> slave processes write out other files using this exact same syntax.
> Has anyone run across this?  I can't seem to find any useful
> information on the interweb.  I have run into this problem with both
> MPICH2-1.0.6p1 and MPICH2-1.0.7.  I am using the Intel fortran
> compiler, ifort 10.1.012.
> 
> Thanks in advance for any input!
> 
> 
> 
> -- 
> Cheers,
> Brian
> brian.harker at gmail.com
> 
> 
> "In science, there is only physics; all the rest is stamp-collecting."
>  -Ernest Rutherford
> 
> 




More information about the mpich-discuss mailing list