MPI Failure at line 839 of nonblocking.c (MPI_File_write_all : MPI_ERR_IO: input/output error)
Rob Latham
robl at mcs.anl.gov
Mon Sep 24 14:53:59 CDT 2012
On Wed, Aug 08, 2012 at 02:17:19PM -0600, Jim Edwards wrote:
> I am getting this error from parallel-netcdf using openmpi 1.4.5 and intel
> 12.1.4 and a lustre filesystem. Because this is
> non-blocking I am having a lot of difficulty pinpointing the issue, do you
> have any suggestions? I buffer multiple variables before
> calling the nfmpi_wait_all and if I turn off this buffering functionality
> it appears to work fine. All of this functionality works on several
> other systems so I
> think that it must be an issue lower in the software stack.
Hi Jim. Sorry to resurrect this old thread, especially when there's
not a lot of new information for you.
Openmpi-1.5.2 (i think) contains a big ROMIO re-sync, including some
Lustre collective I/O improvements: your hunch that the problem lies
with a lower level in the software stack (the MPI-IO library) is
entirely consistent with that observation.
==rob
--
Rob Latham
Mathematics and Computer Science Division
Argonne National Lab, IL USA
More information about the parallel-netcdf
mailing list