mpi error from ncmpi_end_indep_data
Wei-keng Liao
wkliao at eecs.northwestern.edu
Tue Dec 30 14:14:46 CST 2014
Hi, Jim
Just a reminder, all PnetCDF nonblocking APIs can be called in both
collective and independent data modes. Only the wait APIs must be called
in its data mode, i.e. ncmpi_wait() in independent mode and ncmpi_wait_all()
in collective mode. So, you can safely remove the begin and end indep_data()
in PIO. In particular, ncmpi_end_indep_data() can be expensive and should
be avoided if possible.
Wei-keng
On Dec 30, 2014, at 1:12 PM, Jim Edwards wrote:
> Okay great - thank you!
>
> On Tue, Dec 30, 2014 at 12:11 PM, Wei-keng Liao <wkliao at eecs.northwestern.edu> wrote:
> Hi, Jim
>
> After I move the call to ncmpi_wait() to after ncmpi_end_indep_data()
> I can reproduce the error.
>
> The problem has been fixed in the latest revision of PnetCDF.
>
>
> Wei-keng
>
> On Dec 30, 2014, at 12:13 PM, Wei-keng Liao wrote:
>
> > Hi, Jim
> >
> > I was not able to reproduce the error. Could you try the following program?
> > Also, could you add an error checking for the ncmpi APIs before ncmpi_end_indep_data.
> > I wonder if there is an error returned from one of those APIs.
> >
> > #include <stdio.h>
> > #include <stdlib.h>
> > #include <mpi.h>
> > #include <pnetcdf.h>
> >
> > #define NY 4
> > #define NX 10
> > #define NDIMS 2
> >
> > #define ERR \
> > if (err != NC_NOERR) { \
> > printf("Error at line=%d: %s\n", __LINE__, ncmpi_strerror(err)); \
> > }
> >
> > int main(int argc, char** argv)
> > {
> > int rank, nprocs, err;
> > int ncid, varid, dimid[2], req, st;
> > MPI_Offset start[2], count[2], stride[2];
> > unsigned char buffer[NY][NX];
> >
> > MPI_Init(&argc, &argv);
> > MPI_Comm_rank(MPI_COMM_WORLD, &rank);
> > MPI_Comm_size(MPI_COMM_WORLD, &nprocs);
> >
> > err = ncmpi_create(MPI_COMM_WORLD, "testfile.nc", NC_CLOBBER|NC_64BIT_DATA,
> > MPI_INFO_NULL, &ncid);
> > ERR
> >
> > err = ncmpi_def_dim(ncid, "Y", NY, &dimid[0]); ERR
> > err = ncmpi_def_dim(ncid, "X", NX*nprocs, &dimid[1]); ERR
> > err = ncmpi_def_var(ncid, "var", NC_UBYTE, NDIMS, dimid, &varid); ERR
> > err = ncmpi_enddef(ncid); ERR
> >
> > start[0] = 0; start[1] = NX*rank;
> > count[0] = NY/2; count[1] = NX/2;
> > stride[0] = 2; stride[1] = 2;
> > err = ncmpi_buffer_attach(ncid, NY*NX); ERR
> >
> > err = ncmpi_begin_indep_data(ncid); ERR
> > err = ncmpi_bput_vars_uchar(ncid, varid, start, count, stride,
> > &buffer[0][0], &req);
> > ERR
> > err = ncmpi_wait(ncid, 1, &req, &st); ERR
> > err = ncmpi_end_indep_data(ncid); ERR
> >
> > err = ncmpi_buffer_detach(ncid); ERR
> > err = ncmpi_close(ncid); ERR
> >
> > MPI_Finalize();
> > return 0;
> > }
> >
> >
> > Wei-keng
> >
> > On Dec 30, 2014, at 10:31 AM, Jim Edwards wrote:
> >
> >> Hi Wei-keng,
> >>
> >> I have a code block that looks like:
> >>
> >> ncmpi_begin_indep_data(file->fh);
> >>
> >> usage = 0;
> >>
> >> if(ios->io_rank==file->indep_rank){
> >>
> >> ierr = ncmpi_bput_vars_uchar(file->fh, varid, start, count, stride, op, &request);;
> >>
> >> pio_push_request(file, request);
> >>
> >> ierr = ncmpi_inq_buffer_usage(ncid, &usage);
> >>
> >> // printf("%s %d %d\n",__FILE__,__LINE__,usage);
> >>
> >> }
> >>
> >>
> >> ncmpi_end_indep_data(file->fh);
> >>
> >>
> >> It's generating an error message on the ncmpi_end_indep_data call:
> >> MPI_FILE_WRITE_AT(83): File does not exist
> >>
> >> Regardless of the error message the write seems to work fine, any idea what's causing this?
> >>
> >> I'm using pnetcdf svn revision 1920 with intel 15 compiler and mpich.
> >>
> >> - Jim
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >>
> >> --
> >> Jim Edwards
> >>
> >> CESM Software Engineer
> >> National Center for Atmospheric Research
> >> Boulder, CO
> >
>
>
>
>
> --
> Jim Edwards
>
> CESM Software Engineer
> National Center for Atmospheric Research
> Boulder, CO
More information about the parallel-netcdf
mailing list