performance issue

Wei-Keng Liao wkliao at northwestern.edu
Thu Aug 10 12:09:28 CDT 2023


A file striping size of 128 MB seems too big to me.
I have never tried anything larger than 16 MB, and I have
learned that large sizes often perform worse.
striping_unit=134217728

Have you tried just 1 MB?
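
For reference, the striping_unit hint is given in bytes, so the sizes discussed here map to hint values as in the sketch below. This is only an illustrative conversion; the helper names (mib_to_bytes, bytes_to_mib) are hypothetical and not part of PnetCDF or MPI-IO, and it assumes "MB" in this thread means 2^20 bytes, which matches the value 134217728.

```python
# Illustrative helpers (hypothetical names) for converting striping sizes.
# Assumption: "MB" in this thread means MiB, i.e. 1024 * 1024 bytes.
MIB = 1024 * 1024


def mib_to_bytes(mib: int) -> int:
    """Return the byte count to use as a striping_unit value."""
    return mib * MIB


def bytes_to_mib(nbytes: int) -> float:
    """Return the size in MiB of a striping_unit value given in bytes."""
    return nbytes / MIB


if __name__ == "__main__":
    # The value currently in use, taken from the hint string in this thread.
    print("current striping_unit in MiB:", bytes_to_mib(134217728))
    # Candidate sizes mentioned in this message: 1 MB, 16 MB, and 128 MB.
    for size in (1, 16, 128):
        print(f"striping_unit={mib_to_bytes(size)}  ({size} MiB)")
```

Trying 1 MB would therefore mean setting striping_unit=1048576 in place of 134217728.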

Wei-keng

On Aug 10, 2023, at 12:01 PM, Jim Edwards <jedwards at ucar.edu> wrote:



On Thu, Aug 10, 2023 at 10:53 AM Wei-Keng Liao <wkliao at northwestern.edu> wrote:
From the file header dump you sent earlier, no PnetCDF hint is necessary.

What MPI-IO hints are you using?
I am now using:
MPICH_MPIIO_HINTS=*:romio_cb_read=enable:romio_cb_write=enable:striping_factor=48:striping_unit=134217728

We explored the parameter space, and this seems to be about the best for this particular file. But I think that there is still something
wrong at a low level, and I am working on trying to figure that out.
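
When exploring the parameter space, it can help to generate the hint string programmatically rather than editing it by hand. The sketch below is a minimal example that assumes the single-entry layout shown above (a file-name pattern followed by colon-separated key=value pairs); parse_hints and format_hints are hypothetical helper names, not part of any MPI library, and multi-entry hint strings are not handled.

```python
# Minimal sketch: split and rebuild a hint string of the form
#   pattern:key=value:key=value...
# Assumption: a single pattern entry, as in the string quoted in this thread.
from typing import Dict, Tuple


def parse_hints(spec: str) -> Tuple[str, Dict[str, str]]:
    """Split a hint string into its file pattern and a dict of hints."""
    pattern, *pairs = spec.split(":")
    hints = dict(pair.split("=", 1) for pair in pairs)
    return pattern, hints


def format_hints(pattern: str, hints: Dict[str, str]) -> str:
    """Rebuild the hint string, keeping the hints in insertion order."""
    return ":".join([pattern] + [f"{key}={value}" for key, value in hints.items()])


if __name__ == "__main__":
    spec = ("*:romio_cb_read=enable:romio_cb_write=enable"
            ":striping_factor=48:striping_unit=134217728")
    pattern, hints = parse_hints(spec)
    # Try a 1 MiB stripe size while leaving the other hints unchanged.
    hints["striping_unit"] = str(1024 * 1024)
    print("MPICH_MPIIO_HINTS=" + format_hints(pattern, hints))
```

The printed line can be exported in the job script before each run, so that only striping_unit (or striping_factor) changes between experiments.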


One question. Did you delete the output file before each run?

Yes



Wei-keng

On Aug 10, 2023, at 11:35 AM, Jim Edwards <jedwards at ucar.edu> wrote:

We are having a lot of problems with Darshan on our system - I'm working with the Darshan developers to resolve them.

I am setting MPI-IO hints, but I see that there are some hints specific to PnetCDF - do you have any recommendations?

On Thu, Aug 10, 2023 at 10:15 AM Wei-Keng Liao <wkliao at northwestern.edu> wrote:
Hi, Jim

FYI, Darshan can now capture the I/O activities of PnetCDF,
in addition to MPI-IO and POSIX I/O, which it already captured.

Wei-keng

On Aug 9, 2023, at 6:22 PM, Wei-Keng Liao <wkliao at northwestern.edu> wrote:

In that case, I have the E3SM-IO benchmark, which has a fairly complicated I/O
partitioning pattern. It uses the decomposition maps generated from PIO.
https://github.com/Parallel-NetCDF/E3SM-IO

Wei-keng

On Aug 9, 2023, at 6:17 PM, Jim Edwards <jedwards at ucar.edu> wrote:

I think that your example case is too simple - it's doing a simple block decomposition.
In order to get the performance difference I am observing, I need to do a more complicated
mapping. I will work on a program that reproduces the problem without PIO, but it may take a
while.





--
Jim Edwards

CESM Software Engineer
National Center for Atmospheric Research
Boulder, CO



