[MOAB-dev] DMMoabLoadFromFile() - parallel performance issue
Grindeanu, Iulian R.
iulian at mcs.anl.gov
Mon Dec 14 18:29:48 CST 2015
Hi Jim,
DMMoab loading just delegates to MOAB's HDF5 reader.
We were seeing performance degradation for larger numbers of tasks, on the order of 8k or 16k tasks.
Can you say something about your:
HDF5 version and configure flags (I assume optimized, production)
machine characteristics (I/O performance)
MPI version, compiler, and MPI-IO version
For more fine-grained profiling, can you use this branch and pass the CPUTIME option when reading?
https://bitbucket.org/fathomteam/moab/branch/iulian07/timing_h5mreader
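As a rough illustration of the CPUTIME suggestion: MOAB read options are passed as a semicolon-separated string, so enabling the timing output amounts to appending CPUTIME to whatever options are already in use. The helper name with_cputime and the example base options below are assumptions for illustration only; the options DMMoab actually assembles internally may differ.

```python
def with_cputime(read_opts: str) -> str:
    """Return a semicolon-separated option string with CPUTIME appended once."""
    # Split on ';' and drop empty fragments (e.g. from a trailing ';').
    opts = [o for o in read_opts.split(";") if o]
    if "CPUTIME" not in opts:
        opts.append("CPUTIME")
    return ";".join(opts)


# Assumed base options for a parallel read; adjust to match the real setup.
base = "PARALLEL=READ_PART;PARTITION=PARALLEL_PARTITION;PARALLEL_RESOLVE_SHARED_ENTS"
print(with_cputime(base))
```

The resulting string would go wherever the user read options are supplied, for example the read-options argument of DMMoabLoadFromFile(), assuming the options are forwarded unchanged to the reader.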
Hanging on a mesh of 30 million nodes definitely looks like a problem.
Do you have empty partitions? I assume not (you would have seen a warning at partitioning).
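A quick way to double-check for empty partitions, sketched under the assumption that the part assignment is available as one integer part ID per element (the function name empty_parts and the sample data are hypothetical, not part of MOAB or mbpart):

```python
from collections import Counter


def empty_parts(part_ids, num_parts):
    """Return the part IDs in range(num_parts) that own no elements."""
    counts = Counter(part_ids)
    # Counter returns 0 for missing keys, so absent parts are caught here.
    return [p for p in range(num_parts) if counts[p] == 0]


# Made-up assignment of 6 elements to 4 parts; part 2 receives nothing.
assignment = [0, 0, 1, 3, 3, 1]
print(empty_parts(assignment, 4))
```

An empty result means every part owns at least one element.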
Iulian
________________________________
From: moab-dev-bounces at mcs.anl.gov [moab-dev-bounces at mcs.anl.gov] on behalf of WARNER, JAMES E. (LARC-D309) [james.e.warner at nasa.gov]
Sent: Monday, December 14, 2015 5:25 PM
To: moab-dev at mcs.anl.gov
Subject: [MOAB-dev] DMMoabLoadFromFile() - parallel performance issue
Hi Vijay & Iulian,
Hope you are doing well! I have a question regarding some strange behavior we’re seeing with the DMMoabLoadFromFile() function…
After doing some recent profiling of our MOAB-based finite element code, we noticed that we are spending a disproportionate amount of CPU time within the DMMoabLoadFromFile() function, and that this time gets larger or stays roughly constant, rather than shrinking, as we increase the number of processors. We also recently attempted a scalability test with ~30M FEM nodes on 500 processors, which hung in DMMoabLoadFromFile() for about 45 minutes before we killed the job. We then re-ran the test on one processor and it made it through successfully in several seconds.
To reproduce the problem we’re seeing, we wrote a test case (attached here) that simply loads a smaller mesh with approximately 16K nodes and prints the run time. When I run the code on an increasing number of processors, I get something like:
NP=1: Time to read file: 0.0416839 [sec.]
NP=2: Time to read file: 1.42497 [sec.]
NP=4: Time to read file: 1.13678 [sec.]
NP=8: Time to read file: 1.0475 [sec.]
…
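For reference, the slowdown implied by the four timings above can be worked out with a few lines of Python. This is only arithmetic on the reported numbers; the variable names are illustrative.

```python
# Reported read times in seconds, keyed by number of processes (NP).
timings = {1: 0.0416839, 2: 1.42497, 4: 1.13678, 8: 1.0475}

t1 = timings[1]
# Slowdown relative to the serial run: values above 1 mean slower than NP=1.
slowdown = {n: t / t1 for n, t in timings.items()}

for n, t in timings.items():
    speedup = t1 / t           # ideal scaling would give speedup == n
    efficiency = speedup / n   # ideal scaling would give efficiency == 1
    print(f"NP={n}: slowdown {slowdown[n]:.1f}x, parallel efficiency {efficiency:.4f}")
```

Every parallel run here is roughly 25 to 34 times slower than the serial read, and the time barely changes between 2 and 8 processes.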
If it is relevant/helpful – we are using the mbpart tool to partition the mesh. Do you have any ideas why we are not seeing scalability here? Any thoughts/tips would be appreciated! Let me know if you would like any more information.
Thanks,
Jim