[MOAB-dev] [Fathom] Parallel read issues: mbconvert vs mbparallelcomm_test (PARALLEL_RESOLVE_SHARED_ENTS)
Tim Tautges
tautges at mcs.anl.gov
Tue Apr 6 12:36:53 CDT 2010
Resolving shared entities for a broadcast doesn't make any sense. In this read method, all processors have the
exact same mesh, so all entities will be shared. Maybe in principle this should work anyway, but it's pretty far
outside the mainstream.
- tim
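To make the point above concrete, here is a toy model in Python. It is not MOAB code and does not reflect how MOAB resolves shared entities; the mesh, the partition, and the helper names (vertices_of, shared_vertices) are all made up for illustration. It compares a broadcast-style read, where every rank keeps the whole mesh, with a partitioned read, where each rank keeps only its own part.

```python
# Toy model (NOT MOAB code): a 1-D chain of 4 elements over 5 vertices,
# split into two parts, to show which vertices count as "shared".

elements = {0: (0, 1), 1: (1, 2), 2: (2, 3), 3: (3, 4)}  # element id -> vertex ids
partition = {0: [0, 1], 1: [2, 3]}                        # rank -> element ids in its part


def vertices_of(elem_ids):
    """Return the set of vertex ids used by the given elements."""
    return {v for e in elem_ids for v in elements[e]}


def shared_vertices(local):
    """Vertices held by more than one rank, given rank -> local vertex set."""
    ranks = sorted(local)
    shared = set()
    for i, a in enumerate(ranks):
        for b in ranks[i + 1:]:
            shared |= local[a] & local[b]
    return shared


# Broadcast-style read: every rank keeps the whole mesh.
bcast_local = {rank: vertices_of(elements) for rank in partition}

# Partitioned read: each rank keeps only the elements in its own part.
part_local = {rank: vertices_of(elems) for rank, elems in partition.items()}

bcast_shared = shared_vertices(bcast_local)
part_shared = shared_vertices(part_local)

print(sorted(bcast_shared))  # [0, 1, 2, 3, 4]
print(sorted(part_shared))   # [2]
```

In the broadcast case every vertex is "shared", so there is no interface left to resolve; in the partitioned case only the vertex on the boundary between the two parts is shared.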
Dmitry Karpeev wrote:
> Hi,
> It appears that some of the problems I had earlier with reading meshes
> in parallel have to do with
> the resolution of shared entities. mbconvert typically read the files
> in without a problem, while mbparallelcomm_test
> refused to read some files but not others, etc. (it was confusing).
>
> I stripped off most of the mbconvert code to make a simple
> benchmarking tool (tools/mbparread).
> Unlike mbparallelcomm_test, mbparread (and mbconvert, from which it
> derives) does not set any parallel reader
> options by default. When reintroducing them one by one, I stumbled
> upon the following issue:
> feeding PARALLEL_RESOLVE_SHARED_ENTS can cause mbparread to fail.
> It appears that problems are more complicated when reading
> tjunc6RIB_16.h5m, but with blocks-64.h5m
> it is more straightforward (this is a 2 proc version on my laptop):
>
> With PARALLEL=BCAST:
> ------------------------------------------------------------------------------------------------------------------------------------------------------
> [hal Mon Apr 05 05:37 PM] ~/fathom/moab/trunk
>> $MPI2_PREFIX/bin/mpiexec -n 2
> /home/karpeev/fathom/moab/unstable/build/tools/mbparread -O
> PARTITION_DISTRIBUTE -O CPUTIME -O PARTITION=MATERIAL_SET -O
> PARALLEL=BCAST /home/karpeev/fathom/moab/data/blocks-64.h5m
> Read times: 0.448433 0.00270009 5.96046e-06 0.892087 (PARALLEL
> READ/PARALLEL CHECK_GIDS_SERIAL/PARALLEL GET_FILESET_ENTS/PARALLEL
> BROADCAST/)
> [proc 0 of 2]: Read "/home/karpeev/fathom/moab/data/blocks-64.h5m"
> [proc 1 of 2]: Read "/home/karpeev/fathom/moab/data/blocks-64.h5m"
>
> With PARALLEL=BCAST and PARALLEL_RESOLVE_SHARED_ENTS
> ------------------------------------------------------------------------------------------------------------------------------------------------------
> [hal Mon Apr 05 05:38 PM] ~/fathom/moab/trunk
>> $MPI2_PREFIX/bin/mpiexec -n 2
> /home/karpeev/fathom/moab/unstable/build/tools/mbparread -O
> PARTITION_DISTRIBUTE -O CPUTIME -O PARTITION=MATERIAL_SET -O
> PARALLEL=BCAST -O PARALLEL_RESOLVE_SHARED_ENTS
> /home/karpeev/fathom/moab/data/blocks-64.h5m
> Failed to load "/home/karpeev/fathom/moab/data/blocks-64.h5m".
> Error code: MB_ENTITY_NOT_FOUND (4)
> Error message: Failed in step PARALLEL RESOLVE_SHARED_ENTS
>
>
> Failed to load "/home/karpeev/fathom/moab/data/blocks-64.h5m".
> Error code: MB_ENTITY_NOT_FOUND (4)
> Error message: Failed in step PARALLEL RESOLVE_SHARED_ENTS
> ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------
>
> The other read modes (BCAST_DELETE, READ_DELETE, READ_PART) appear to
> work fine with and without PARALLEL_RESOLVE_SHARED_ENTS.
> Any ideas what may be going on?
> The mesh files I used can be found in ~karpeev/fathom/moab/data on MCS
> machines and on cosmea.
>
> Thanks.
> Dmitry.
--
================================================================
"You will keep in perfect peace him whose mind is
steadfast, because he trusts in you." Isaiah 26:3
Tim Tautges Argonne National Laboratory
(tautges at mcs.anl.gov) (telecommuting from UW-Madison)
phone: (608) 263-8485 1500 Engineering Dr.
fax: (608) 263-4499 Madison, WI 53706