[petsc-users] DMPlex with dof in cells AND at nodes
Thibault Bridel-Bertomeu
thibault.bridelbertomeu at gmail.com
Sun Jan 24 10:02:44 CST 2021
Hi everyone,
I have continued working on my problem, and I have made some progress ...
although it is not working in parallel as well as I would like yet.
I have troubles with the normals at the faces : it appears that for a given
mesh, the normals at the faces depend on the partitioning, especially for
those faces that belong to the boundary between two MPI-proc domains. If I
read correctly everything I printed out to try and understand, it seems
that those faces have normals pointing outside of the MPI-proc domain, no
matter what their normal was in the sequential version.
My problem is that my algorithms heavily rely on those normals, and they
have to have consistent orientations ... how can I recover for a given face
belonging to a given MPI-process the normal it would have had in sequential
?
Thank you very much in advance !!
Thibault
Le mer. 13 janv. 2021 à 18:36, Thibault Bridel-Bertomeu <
thibault.bridelbertomeu at gmail.com> a écrit :
> Hello Matt,
>
> Thank you for the follow up.
> Here are the print out for the two DMs as you suggested (the smallest
> tetrahedral mesh possible in a cube).
>
> :: [DEBUG] Visualizing DM in console ::
>
> DM Object: 2 MPI processes
>
> type: plex
>
> DM_0x7faa62d416f0_1 in 3 dimensions:
>
> 0-cells: 14 14
>
> 1-cells: 49 49
>
> 2-cells: 60 60
>
> 3-cells: 36 (12) [12] 36 (12) [12]
>
> Labels:
>
> depth: 4 strata with value/size (0 (14), 1 (49), 2 (60), 3 (36))
>
> celltype: 5 strata with value/size (0 (14), 1 (49), 3 (60), 6 (24), 12
> (12))
>
> Cell Sets: 1 strata with value/size (7 (24))
>
> Face Sets: 6 strata with value/size (1 (4), 2 (4), 3 (4), 4 (4), 5 (4),
> 6 (4))
>
> vtk: 1 strata with value/size (1 (12))
>
> ghost: 2 strata with value/size (2 (12), 1 (27))
>
>
> :: [DEBUG] Visualizing DM_nodal in console ::
>
> DM Object: 2 MPI processes
>
> type: plex
>
> DM_0x7faa62d416f0_3 in 3 dimensions:
>
> 0-cells: 14 14
>
> 1-cells: 49 49
>
> 2-cells: 60 60
>
> 3-cells: 36 (12) [12] 36 (12) [12]
>
> Labels:
>
> depth: 4 strata with value/size (0 (14), 1 (49), 2 (60), 3 (36))
>
> celltype: 5 strata with value/size (0 (14), 1 (49), 3 (60), 6 (24), 12
> (12))
>
> Cell Sets: 1 strata with value/size (7 (24))
>
> Face Sets: 6 strata with value/size (1 (4), 2 (4), 3 (4), 4 (4), 5 (4),
> 6 (4))
>
> vtk: 1 strata with value/size (1 (12))
>
> ghost: 2 strata with value/size (2 (12), 1 (27))
>
> As you predicted, the sum of the numbers of local cells is greater than
> the total number of cells.
>
> I was doing some more digging, and it appears that I have issues because
> the normals of some faces attached to the vertices are not populated.
> In other words, "call DMPlexGetGeometryFVM(dm, facegeom, cellgeom,
> minradius, ierr); CHKERRA(ierr)" yields a facegeom Vec that is a local Vec.
> For the mesh that are described above, that Vec has a global size of 720
> (12 * 60 faces), and also a local size of 720 (12 * 60) because (I think ..
> ?) the mesh is so small, with the distribution chosen, both processes know
> the existence of all the entities (vertices, edges, faces & cells).
> However, for, say MPI proc no.1, a lot of places in this Vec are filled
> with zeros. As I use the normal normalized by the face area, I get NaNs
> because I am trying to divide by zero.
> Now the question is, a vertex on a given MPI proc knows that it has, say,
> 6 faces attached, and I can get that number along with the indices of those
> faces with "call DMPlexGetTransitiveClosure(dmNodal, inode, PETSC_FALSE,
> vertexclosure, ierr); CHKERRA(ierr)", but then when I fetch the face
> characteristics in the facegeom Vec, some are not populated (full of
> zeros). Is this the consequence of what you were explaining, "only the
> point closure is guaranteed when distributing" ?
> Or is there a way to do some equivalent to GlobalToLocal for that facegeom
> Vec ?
>
> Thank you very much for your support,
>
> Thibault
>
> Le mer. 13 janv. 2021 à 15:00, Matthew Knepley <knepley at gmail.com> a
> écrit :
>
>> On Tue, Jan 12, 2021 at 4:12 PM Thibault Bridel-Bertomeu <
>> thibault.bridelbertomeu at gmail.com> wrote:
>>
>>> Good evening Matthew,
>>>
>>> Thank you for your answer, and sorry for the delay in mine ! I am trying
>>> to figure things out but when one gets in the distribution of a DMPlex, it
>>> becomes quite complex!
>>>
>>> Le lun. 11 janv. 2021 à 16:24, Matthew Knepley <knepley at gmail.com> a
>>> écrit :
>>>
>>>> On Mon, Jan 11, 2021 at 6:41 AM Thibault Bridel-Bertomeu <
>>>> thibault.bridelbertomeu at gmail.com> wrote:
>>>>
>>>>> Dear all,
>>>>>
>>>>> I haven't been on this mailing list since december, so first of all,
>>>>> happy new year to everyone !
>>>>>
>>>>> I am facing another challenge with the DMPlex structure : although I
>>>>> am doing cell-centered finite volume (using PetscFV), I also need to do
>>>>> some manipulations and store information that are vertex-based.
>>>>> The problem I have is when I run the code in parallel : when I
>>>>> visualize the nodal data using a vtk viewer, it shows NaN at all the
>>>>> vertices in close proximity to the MPI partition line.
>>>>> Here is how I proceed to work on the vertex-based data :
>>>>>
>>>>> First comes the creation of the DM that I'll be using for the
>>>>> cell-centered FVM
>>>>>
>>>>> call DMPlexCreateFromFile(comm, name, PETSC_TRUE, dm, ierr)
>>>>> call DMSetBasicAdjacency(dm, PETSC_TRUE, PETSC_TRUE, ierr)
>>>>> call DMPlexDistribute(dm, 1, PETSC_NULL_SF, dmDist, ierr)
>>>>> ...
>>>>> dm = dmDist
>>>>> ...
>>>>> call DMConstructGhostCells(dm, ..., ..., dmGhost, ierr)
>>>>> ...
>>>>> dm = dmGhost
>>>>> ...
>>>>>
>>>>> I set both boolean to PETSC_TRUE in the SetBasicAdjacency so even in
>>>>> MPI the neighbouring relationships are conserved for all strata. Is that
>>>>> true ? Like, does doing this actually enable to get all the transitive
>>>>> closure of a given vertex, even if part of it is in another MPI block ?
>>>>>
>>>>
>>>> What you say should be true, but it is not. When I designed everything,
>>>> I was very much stuck in the traditional mesh way of looking at things, and
>>>> made some limiting decisions. You can still get what you want (almost), but
>>>> you need to distribute the mesh with an overlap of 1. If you do, then you
>>>> can get the transitive support of vertices.
>>>>
>>>
>>> I actually already distribute the mesh with an overlap of 1, but the
>>> result is the same. Do I need to re-distribute the clone "dmNodal" somehow
>>> ? Is the cloning process maybe not preserving the MPI support of the
>>> different strata ?
>>>
>>
>> No. However, it is easy to check what you have using -dm_view. I would
>> encourage you to put
>>
>> DMViewFromOptions(dm, NULL, "-dm_view");
>> DMViewFromOptions(dmNodal, NULL, "-nodal_dm_view");
>>
>> so that you can give those options and send the output (for a very simple
>> mesh) with the emails. What you should
>> see is that for both meshes, the number of local cells adds up to more
>> than the number of global cells.
>>
>>
>>> The problem is basically that many places in the code assume that the
>>>> overlap always contains the closure of any point in it. It is difficult to
>>>> be able to switch this assumption to the star (transitive support)so I have
>>>> not done that yet.
>>>>
>>>
>>>>
>>>>> Then I need a vertex-based DM, so I figure it's pretty much what's
>>>>> done in CreateMassMatrix in TS ex11.c.
>>>>>
>>>>> call DMClone(dm, dmNodal, ierr)
>>>>> call DMGetCoordinateSection(dm, coordSection, ierr); CHKERRA(ierr)
>>>>> call DMGetCoordinatesLocal(dm, coordinates, ierr); CHKERRA(ierr)
>>>>> call DMSetCoordinateSection(dmNodal, PETSC_DETERMINE, coordSection,
>>>>> ierr);
>>>>> call PetscSectionCreate(PETSC_COMM_WORLD, sectionNodal, ierr);
>>>>> CHKERRA(ierr)
>>>>> call DMPlexGetDepthStratum(dm, 0, vstart, vend, ierr); CHKERRA(ierr)
>>>>> call PetscSectionSetChart(sectionNodal, vstart, vend, ierr);
>>>>> CHKERRA(ierr)
>>>>> do inode = vstart, vend-1
>>>>> ! 3 dof per node, ndim = 3
>>>>> call PetscSectionSetDof(sectionNodal, inode, ndim, ierr); CHKERRA(ierr)
>>>>> end do
>>>>> call PetscSectionSetUp(sectionNodal, ierr); CHKERRA(ierr)
>>>>> call DMSetLocalSection(dmNodal, sectionNodal, ierr); CHKERRA(ierr)
>>>>> call PetscSectionDestroy(sectionNodal, ierr); CHKERRA(ierr)
>>>>>
>>>>> From what I understood, this new dmNodal basically has all the
>>>>> characteristics of the ori dm, except it's primary layer is vertex-based
>>>>> with 3 DOF per vertex.
>>>>>
>>>>
>>>> Take out the coordinate stuff above. This is now handled internally in
>>>> a better way. Other than that, it looks good.
>>>>
>>>
>>> You mean the SetCoordinateSection is already handled by the cloning
>>> process ?
>>>
>>
>> Yes.
>>
>>
>>>
>>>>
>>>>> Then I get a local vector and work on it :
>>>>>
>>>>> call DMGetLocalVector(dmNodal, nodalvec, ierr)
>>>>> call VecGetArrayF90(nodalvec, nodalvecarray, ierr)
>>>>> call DMPlexGetDepthStratum(dm, 0, vstart, vend, ierr)
>>>>> call DMPlexGetDepthStratum(dm, 2, fstart, fend, ierr)
>>>>> ! For each node ...
>>>>> do inode = vstart, vend-1
>>>>> A = 0 ! matrix, ndim x ndim
>>>>> b = 0 ! vector, ndim
>>>>> call DMPlexGetTransitiveClosure(dmNodal, inode, PETSC_FALSE,
>>>>> vertexclosure, ierr);
>>>>> ! For each face attached to that node
>>>>> do iface = 1, size(vertexclosure)
>>>>> if ((vertexclosure(iface) < fstart) .or. (vertexclosure(iface) >=
>>>>> fend)) cycle
>>>>> A = A + ...
>>>>> b = b + ...
>>>>> end do
>>>>> nodalvecarray(1+(inode-vstart)*ndim:3+(inode-vstart)*ndim) = matmul(A,
>>>>> b)
>>>>> ...
>>>>> end do
>>>>>
>>>>
>>>> There are some subtleties here I think. You are looping over all
>>>> vertices, but you only own some of them.
>>>>
>>>
>>> Here I am quite surprised by your remark : from the random print I did,
>>> the DMPlexGetDepthStratum returns different vstart and vend for different
>>> MPI processes, hence I assumed it was a local numbering.
>>>
>>
>> It is a local numbering. If you have a mesh partition, it is usually to
>> have vertices present in multiple local meshes. We think of one process as
>> "owning" the vertex. That is what I meant here.
>>
>>
>>> And then I assumed that, with the DMPlexDistribute, it ensured that the
>>> "vertexclosure"
>>>
>>
>> This is the wrong terminology. Closures mean including the boundary of
>> something. Vertices have no boundary. You want the dual notion "support",
>> or its transitive closure "star". So is the vertex star included when
>> distributing? No, only the point closure is guaranteed when distributing.
>> This was the point of my explanation. However, if you request overlap = 1,
>> then you get everything in the star of the original boundary.
>>
>>
>>> contained (among others) all the faces connected to each "inode", even
>>> if the closure is spread among multiple MPI processes. So you mean that all
>>> these assumptions are actually wrong to some degree ?
>>>
>>>
>>>> What do you do when putting these values in the global vector? I think
>>>> there are at least three choices:
>>>>
>>>> 1) Everyone has everything, so you compute all the vertex stuff
>>>> redundantly, and then use INSERT_VALUES for the scatter
>>>>
>>>> 2) Only vertices are redundant, so the transitive supports of vertices
>>>> are partial but non-overlapping, and you use ADD_VALUES for the scatter
>>>>
>>>> 3) Only cells are redundant, so you do the complete vertex computation
>>>> only for local vertices
>>>>
>>>
>>> I distributed with overlap = 1, so I expect I have a halo of redundant
>>> cells in each MPI block (option no. 3).
>>>
>>
>> Yes.
>>
>>
>>> With the (..., PETSC_TRUE, PETSC_TRUE, ...) setting of the distribute
>>> however, I also expected to have a halo of vertices and a halo of edges and
>>> a halo of faces. Therefore, completing the vertex computation only for
>>> local vertices should have been fine since anyways even local vertices know
>>> their complete closure in terms of faces,
>>>
>>
>> Yes, this should work.
>>
>>
>>> but I am missing something because it does not turn out right ...
>>>
>>
>> Start with a very simple mesh, say 2 cells or 4 cells. Then you can print
>> everything out.
>>
>> Thanks,
>>
>> Matt
>>
>>
>>>
>>>>
>>>>> Then I want to visualize it :
>>>>>
>>>>> call VecRestoreArrayF90(nodalvec, nodalvecarray, ierr)
>>>>> call DMCreateGlobalVector(dmNodal, globalnodalvec, ierr)
>>>>> call DMLocalToGlobal(dmNodal, nodalvec, INSERT_VALUES, globalnodalvec,
>>>>> ierr)
>>>>> call PetscViewerCreate(PETSC_COMM_WORLD, vtkViewer, ierr);
>>>>> CHKERRA(ierr)
>>>>> call PetscViewerSetType(vtkViewer, PETSCVIEWERVTK, ierr); CHKERRA(ierr)
>>>>> call PetscViewerFileSetName(vtkViewer, "toto.vtk", ierr); CHKERRA(ierr)
>>>>> call VecView(globalnodalvec, vtkViewer, ierr); CHKERRA(ierr)
>>>>> call PetscViewerDestroy(vtkViewer, ierr); CHKERRA(ierr)
>>>>>
>>>>> In the toto.vtk file, I get vertex-based values as expected, and the
>>>>> whole shebang works in sequential. Now when I run it in parallel, I get
>>>>> vertex-based values inside the MPI blocks, but in the neighbourhood of the
>>>>> join between the MPI blocks, all the vertex-based values are NaN (see
>>>>> attachment).
>>>>>
>>>>
>>>> If you have redundant cells, meaning you distributed with overlap = 1,
>>>> did you remember to full up the values for those ghost cells using
>>>> DMGlobalToLocal() or something like that? If not, they will start as
>>>> NaNs.
>>>>
>>>
>>> Indeed, in the vertex computation, at some point I use the values in the
>>> cells. But I got the solution vector from TSGetSolution, then I do a
>>> GlobalToLocal, then I even add a InsertBoundaryValues to fill the ghost
>>> cells correctly.
>>> Even by doing this, when I print the nodalvecarray on each process, they
>>> somehow contain some NaNs. This means, I think, that at some point, a given
>>> vertex tries to get the information of a face that is supposed to be in its
>>> closure but is actually not .. (?)
>>>
>>> Thank you for your help and feedback !
>>>
>>> Cheers,
>>>
>>> Thibault
>>>
>>>
>>>>
>>>> Thanks,
>>>>
>>>> Matt
>>>>
>>>>
>>>>> All in all, I think I do not understand fully the consequences of the
>>>>> DMPlexDistribute with useCone=true & useClosure=true ... but I cannot
>>>>> figure out where the NaN are coming from : is it because the vertices do
>>>>> not actually know all their closure through the MPI join, or is it some
>>>>> Global / Local error I made ?
>>>>>
>>>>> Thank you very much in advance for your support !!
>>>>>
>>>>> Thibault
>>>>>
>>>>
>>>>
>>>> --
>>>> What most experimenters take for granted before they begin their
>>>> experiments is infinitely more interesting than any results to which their
>>>> experiments lead.
>>>> -- Norbert Wiener
>>>>
>>>> https://www.cse.buffalo.edu/~knepley/
>>>> <http://www.cse.buffalo.edu/~knepley/>
>>>>
>>>
>>
>> --
>> What most experimenters take for granted before they begin their
>> experiments is infinitely more interesting than any results to which their
>> experiments lead.
>> -- Norbert Wiener
>>
>> https://www.cse.buffalo.edu/~knepley/
>> <http://www.cse.buffalo.edu/~knepley/>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20210124/5c547d74/attachment-0001.html>
More information about the petsc-users
mailing list