<div class="gmail_quote">On Tue, May 8, 2012 at 7:32 AM, Stefano Zampini <span dir="ltr"><<a href="mailto:stefano.zampini@gmail.com" target="_blank">stefano.zampini@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<div class="gmail_quote"><div class="im"><blockquote class="gmail_quote" style="margin:0pt 0pt 0pt 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div>MatSetValues() is fine since only the global ordering is used. MatSetValuesLocal() would ideally continue to work with the old local space because the user has to redistribute their mesh to use the new ordering (maybe there should be a translation filter).</div>


<div><br></div></blockquote></div><div><br>What do you mean for "translation filter"?<br></div></div></blockquote><div><br></div><div>Translating a set of local indices for a different partition, but I misunderstood what you were suggesting.</div>

<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="gmail_quote"><div> </div><div class="im"><blockquote class="gmail_quote" style="margin:0pt 0pt 0pt 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">


<div></div><div>But how does this function actually work? Can the subdomain boundaries move, or are subdomains just moved to a different process?</div><div><br></div></blockquote></div><div><br>Asking the first question: <br>

The function works by first reordering local nodes with INTERNAL and INTERFACE nodes (if the second argument LocalIS is not PETSC_NULL), in order to be able to use memcpy when using local scatters of PCIS (e.g. pcis->N_to_B). Then, it reorders the global numbering (if GlobalIS is not PETSC_NULL) by exploiting data locality in the global vector and reduce the amount of communications when scattering from global to local and viceversa; in practice, a numbering of global nodes is constructed in such a way that the greatest number of subdomain INTERNAL nodes will belong to the local part of the global vector. Then, remaining INTERNAL nodes (notes that if the parallel problem is well balanced, this will not occur) and INTERFACE nodes are placed in the remaining global nodes, keeping them sorted as they were in the previous global numbering. I think the latter part  can be further modified. When I'll add to matis.c, maybe you can take a look at the code and give me your suggestions. <br>

</div></div></blockquote><div><br></div><div>Okay, this is different from what I thought you were describing. Moving the global dofs is a tricky thing because the vectors we want to solve against and interpret results for come from a higher level. Changing the global ordering really produces a different matrix (MatMult is different), so it's not "optimization". If the linear operator stayed the same, but you just changed the format, perhaps we could call it "optimize".</div>

<div><br></div><div>Are you aware of PCRedistribute? It could be made to perform the transformation you are thinking about for MATIS, solving the problem in the "optimized" ordering, and mapping the solution back to the original ordering.</div>

<div><br></div><div>But why is this needed in the first place? Don't people who use MATIS also create a reasonable global ordering at setup time? It's bad for other parts of the algorithm to have a global numbering that doesn't line up well with the partition of elements.</div>

<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="gmail_quote"><div>

<br>Regarding the other question: <br>With MATIS (and in principle every subassembled matrix) you are forced to use the local matrices assembled by the user or an assembled group of them (as you surely know since you are developing a PA). You cannot safely (in a theoretical way) move subdomain boundaries since you don't know (at least in principle) how to reproduce the matrix values on the local interface since they are neumann boundaries for the local PDE. The main question is: how can I split the assembled values on the nodes which previously were internal to the subdomain?</div>

</div></blockquote><div><br></div><div>This can't be done, at least without more information or making assumptions about the discretization. That's why I asked.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">

<div class="gmail_quote"><div> (I think this is the main question when thinking to a MatConvert implementation which converts from a matrix obtained by DMDAGetMatrix to a MATIS). <br></div></div></blockquote><div><br></div>

<div>There is no reason why DMCreateMatrix() cannot return a MATIS. In any case, that's a better model than assembling in one format and trying to convert, especially since conversion really can't done in this direction (a MATIS can be converted to MATAIJ, not the other way).</div>

<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div class="gmail_quote"><div>

 </div><div class="im"><blockquote class="gmail_quote" style="margin:0pt 0pt 0pt 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div></div><div><br></div><div>For a different, but related algorithm, I'm considering writing a MatPA (PA = partially assembled) which would choose some points to assemble internally. The reason for storing in partially assembled form is that I will want to solve a number of pinned Neumann problems to determine the coarse space and I'd rather not have to take unnecessary submatrices. Do you think this is a bad idea and I should just use MatIS? (I also want support for multiple subdomains per process and multiple processes per subdomain, though that could be added to MatIS.)</div>


</blockquote></div></div><br>Cool. Are you implementing FETI-DP? I'm planning to implement it in the very next future for isogeometric analysis. Asking to your question, it depends on how you need to solve these "pinned Neumann problems": you need to solve unrelated local problems defined by the local matrices or solve a parallel Neumann problem? In the first case, MatIS is your choice. In the other case, I think you need to resort to a multigrid for a new MATIS object defined by the subdomains of interest.<br clear="all">


However, I will be glad to help you with PA if you want.</blockquote></div><br><div>I'm not that interested in FETI-DP since it's basically the same as BDDC and doesn't fix the exponential dependence on the number of levels. Also, I hate that it doesn't work well for irregular coefficients on subdomain boundaries. So I have a different algorithm in mind, more naturally viewed as a multigrid or multilevel Schwarz, but with aggressive coarsening enabled by the subdomains. One part of the method is compatible relaxation, for which pinned Neumann problems are used. The idea is to start with just enough pinning to make the subdomains non-singular, then enrich the coarse space by adaptively selecting vertices or edge/face moments that are not well controlled (as determined by compatible relaxation). Once the coarse injection is found, we construct the coarse basis functions by harmonic extension (again using pinned Neumann problems). With the multigrid hierarchy is constructed, the smoother would be based on overlapping subdomains or a multi-stage smoother involving both pinned Neumann problems and surface strips (or perhaps just point-block Jacobi). The problem with only using the pinned Neumann problems in the smoother seems to be that we can't remove the exponential dependence without allowing the smoother to move the injection (compatible relaxation is good for telling us whether a coarse space is good, but not actually the right thing for multigrid convergence).</div>