<div dir="ltr">Sorry I missed this....<div class="gmail_extra"><br><br><div class="gmail_quote">On Thu, Jul 31, 2014 at 10:44 AM, Stephan Kramer <span dir="ltr"><<a href="mailto:s.kramer@imperial.ac.uk" target="_blank">s.kramer@imperial.ac.uk</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">An update on this. We have been using mark/gamg-zerod branch and it fixes for us the issue with zero diagonals in the coarsened operators (making the sor smoother, but bizarrely not the jacobi smoother fail). </blockquote>
<div><br></div><div>Jacobi tests for zero diagonals.</div><div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">We in fact have some cases where cheby+jacobi does not converge (indefinite pc), but cheby+sor (with the mark/gamg-zerod branch) works well, so we'd be very much interested in getting this (or something similar) merged in master. Maybe the lv[i-Istart]==0.0 thing isn't entirely robust? We'd be happy to contribute.<br>
<br>
As an aside, we also changed the ordering of DOFs as suggested, so that we provide the right block structure to gamg. However, as soon as we actually set the block size (MatSetBlockSizes) the convergence deteriorates substantially (going from ~50 to ~650 iterations). Without setting the block size but with the new ordering, the n/o iterations is roughly the same as before (when our dofs were not interlaced). Any idea what might be going wrong?<br>
</blockquote><div><br></div><div>This should not happen. I am pretty sure you have a bug. If this is elasticity then you could multiply your matrix times each of the null space vectors that you create and verify the null space. If you can remove any Diri BCs then the null space should be an exact null space then that would make it easier to test (just take the norm of A*v_i, i=1:3)</div>
<div> </div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<br>
Cheers<span class="HOEnZb"><font color="#888888"><br>
Stephan</font></span><div class="HOEnZb"><div class="h5"><br>
<br>
<br>
<br>
On 01/04/14 19:17, Jed Brown wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Stephan Kramer <<a href="mailto:s.kramer@imperial.ac.uk" target="_blank">s.kramer@imperial.ac.uk</a>> writes:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
Yes indeed. I've come to realize this now by looking into how smoothed<br>
aggregation with a near null space actually works. We currently have<br>
our dofs numbered the wrong way around (vertices on the inside,<br>
velocity component on the outside - which made sense for other eqns we<br>
solve with the model) so will take a bit of work, but might well be<br>
worth the effort<br>
</blockquote>
<br>
The memory streaming and cache reuse is much better if you interlace the<br>
degrees of freedom. This is as true now as it was at the time of the<br>
PETSc-FUN3D papers. When evaluating the "physics", it can be useful to<br>
pack the interlaced degrees of freedom into a vector-friendly ordering.<br>
<br>
The AMG solve is plenty expensive that you can pack/solve/unpack an<br>
interlaced vector at negligible cost without changing the rest of your<br>
code.<br>
<br>
Mark, should we provide some more flexible way to label "fields"? It<br>
will be more complicated than the present code and I think packing into<br>
interlaced format is faster anyway.<br>
<br>
</blockquote>
<br>
</div></div></blockquote></div><br></div></div>