<div dir="ltr"><div> </div><div>>> If you want to solve systems accurately, you should non-dimensionalize the system prior to discretization. This would mean that</div><div>your C and b have elements in the [1, D] range, where D is the dynamic range of your problem, say 1e4, rather than these huge</div><div>numbers you have now.</div><div><br></div><div>@Matt: We have done non-dimensionalization and the diagonal matrix ranges from 1 to 1e4 now. Still it takes 4-5 iterations to converge for the non-dimensional diagonal matrix. The convergence trend is looking much better now, though:</div><div><br></div><div><p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)">Residual norms for temperature_ solve.</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>0 KSP preconditioned resid norm 4.724547545716e-04 true resid norm 2.529423250889e+00 ||r(i)||/||b|| 4.397759655853e-05</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>1 KSP preconditioned resid norm 6.504853596318e-06 true resid norm 2.197130494439e-02 ||r(i)||/||b|| 3.820021755431e-07</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>2 KSP preconditioned resid norm 7.733420341215e-08 true resid norm 3.539290481432e-04 ||r(i)||/||b|| 6.153556501117e-09</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>3 KSP preconditioned resid norm 6.419092250844e-10 true resid norm 5.220398494466e-06 ||r(i)||/||b|| 9.076400273607e-11</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>4 KSP preconditioned resid norm 5.095955157158e-12 true resid norm 2.484163999489e-08 ||r(i)||/||b|| 4.319070053474e-13</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>5 KSP preconditioned resid norm 6.828200916501e-14 true resid norm 2.499229854610e-10 ||r(i)||/||b|| 4.345264170970e-15</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)">Linear temperature_ solve converged due to CONVERGED_RTOL iterations 5</p><p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><br></p><p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><br></p></div><div>Only when all the equations are scaled individually the convergence is achieved in a single iteration. In the above, all equations are scaled using the same non-dimensional parameter. Do you think this is reasonable or do you expect the diagonal system to converge in a single iteration irrespective of the range of diagonal entries?  </div><div><br></div><div>@Barry: <br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space"><div><blockquote type="cite"><div><div dir="ltr"><div><br></div></div></div></blockquote><div><br></div>   What is the result of -ksp_view on the solve? </div></div></blockquote><div><br></div><p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)">KSP Object: (temperature_) 1 MPI processes</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>type: gmres</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">    </span>restart=30, using Classical (unmodified) Gram-Schmidt Orthogonalization with one step of iterative refinement when needed</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">    </span>happy breakdown tolerance 1e-30</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>maximum iterations=1000, nonzero initial guess</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>tolerances:<span class="gmail-Apple-converted-space">  </span>relative=1e-12, absolute=1e-50, divergence=10000.</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>left preconditioning</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>using PRECONDITIONED norm type for convergence test</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)">PC Object: (temperature_) 1 MPI processes</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>type: shell</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">    </span>IEPSemiImplicitHierarchyIntegrator::helmholtz_precond::Temperature</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>linear system matrix = precond matrix:</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">  </span>Mat Object: 1 MPI processes</p>

<p style="margin:0px;font-stretch:normal;font-size:13px;line-height:normal;font-family:Menlo;color:rgb(252,33,37)"><span class="gmail-Apple-converted-space">    </span>type: shell</p>

<div><span class="gmail-Apple-converted-space" style="color:rgb(252,33,37);font-family:Menlo;font-size:13px">    </span><span style="color:rgb(252,33,37);font-family:Menlo;font-size:13px">rows=1, cols=1</span> </div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space"><div><br></div><div>   The way you describe your implementation it does not sound like standard PETSc practice. </div></div></blockquote><div><br></div><div>Yes, we do it differently in IBAMR. Succinctly, the main solver is a matrix-free one, whereas the preconditioner is a FAC multigrid solver with its bottom solver formed on the coarsest level of AMR grid using PETSc (matrix-based KSP).  </div><div><br></div><div>In the above -ksp_view <span style="color:rgb(252,33,37);font-family:Menlo;font-size:13px">temperature_ is the matrix-free KSP solver and </span><span style="color:rgb(252,33,37);font-family:Menlo;font-size:13px">IEPSemiImplicitHierarchyIntegrator::helmholtz_precond is the FAC preconditioner.</span></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space"><div><br></div><div>With PETSc using a matrix-free operation mA and a matrix from which KSP will build the preconditioner  A one uses  KSPSetOperator(ksp,mA,A); and then just selects the preconditioner with -pc_type xxx  For example to use Jacobi preconditioning one uses -pc_type jacobi (note that this only uses the diagonal of A, the rest of A is never used).</div></div></blockquote><div><br></div><div>We run -pc_type jacobi on the bottom solver of the FAC preconditioner.  </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space"><div><br></div><div>If you wish to precondition mA by fully solving with the matrix A one can use -ksp_monitor_true_residual -pc_type ksp -ksp_ksp_type yyy -ksp_pc_type xxx  -ksp_ksp_monitor_true_residual with, for example, yyy of richardson and xxx of jacobi</div></div></blockquote><div><br></div><div>Yes, this is what we do.  </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left-width:1px;border-left-style:solid;border-left-color:rgb(204,204,204);padding-left:1ex"><div style="word-wrap:break-word;line-break:after-white-space"><div><br></div><div>  Barry</div><div><br></div><div><br></div><div><br></div><div><br><blockquote type="cite"><div><div dir="ltr"><div>To verify that I am indeed solving a diagonal system I printed the PETSc matrix from the preconditioner and viewed it in Matlab. It indeed shows it to be a diagonal system. Attached is the plot of the spy command on the printed matrix. The matrix in binary form is also attached. </div><div><br></div><div>My understanding is that because the C coefficient is varying in 4 orders of magnitude, i.e., Max(C)/Min(C) ~ 10^4, the matrix is poorly scaled. When I rescale my matrix by 1/C then the system converges in 1 iteration as expected. Is my understanding correct, and that scaling 1/C should be done even for a diagonal system?</div><div><br></div><div>When D is non-zero, then scaling by 1/C seems to be very inconvenient as D is stored as side-centered data for the matrix free solver. </div><div><br></div><div>In the case that I do not scale my equations by 1/C, is there some solver setting that improves the convergence rate? (With D as non-zero, I have also tried gmres as the ksp solver in the matrix-based preconditioner to get better performance, but it didn't matter much.)</div><div><br></div><div><br></div><div>Thanks,</div><div>Ramakrishnan Thirumalaisamy</div><div>San Diego State University.</div></div>

<span id="gmail-m_-7898380077486584962cid:f_ku60x9u71"><Temperature_fill.pdf></span><span id="gmail-m_-7898380077486584962cid:f_ku60x9tq0"><matrix_temperature></span></div></blockquote></div><br></div></blockquote></div><br clear="all"><div><br></div>-- <br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div>--Amneet <br><br></div><div><br></div><div><br></div></div></div></div>