<div dir="ltr">I would like to put a non-overlapping ASM solve on the GPU. It's not clear that we have a model for this. <div><br></div><div>PCApply_ASM currently pipelines the scater with the subdomain solves. I think we would want to change this and do a 1) scatter begin loop, 2) scatter end and non-blocking solve loop, 3) solve-wait and scatter begging loop and 4) scatter end loop.<br></div><div><br></div><div>I'm not sure how to go about doing this.</div><div> * Should we make a new PCApply_ASM_PARALLEL or dump this pipelining algorithm and rewrite PCApply_ASM?</div><div> * Add a solver-wait method to KSP?</div><div><br></div><div>Thoughts?</div><div><br></div><div>Mark</div></div>