<div dir="ltr"><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000">Barry,</div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000">We have a macro-ized basefile to produced all 4 precisions for SuperLU_DIST. In the past, we didn't generate single precision code, since there was no request. </div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000">Since now we started working on mixed precision algorithms. I generated single precision code a few weeks ago, and fixed some bugs on the way. A few days ago, I nailed down a final bug (hopefully) related to single precision SGEMM or STRSM BLAS problems on GPU (i.e., single precision cuBLAS error). All I know is that CPU code is working on Summit, but it has computation errors after I enable GPU. I plan to do some more investigation on this before asking Nvidia folks.</div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000">I am busy this week, but should get to it next week. I suggest you wait for me to clear this, instead of duplicating the work.</div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000"><br></div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000">Sherry</div><div class="gmail_default" style="font-family:verdana,sans-serif;font-size:small;color:#000000"><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">On Tue, Jul 21, 2020 at 5:22 PM Barry Smith <<a href="mailto:bsmith@petsc.dev">bsmith@petsc.dev</a>> wrote:<br></div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><br>
Felix,<br>
<br>
What are your needs, do you want this for CPUs or for GPUs? Do you wish to run all your code in single precision or just the SuperLU_Dist solver while the rest of your code double?<br>
<br>
If you want to run everything on CPUs using single precision then adding the support is very easy, we can provide that for you any time. The other cases will require more thought.<br>
<br>
Barry<br>
<br>
<br>
> On Jul 21, 2020, at 8:58 AM, <a href="mailto:flw@rzg.mpg.de" target="_blank">flw@rzg.mpg.de</a> wrote:<br>
> <br>
> Dear PETSc support team,<br>
> some time ago you told me that you are planning on releasing a version that supports SUPERLU_DIST in single-precision soon. Can you tell me roughly what time frame you had in mind?<br>
> <br>
> Best regards,<br>
> Felix<br>
> <br>
<br>
</blockquote></div>