<html xmlns:o="urn:schemas-microsoft-com:office:office" xmlns:w="urn:schemas-microsoft-com:office:word" xmlns:m="http://schemas.microsoft.com/office/2004/12/omml" xmlns="http://www.w3.org/TR/REC-html40">
<head>
<meta http-equiv="Content-Type" content="text/html; charset=utf-8">
<meta name="Generator" content="Microsoft Word 15 (filtered medium)">
<style><!--
/* Font Definitions */
@font-face
{font-family:"Cambria Math";
panose-1:2 4 5 3 5 4 6 3 2 4;}
@font-face
{font-family:Calibri;
panose-1:2 15 5 2 2 2 4 3 2 4;}
@font-face
{font-family:Menlo;
panose-1:2 11 6 9 3 8 4 2 2 4;}
/* Style Definitions */
p.MsoNormal, li.MsoNormal, div.MsoNormal
{margin:0cm;
font-size:11.0pt;
font-family:"Calibri",sans-serif;}
a:link, span.MsoHyperlink
{mso-style-priority:99;
color:blue;
text-decoration:underline;}
span.EmailStyle20
{mso-style-type:personal-reply;
font-family:"Calibri",sans-serif;
color:windowtext;}
p.p1, li.p1, div.p1
{mso-style-name:p1;
margin:0cm;
font-size:13.5pt;
font-family:Menlo;
color:black;}
span.s1
{mso-style-name:s1;}
span.apple-converted-space
{mso-style-name:apple-converted-space;}
.MsoChpDefault
{mso-style-type:export-only;
font-size:10.0pt;}
@page WordSection1
{size:612.0pt 792.0pt;
margin:72.0pt 72.0pt 72.0pt 72.0pt;}
div.WordSection1
{page:WordSection1;}
--></style>
</head>
<body lang="EN-GB" link="blue" vlink="purple" style="word-wrap:break-word">
<div class="WordSection1">
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">The PCApply timing on<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">gpu<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="p1"><span class="s1"><span style="font-size:11.0pt">PCApply 6 1.0 1.0235e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 39 0 0 0 0 39 0 0 0 0 0 0 0 0.00e+00 0 0.00e+00 0<o:p></o:p></span></span></p>
<p class="p1"><span class="s1"><span style="font-size:11.0pt"><o:p> </o:p></span></span></p>
<p class="p1"><span class="s1"><span style="font-size:11.0pt">and cpu <o:p></o:p></span></span></p>
<p class="p1"><span style="font-size:11.0pt"><o:p> </o:p></span></p>
<p class="p1"><span class="s1"><span style="font-size:11.0pt">PCApply 6 1.0 1.0242e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 0.0e+00 41 0 0 0 0 41 0 0 0 0 0<o:p></o:p></span></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">are close, so it is hard for me to tell whether hypre is actually running on the GPU.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Best,<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Karthik.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
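<p class="MsoNormal"><span style="mso-fareast-language:EN-US">The comparison of the two PCApply lines above can be made explicit with a short script. This is a minimal sketch, assuming the usual -log_view column order (event name, count, count ratio, max time, ...); the helper name pcapply_time is hypothetical and not part of PETSc. It only quantifies how close the two timings are and does not by itself show which device hypre ran on.<o:p></o:p></span></p>
<pre>
```python
def pcapply_time(log_line):
    """Return the max time in seconds from one -log_view event line.

    Assumed column order: name, count, count ratio, max time, ...
    """
    fields = log_line.split()
    return float(fields[3])


# The two PCApply lines quoted in this message.
gpu_line = ("PCApply 6 1.0 1.0235e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 "
            "0.0e+00 39 0 0 0 0 39 0 0 0 0 0 0 0 0.00e+00 0 0.00e+00 0")
cpu_line = ("PCApply 6 1.0 1.0242e+01 1.0 0.00e+00 0.0 0.0e+00 0.0e+00 "
            "0.0e+00 41 0 0 0 0 41 0 0 0 0 0")

gpu_time = pcapply_time(gpu_line)
cpu_time = pcapply_time(cpu_line)

# Relative difference between the two runs, measured against the CPU run.
rel_diff = abs(gpu_time - cpu_time) / cpu_time

print(f"GPU build PCApply time: {gpu_time:.4f} s")
print(f"CPU build PCApply time: {cpu_time:.4f} s")
print(f"Relative difference:    {rel_diff:.2%}")
```
</pre>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">A relative difference well below one percent, as here, gives no evidence that the preconditioner behaved differently in the two runs.<o:p></o:p></span></p>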
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span style="font-size:12.0pt;color:black">From: </span></b><span style="font-size:12.0pt;color:black">"Chockalingam, Karthikeyan (STFC,DL,HC)" <karthikeyan.chockalingam@stfc.ac.uk><br>
<b>Date: </b>Friday, 8 October 2021 at 14:55<br>
<b>To: </b>Mark Adams <mfadams@lbl.gov><br>
<b>Cc: </b>"petsc-users@mcs.anl.gov" <petsc-users@mcs.anl.gov><br>
<b>Subject: </b>Re: [petsc-users] hypre on gpus<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Thanks Mark, I will try your recommendations.<o:p></o:p></span></p>
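<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Should I also change
</span><span style="font-family:Menlo;color:black">-dm_vec_type</span><span style="mso-fareast-language:EN-US"> to </span><span style="font-family:Menlo;color:black">hypre</span><span style="mso-fareast-language:EN-US">? Currently I have it set to </span><span style="font-family:Menlo;color:black">mpicuda</span><span style="mso-fareast-language:EN-US">.<o:p></o:p></span></p>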
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US">Karthik.<o:p></o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<p class="MsoNormal"><span style="mso-fareast-language:EN-US"><o:p> </o:p></span></p>
<div style="border:none;border-top:solid #B5C4DF 1.0pt;padding:3.0pt 0cm 0cm 0cm">
<p class="MsoNormal"><b><span style="font-size:12.0pt;color:black">From: </span></b><span style="font-size:12.0pt;color:black">Mark Adams <mfadams@lbl.gov><br>
<b>Date: </b>Friday, 8 October 2021 at 14:33<br>
<b>To: </b>"Chockalingam, Karthikeyan (STFC,DL,HC)" <karthikeyan.chockalingam@stfc.ac.uk><br>
<b>Cc: </b>"petsc-users@mcs.anl.gov" <petsc-users@mcs.anl.gov><br>
<b>Subject: </b>Re: [petsc-users] hypre on gpus<o:p></o:p></span></p>
</div>
<div>
<p class="MsoNormal"><o:p> </o:p></p>
</div>
<div>
<p class="MsoNormal">Hypre does not record its flops with PETSc's timers.<o:p></o:p></p>
<div>
<p class="MsoNormal">Configure with and without CUDA and see if the timings change in PCApply.<o:p></o:p></p>
</div>
<div>
<p class="MsoNormal">Hypre does not dynamically switch between CUDA and CPU solves at this time, but you want to use <span style="font-family:Menlo;color:black">-dm_mat_type hypre.</span><o:p></o:p></p>
</div>
<div>
<p class="MsoNormal"><span style="font-family:Menlo;color:black">Mark</span><o:p></o:p></p>
</div>
</div>
<p class="MsoNormal"><o:p> </o:p></p>
<div>
<div>
<p class="MsoNormal">On Fri, Oct 8, 2021 at 6:59 AM Karthikeyan Chockalingam - STFC UKRI <<a href="mailto:karthikeyan.chockalingam@stfc.ac.uk">karthikeyan.chockalingam@stfc.ac.uk</a>> wrote:<o:p></o:p></p>
</div>
<blockquote style="border:none;border-left:solid #CCCCCC 1.0pt;padding:0cm 0cm 0cm 6.0pt;margin-left:4.8pt;margin-top:5.0pt;margin-right:0cm;margin-bottom:5.0pt">
<div>
<div>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Hello,<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">I am trying to run ex45 (in the KSP tutorials) using hypre on GPUs. I have attached the Python configuration file and the -log_view output from running with the command-line options below:<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-family:Menlo;color:black">mpirun -n 2 ./ex45 -log_view -da_grid_x 169 -da_grid_y 169 -da_grid_z 169 -dm_mat_type mpiaijcusparse -dm_vec_type mpicuda -ksp_type
gmres -pc_type hypre -pc_hypre_type boomeramg -ksp_gmres_restart 31 -pc_hypre_boomeramg_strong_threshold 0.7 -ksp_monitor</span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-family:Menlo;color:black"> </span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">The problem was solved and converged, but from the output file I suspect hypre is not running on GPUs, as PCApply and DMCreate do
<b>not</b> record any GPU Mflop/s. However, some events, such as KSPSolve and MatMult, are running on GPUs.<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Can you please let me know if I need to add any extra flag to the attached arch-ci-linux-cuda11-double-xx.py script file to get hypre working on gpus?<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Thanks,<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto">Karthik.<o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"><span style="font-family:Menlo;color:#C00000"> </span><o:p></o:p></p>
<p class="MsoNormal" style="mso-margin-top-alt:auto;mso-margin-bottom-alt:auto"> <o:p></o:p></p>
</div>
</div>
</blockquote>
</div>
</div>
</body>
</html>