<div dir="auto">Derek<div dir="auto"><br></div><div dir="auto">I have fixed the optimized plan few weeks ago</div><div dir="auto"><br></div><div dir="auto"><a href="https://bitbucket.org/petsc/petsc/commits/c3caad8634d376283f7053f3b388606b45b3122c">https://bitbucket.org/petsc/petsc/commits/c3caad8634d376283f7053f3b388606b45b3122c</a><br></div><div dir="auto"><br></div><div dir="auto">Maybe this will fix your problem too?</div><div dir="auto"><br></div><div dir="auto">Stefano</div><div dir="auto"><br></div></div><br><div class="gmail_quote"><div dir="ltr" class="gmail_attr">Il Gio 21 Mar 2019, 04:21 Zhang, Junchao via petsc-users <<a href="mailto:petsc-users@mcs.anl.gov">petsc-users@mcs.anl.gov</a>> ha scritto:<br></div><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex">
<div>
<div dir="ltr">
<div dir="ltr">Hi, Derek,
<div> Try to apply this tiny (but dirty) patch on your version of PETSc to disable the VecScatterMemcpyPlan optimization to see if it helps. </div>
<div> Thanks.<br clear="all">
<div>
<div dir="ltr" class="m_4030954464987556008gmail_signature" data-smartmail="gmail_signature">
<div dir="ltr">--Junchao Zhang</div>
</div>
</div>
</div>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Wed, Mar 20, 2019 at 6:33 PM Junchao Zhang <<a href="mailto:jczhang@mcs.anl.gov" target="_blank" rel="noreferrer">jczhang@mcs.anl.gov</a>> wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div dir="ltr">
<div>Did you see the warning with small scale runs? Is it possible to provide a test code?</div>
<div>You mentioned "changing PETSc now would be pretty painful". Is it because it will affect your performance (but not your code)? If yes, could you try PETSc master and run you code with or without -vecscatter_type sf. I want to isolate the problem and
see if it is due to possible bugs in VecScatter.</div>
<div>If the above suggestion is not feasible, I will disable VecScatterMemcpy. It is an optimization I added. Sorry I did not have an option to turn off it because I thought it was always useful:) I will provide you a patch later to disable it. With that you
can run again to isolate possible bugs in VecScatterMemcpy.</div>
<div>Thanks.</div>
<div>
<div dir="ltr" class="m_4030954464987556008gmail-m_958850508585676410gmail_signature">
<div dir="ltr">--Junchao Zhang</div>
</div>
</div>
<br>
</div>
<br>
<div class="gmail_quote">
<div dir="ltr" class="gmail_attr">On Wed, Mar 20, 2019 at 5:40 PM Derek Gaston via petsc-users <<a href="mailto:petsc-users@mcs.anl.gov" target="_blank" rel="noreferrer">petsc-users@mcs.anl.gov</a>> wrote:<br>
</div>
<blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex">
<div dir="ltr">
<div dir="ltr">
<div dir="ltr">
<div dir="ltr">
<div dir="ltr">
<div>Trying to track down some memory corruption I'm seeing on larger scale runs (3.5B+ unknowns). Was able to run Valgrind on it... and I'm seeing quite a lot of uninitialized value errors coming from ghost updating. Here are some of the traces:</div>
<div><br>
</div>
<div>==87695== Conditional jump or move depends on uninitialised value(s)<br>
==87695== at 0x73236D3: PetscMallocAlign (mal.c:28)<br>
==87695== by 0x7323C70: PetscMallocA (mal.c:390)<br>
==87695== by 0x739048E: VecScatterMemcpyPlanCreate_Index (vscat.c:284)<br>
==87695== by 0x73A5D97: VecScatterMemcpyPlanCreate_PtoP (vpscat_mpi1.c:312)<br>
==64730== by 0x7393E8A: VecScatterSetUp_vectype_private (vscat.c:857)<br>
==64730== by 0x7395E5D: VecScatterSetUp_MPI1 (vpscat_mpi1.c:2543)<br>
==64730== by 0x73DDD39: VecScatterSetUp (vscatfce.c:212)<br>
==64730== by 0x73DCD73: VecScatterCreateWithData (vscreate.c:333)<br>
==64730== by 0x7444232: VecCreateGhostWithArray (pbvec.c:685)<br>
==64730== by 0x744490D: VecCreateGhost (pbvec.c:741)<br>
</div>
<div><br>
</div>
<div>==133582== Conditional jump or move depends on uninitialised value(s)<br>
==133582== at 0x4030384: memcpy@@GLIBC_2.14 (vg_replace_strmem.c:1034)<br>
==133582== by 0x739E4F9: PetscMemcpy (petscsys.h:1649)<br>
==133582== by 0x739E4F9: VecScatterMemcpyPlanExecute_Pack (vecscatterimpl.h:150)<br>
==133582== by 0x739E4F9: VecScatterBeginMPI1_1 (vpscat_mpi1.h:69)<br>
==133582== by 0x73DD964: VecScatterBegin (vscatfce.c:110)<br>
==133582== by 0x744E195: VecGhostUpdateBegin (commonmpvec.c:225)<br>
</div>
<div><br>
</div>
<div>This is from a Git checkout of PETSc... the hash I branched from is: 0e667e8fea4aa from December 23rd (updating would be really hard at this point as I've completed 90% of my dissertation with this version... and changing PETSc now would be pretty painful!).</div>
<div><br>
</div>
<div>Any ideas? Is it possible it's in my code? Is it possible that there are later PETSc commits that already fix this?</div>
<div><br>
</div>
<div>Thanks for any help,</div>
<div>Derek<br>
</div>
<div><br>
</div>
</div>
</div>
</div>
</div>
</div>
</blockquote>
</div>
</blockquote>
</div>
</div>
</div>
</blockquote></div>