<div dir="ltr"><div dir="ltr">On Wed, Oct 25, 2023 at 8:32 AM Joauma Marichal via petsc-maint <<a href="mailto:petsc-maint@mcs.anl.gov">petsc-maint@mcs.anl.gov</a>> wrote:<br></div><div class="gmail_quote"><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="msg-1889958021554448809">
<div lang="FR-BE" style="overflow-wrap: break-word;">
<div class="m_-1889958021554448809WordSection1">
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">Hello, <u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"> <u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">I am using the DMSwarm library in some Eulerian-Lagrangian approach to have vapor bubbles in water. <u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">I have obtained nice results recently and wanted to perform bigger simulations. Unfortunately, when I increase the number of processors used to run the simulation, I get the
following error:<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"><u></u> <u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">free(): invalid size</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] *** Process received signal ***</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] Signal: Aborted (6)</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] Signal code:</span></span><span class="m_-1889958021554448809apple-converted-space"><span lang="EN-US">
</span></span><span class="m_-1889958021554448809s1"><span lang="EN-US">(-6)</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 0] /lib64/libc.so.6(+0x4eb20)[0x7f56cd4c9b20]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 1] /lib64/libc.so.6(gsignal+0x10f)[0x7f56cd4c9a9f]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 2] /lib64/libc.so.6(abort+0x127)[0x7f56cd49ce05]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 3] /lib64/libc.so.6(+0x91037)[0x7f56cd50c037]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 4] /lib64/libc.so.6(+0x9819c)[0x7f56cd51319c]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 5] /lib64/libc.so.6(+0x99aac)[0x7f56cd514aac]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 6] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(PetscSFSetUpRanks+0x4c4)[0x7f56cea71e64]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 7] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(+0x841642)[0x7f56cea83642]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [ 8] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(PetscSFSetUp+0x9e)[0x7f56cea7043e]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1">[cns136:590327] [ 9] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(VecScatterCreate+0x164e)[0x7f56cea7bbde]</span><u></u><u></u></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1">[cns136:590327] [10] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(DMSetUp_DA_3D+0x3e38)[0x7f56cee84dd8]</span><u></u><u></u></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1">[cns136:590327] [11] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(DMSetUp_DA+0xd8)[0x7f56cee9b448]</span><u></u><u></u></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1">[cns136:590327] [12] /gpfs/home/acad/ucl-tfl/marichaj/marha/lib_petsc/lib/libpetsc.so.3.019(DMSetUp+0x20)[0x7f56cededa20]</span><u></u><u></u></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [13] ./cobpor[0x4418dc]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [14] ./cobpor[0x408b63]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [15] /lib64/libc.so.6(__libc_start_main+0xf3)[0x7f56cd4b5cf3]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] [16] ./cobpor[0x40bdee]</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">[cns136:590327] *** End of error message ***</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">--------------------------------------------------------------------------</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">Primary job</span></span><span class="m_-1889958021554448809apple-converted-space"><span lang="EN-US">
</span></span><span class="m_-1889958021554448809s1"><span lang="EN-US">terminated normally, but 1 process returned</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1">a non-zero exit code. </span><span class="m_-1889958021554448809s1"><span lang="EN-US">Per user-direction, the job has been aborted.</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">--------------------------------------------------------------------------</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">--------------------------------------------------------------------------</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">mpiexec noticed that process rank 84 with PID 590327 on node cns136 exited on signal 6 (Aborted).</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="m_-1889958021554448809p1"><span class="m_-1889958021554448809s1"><span lang="EN-US">--------------------------------------------------------------------------</span></span><span lang="EN-US"><u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">When I reduce the number of processors the error disappears and when I run my code without the vapor bubbles it also works.
<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">The problem seems to take place at this moment:<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"><u></u> <u></u></span></p>
<p class="MsoNormal" style="line-height:13.5pt;background:rgb(30,30,30)"><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(220,220,170)">DMCreate</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">(PETSC_COMM_WORLD,swarm);<u></u><u></u></span></p>
<p class="MsoNormal" style="line-height:13.5pt;background:rgb(30,30,30)"><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">
</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(220,220,170)">DMSetType</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">(*swarm,DMSWARM);<u></u><u></u></span></p>
<p class="MsoNormal" style="line-height:13.5pt;background:rgb(30,30,30)"><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">
</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(220,220,170)">DMSetDimension</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">(*swarm,</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(181,206,168)">3</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">);<u></u><u></u></span></p>
<p class="MsoNormal" style="line-height:13.5pt;background:rgb(30,30,30)"><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">
</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(220,220,170)">DMSwarmSetType</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">(*swarm,DMSWARM_PIC);<u></u><u></u></span></p>
<p class="MsoNormal" style="line-height:13.5pt;background:rgb(30,30,30)"><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">
</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(220,220,170)">DMSwarmSetCellDM</span><span lang="EN-US" style="font-size:9pt;font-family:Menlo;color:rgb(212,212,212)">(*swarm,*dmcell);<u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"><u></u> <u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">Thanks a lot for your help. </span></p></div></div></div></blockquote><div><br></div><div>Things that would help us track this down:</div><div><br></div><div>1) The smallest example where it fails</div><div><br></div><div>2) The smallest number of processes where it fails</div><div><br></div><div>3) A stack trace of the failure</div><div><br></div><div>4) A simple example that we can run that also fails</div><div><br></div><div> Thanks,</div><div><br></div><div> Matt</div><div> </div><blockquote class="gmail_quote" style="margin:0px 0px 0px 0.8ex;border-left:1px solid rgb(204,204,204);padding-left:1ex"><div class="msg-1889958021554448809"><div lang="FR-BE" style="overflow-wrap: break-word;"><div class="m_-1889958021554448809WordSection1">
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">Best regards, </span><span style="color:rgb(33,33,33)"><u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)"> </span><span style="color:rgb(33,33,33)"><u></u><u></u></span></p>
<p class="MsoNormal"><span lang="EN-US" style="color:rgb(33,33,33)">Joauma</span><span style="color:rgb(33,33,33)"><u></u><u></u></span></p>
<p class="MsoNormal"><span lang="NL"><u></u><u></u></span></p>
</div>
</div>
</div></blockquote></div><br clear="all"><div><br></div><span class="gmail_signature_prefix">-- </span><br><div dir="ltr" class="gmail_signature"><div dir="ltr"><div><div dir="ltr"><div><div dir="ltr"><div>What most experimenters take for granted before they begin their experiments is infinitely more interesting than any results to which their experiments lead.<br>-- Norbert Wiener</div><div><br></div><div><a href="http://www.cse.buffalo.edu/~knepley/" target="_blank">https://www.cse.buffalo.edu/~knepley/</a><br></div></div></div></div></div></div></div></div>