Thank you Pavan and Darius for your help.<br><br>I am in the process of running MPI application with check-pointing, but I am facing a problem in running the application(without checkpoint) at the first place. I tried running the application on 2 processing nodes with default HYDRA process manager.<br>
<br>Command : $: ../bin/mpiexec -np 2 ./mpiexample<br>(host file has domain names for 2 hosts) <br><br>The error shown is - <br>
<i>Fatal error in MPI_Send: Other MPI error, error stack:<br> MPI_Send(174).................</i><i>....: MPI_Send(buf=0x7fff379fabb8, count=1, MPI_INT, dest=1, tag=0, MPI_COMM_WORLD) failed<br> MPIDI_CH3I_Progress(165)......</i><i>....: <br>
MPID_nem_mpich2_blocking_recv(</i><i>895): <br> MPID_nem_tcp_connpoll(1714)...</i><i>....: Communication error</i><br><br>Can you please suggest how can I find the cause for this error.<br><br>Thanks,<br>Kishor<br><div class="gmail_quote">
On Wed, Jul 14, 2010 at 2:03 PM, Darius Buntinas <span dir="ltr"><<a href="mailto:buntinas@mcs.anl.gov" target="_blank">buntinas@mcs.anl.gov</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
Here's a wiki page that has some info on building it and running applications. Let me know if you have trouble with this.<br>
<br>
<a href="http://wiki.mcs.anl.gov/mpich2/index.php/Checkpointing" target="_blank">http://wiki.mcs.anl.gov/mpich2/index.php/Checkpointing</a><br>
<font color="#888888"><br>
-d<br>
</font><div><div></div><div><br>
On Jul 13, 2010, at 9:37 AM, kishor kharbas wrote:<br>
<br>
> Hi,<br>
><br>
> Does the beta version - mpich2-1.3a2 have support for BLCR ?<br>
> If so where can I find guidelines regarding usage of the functionality, if could not find it in the user guide document included with the above version.<br>
><br>
><br>
> Thanks,<br>
> Kishor<br>
> On Mon, Jul 12, 2010 at 11:14 AM, Darius Buntinas <<a href="mailto:buntinas@mcs.anl.gov" target="_blank">buntinas@mcs.anl.gov</a>> wrote:<br>
><br>
> The next release of MPICH2 (1.3) will include checkpointing support using BLCR. You can try the beta release that's available under 'downloads' on the MPICH2 website:<br>
><br>
> <a href="http://www.mcs.anl.gov/research/projects/mpich2/" target="_blank">http://www.mcs.anl.gov/research/projects/mpich2/</a><br>
><br>
> You'll need to install BLCR version 0.8.2 (which is currently the latest version).<br>
><br>
> -d<br>
><br>
> On Jul 12, 2010, at 9:05 AM, kishor kharbas wrote:<br>
><br>
> > Hello,<br>
> ><br>
> > I would like to know whether there are any plans for including Berkeley lab checkpoint restart(BLCR) in MPICH2 runtime environment.<br>
> ><br>
> > Thanks,<br>
> > Kishor Kharbas<br>
> > MS Student<br>
> > Department of Computer Science<br>
> > NC State University<br>
> > Raleigh, NC 27606<br>
> > _______________________________________________<br>
> > mpich-discuss mailing list<br>
> > <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
> > <a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
><br>
> _______________________________________________<br>
> mpich-discuss mailing list<br>
> <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
> <a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
><br>
><br>
><br>
> --<br>
> MS Student<br>
> Department of Computer Science<br>
> NC State University<br>
> Raleigh, NC 27606<br>
> _______________________________________________<br>
> mpich-discuss mailing list<br>
> <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
> <a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
<br>
_______________________________________________<br>
mpich-discuss mailing list<br>
<a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
<a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>MS Student<br>Department of Computer Science<br>NC State University<br>Raleigh, NC 27606<br>