<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.18812"></HEAD>
<BODY>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009>Hi,</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009> We are currently working on adding
fault-tolerance to MPICH2. So in couple of months we might have something that
you can work with.</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009> On a side note, what kind of process crash do you
see ? Is this an application error (which you should fix anyway)? Is it due to
an internal MPICH2 error ? Please provide us more details.</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009>Regards,</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT color=#0000ff size=2 face=Arial><SPAN
class=273504714-13102009>Jayesh</SPAN></FONT></DIV><BR>
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2 face=Tahoma><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>abhishek
pandey<BR><B>Sent:</B> Tuesday, October 13, 2009 7:23 AM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss] If one process of
Cluster crashes<BR></FONT><BR></DIV>
<DIV></DIV>Hi,<BR><BR>I am using MPICH2 on windows and sometime I face the
problem of crashing of one process in cluster. Is there any way to handle this ?
I do not want to start the cluster all over again.<BR>As far as I know, if one
process of cluster goes down anyhow then the cluster also goes down.
<BR><BR><BR>Thanks,<BR>Abhishek.<BR></BODY></HTML>