[mpich-discuss] Don't crash on node failures
    Pavan Balaji 
    balaji at mcs.anl.gov
       
    Wed Apr 14 19:34:01 CDT 2010
    
    
  
On 04/14/2010 03:13 AM, Jürgen Kaiser wrote:
> Can I force MPI to not abort the whole job when a node crashes? I would
> like to let the remaining MPI-processes perform some action in that case
> and then proceed.
This support is not currently available in MPICH2, but we are actively 
working on it. We hope to have this in the 1.3 release of mpich2, though 
it's possible that it might get delayed to the next major version.
  -- Pavan
-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji
    
    
More information about the mpich-discuss
mailing list