[mpich-discuss] Fault tolerance and Losing a Node

Hiatt, Dave M dave.m.hiatt at citi.com
Wed Sep 15 15:11:36 CDT 2010


I thought I recently saw a note saying that 1.3.1 can support programmatic recovery, so that processing continues after one of the members of a communicator is lost.  Is there a tutorial, documentation, or a best-practices document on implementing this feature (if it does indeed exist, as I hope)?

Thanks

"People get held back by the voice inside em" - K'naan - In the Beginning

Dave M. Hiatt
Director, Risk Analytics
CitiMortgage
1000 Technology Drive
O'Fallon, MO 63368-2240

Telephone:  636-261-1408


