[mpich-discuss] Fault tolerance and Losing a Node
Hiatt, Dave M
dave.m.hiatt at citi.com
Wed Sep 15 15:11:36 CDT 2010
I thought I recently saw a note saying that 1.3.1 can support programmatic recovery and continued processing after losing one of the members of a communicator. If this feature does exist, as I hope it does, is there a tutorial, documentation, or a best-practices document on implementing it?
Thanks
"People get held back by the voice inside em" - K'naan - In the Beginning
Dave M. Hiatt
Director, Risk Analytics
CitiMortgage
1000 Technology Drive
O'Fallon, MO 63368-2240
Telephone: 636-261-1408