<br clear="all"><div><font face="Times New Roman" size="2">
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Hi,</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">I&#39;m 
in my second year of my graduate education. And now I&#39;m learning 
MPI.</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">I 
have learned from file &#39;CHANGES&#39; in mpich2-1.3.1 package,&nbsp; that the&nbsp;newest 
version of MPICH2&nbsp;will not cause the whole job to abort if a process failure 
occurs.</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">The 
following is quoted from file &#39;CHANGES&#39;:</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">OVERALL: 
Improved tolerance to process and communication failures<br>when error handler 
is set to MPI_ERRORS_RETURN. If a communication<br>operation fails (e.g., due to 
a process failure) MPICH2 will return<br>an error, and further communication to 
that process is not<br>possible. However, communication with other processes 
will still<br>proceed normally. Note, however, that the behavior 
collective<br>operations on communicators containing the failed process 
is<br>undefined, and may give incorrect results or hang some 
processes</span><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">.</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">I 
have done some simple tests, but the results confused me. </span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">The 
following&nbsp;is my source code, </span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">#include 
...</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">...</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">void 
recover(MPI_Comm *comm, int *err_code,...)<br>{<br>&nbsp;printf(&quot;Pid%d: in 
recovery...\n&quot;,getpid());<br>}<br>&nbsp;<br>int main(int argc,char* 
argv[])<br>{<br>&nbsp;int rank,size;<br>&nbsp;int tag = 99;<br>&nbsp;char 
buf[20]=&quot;&quot;;<br>&nbsp;MPI_Comm comm;<br>&nbsp;MPI_Status status;<br>&nbsp;MPI_Errhandler 
errh;<br>&nbsp;MPI_Init(&amp;argc,&amp;argv);<br>&nbsp;MPI_Comm_size(MPI_COMM_WORLD,&amp;size);<br>&nbsp;MPI_Comm_rank(MPI_COMM_WORLD,&amp;rank);<br>&nbsp;MPI_Comm_dup(MPI_COMM_WORLD, 
&amp;comm);<br>&nbsp;&nbsp;&nbsp; <br>&nbsp;MPI_Errhandler_create(recover, &amp;errh);<br>&nbsp;&nbsp;&nbsp; 
&nbsp;MPI_Errhandler_set(comm,errh);<br>&nbsp;<br>&nbsp;printf(&quot;P%d: pid = %d\n&quot;, rank, 
getpid());<br>&nbsp;if(rank == 0)<br>&nbsp;{<br>&nbsp;&nbsp;strcpy( buf, 
&quot;haha!\n&quot;);<br>&nbsp;&nbsp;MPI_Send(buf, 10, MPI_CHAR, 1, tag , comm);<br>&nbsp;&nbsp;strcpy(buf, 
&quot;hehe\n&quot;);<br>&nbsp;&nbsp;MPI_Send(buf, 10, MPI_CHAR, 2, tag, comm);</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;&nbsp;MPI_Recv(buf, 
20, MPI_CHAR, 2, tag , comm, &amp;status);<br>&nbsp;}<br>&nbsp;else 
<br>&nbsp;{<br>&nbsp;&nbsp;sleep(40);<br>&nbsp;&nbsp;MPI_Recv(buf, 10, MPI_CHAR, 0, tag , comm, 
&amp;status);</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;&nbsp;if(rank 
== 2)<br>&nbsp;&nbsp;{<br>&nbsp;&nbsp;&nbsp;strcat(buf, &quot; by P2!\n&quot;);<br>&nbsp;&nbsp;&nbsp;MPI_Send(buf, 20, MPI_CHAR, 
0, tag, comm);</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;&nbsp;}<br>
&nbsp;}<br>&nbsp;printf(&quot;P%d: 
%s\n&quot;,rank, buf);<br>&nbsp;MPI_Finalize();<br>&nbsp;return 1;<br>}</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">I 
kill process P2 by kill command and its pid, but the results of the job 
is:</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; COLOR: red; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">mpirun 
-np 3 ./hello<br>P0: pid = 17157<br>P2: pid = 17159<br>P1: pid = 
17158<br>APPLICATION TERMINATED WITH THE EXIT STRING: Hangup (signal 
1)</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Where 
does the problem lie? Looking forward eagerly for your reply.</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体">&nbsp;</span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Yours 
sincerely,</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Rui</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt"><br>
</span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">------</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>

<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Rui 
Wang</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p>
<p class="MsoNormal" style="MARGIN: 0cm 0cm 0pt; TEXT-ALIGN: left; mso-pagination: widow-orphan" align="left"><span lang="EN-US" style="FONT-SIZE: 10pt; FONT-FAMILY: &#39;Times New Roman&#39;,&#39;serif&#39;; mso-fareast-font-family: 宋体; mso-font-kerning: 0pt">Institute 
of Computing technology, Chinese Academy of Science, Beijing, PR. 
China</span><span lang="EN-US" style="FONT-SIZE: 12pt; FONT-FAMILY: 宋体; mso-font-kerning: 0pt; mso-bidi-font-family: 宋体"></span></p></font></div><br>