PS: <div><br></div><div>When the code error is reported at iteration n, if I restart the program at iteration n-1, then everything goes fine for several iteration rounds. Then, the error occurs again.<br><br><div class="gmail_quote">
On Fri, Feb 11, 2011 at 12:59 AM, Xiao Li <span dir="ltr"><<a href="mailto:shinelee.thewise@gmail.com">shinelee.thewise@gmail.com</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<div>Hi,</div><div><br></div><div>I use a small MPI program and get the following error. </div><div><br></div><div><blockquote class="gmail_quote" style="margin-top:0px;margin-right:0px;margin-bottom:0px;margin-left:0.8ex;border-left-width:1px;border-left-color:rgb(204, 204, 204);border-left-style:solid;padding-left:1ex">
Fatal error in MPI_Recv: Other MPI error, error stack:<br>MPI_Recv(186)........................: MPI_Recv(buf=0012FA20, count=1, MPI_INT,<br>src=MPI_ANY_SOURCE, tag=5, MPI_COMM_WORLD, status=0012FA80) failed<br>MPIDI_CH3I_Progress(335).............:<br>
MPID_nem_mpich2_blocking_recv(906)...:<br>MPID_nem_newtcp_module_poll(37)......:<br>MPID_nem_newtcp_module_connpoll(2655):<br>gen_read_fail_handler(1145)..........: read from socket failed - The specified network name is no longer available.</blockquote>
</div><div><br></div><div>The code framework is something like this below.</div><div><br></div><div>if rank == 0</div><div>{</div><div> for iter=1 to N</div><div> MPI_Recv any</div><div> get proc rank from status</div>
<div> MPI_Send proc</div><div> end</div><div>}</div><div>else</div><div>{</div><div> for iter=1 to N</div><div> MPI_Send to 0</div><div> MPI_Recv from 0</div><div> do some computation here</div>
<div> end<br></div><div>}</div><div><br></div><div>I do check my code carefully. And I even rewrite the core computation code in a series way. Then I get no error. Even more strange is that the code will crash at different for loop iteration. I suspect the MPI can not work in my network environment. The network is composed by four Windows XP machines with 100/mbps Ethernet network. Would you help me on this issue?</div>
<div><br></div><div>cheers</div><div>Xiao</div><font color="#888888"><div> </div>
</font></blockquote></div><br></div>