<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.18783"></HEAD>
<BODY>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT color=#0000ff
size=2 face=Arial>What are the exact parameters you passed to configure when
building MPICH2? Are the two machines identical?</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT color=#0000ff
size=2 face=Arial></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT color=#0000ff
size=2 face=Arial>Rajeev</FONT></SPAN></DIV><BR>
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px">
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2 face=Tahoma><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>Gra
zeus<BR><B>Sent:</B> Saturday, July 18, 2009 12:06 AM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss] I wonder if my
mpdboot is the cause ofproblem...help me!<BR></FONT><BR></DIV>
<DIV></DIV>
<TABLE border=0 cellSpacing=0 cellPadding=0>
<TBODY>
<TR>
<TD vAlign=top>hello,
<DIV><BR></DIV>
<DIV>thx for answer yesterday.</DIV>
<DIV>I tested my code in one machine (with"mpiexec -n 2
./myprog"),everything work fine - my program can use MPI_Send,MPI_Recv
without any problems.</DIV>
<DIV><BR></DIV>
<DIV>today, I setup mpich2 on two machines. Both machines can
communicate with others, ssh are tested on both machines, mpd work,
mpdringtest work.</DIV>
<DIV><BR></DIV>
<DIV>however,when i run my program that use MPI_Send and MPI_Recv,
MPI_Recv is blocked forever.</DIV>
<DIV>so i write new simple code to test MPI_Send,MPI_Recv like
this</DIV>
<DIV><BR></DIV>
<DIV> int myrank;</DIV>
<DIV>
<DIV> MPI_Status status;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Init( &argc, &argv );</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Comm_rank( MPI_COMM_WORLD, &myrank
);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>if
(myrank == 0) </DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>{</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>int
senddata = 1;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Send(&senddata, 1, MPI_INT, 1, 0,
MPI_COMM_WORLD);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>}</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>else if
(myrank == 1) </DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>{</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>int
recvdata = 0;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Recv(&recvdata, 1, MPI_INT, 0, 0,
MPI_COMM_WORLD, &status);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>printf("received :%d:\n", recvdata);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre" class=Apple-tab-span></SPAN>}</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Finalize();</DIV>
<DIV><BR></DIV>
<DIV><BR></DIV>
<DIV>i got this error</DIV>
<DIV><BR></DIV>
<DIV><BR></DIV>
<DIV>
<DIV>Assertion failed in file ch3_progress.c at line 489: pkt->type
>= 0 && pkt->type < MPIDI_NEM_PKT_END</DIV>
<DIV>internal ABORT - process 1</DIV>
<DIV>Fatal error in MPI_Finalize: Other MPI error, error stack:</DIV>
<DIV>MPI_Finalize(315)..................: MPI_Finalize failed</DIV>
<DIV>MPI_Finalize(207)..................: </DIV>
<DIV>MPID_Finalize(92)..................: </DIV>
<DIV>PMPI_Barrier(476)..................: MPI_Barrier(comm=0x44000002)
failed</DIV>
<DIV>MPIR_Barrier(82)...................: </DIV>
<DIV>MPIC_Sendrecv(164).................: </DIV>
<DIV>MPIC_Wait(405).....................: </DIV>
<DIV>MPIDI_CH3I_Progress(150)...........: </DIV>
<DIV>MPID_nem_mpich2_blocking_recv(1074): </DIV>
<DIV>MPID_nem_tcp_connpoll(1667)........: </DIV>
<DIV>state_commrdy_handler(1517)........: </DIV>
<DIV>MPID_nem_tcp_recv_handler(1413)....: socket closed</DIV>
<DIV><BR></DIV>
<DIV>////////////////////////////////////////////////////////////////</DIV>
<DIV><BR></DIV>
<DIV>I also tried example/cpi that come with install package ->
result is the example program freezed, without any errors.(I assume it
stopped at MPI_Bcast())</DIV>
<DIV><BR></DIV>
<DIV>Can anyone help me with this?</DIV>
<DIV>This code and my program can run smoothly when I use 1 machine
(with option , -n 2, -n 4 .... etc) but whenever I start mpdboot
with 2 machines, mpi processes can't communicate with other mpi
processes via MPI_Send,MPI_Recv.</DIV>
<DIV><BR></DIV>
<DIV>thx,</DIV>
<DIV>gra</DIV></DIV>
<DIV><BR></DIV></DIV>
<DIV><BR></DIV></TD></TR></TBODY></TABLE><BR></BLOCKQUOTE></BODY></HTML>