<table cellspacing="0" cellpadding="0" border="0" ><tr><td valign="top" style="font: inherit;">one of them is quad core and another one is dual core. however, OS,account,my password,install path are all the same.<div>I use this configuration "./configure --prefix=/opt/localhomes/myname/mpich2-install" in both machines.</div><br>--- On <b>Sat, 7/18/09, Rajeev Thakur <i><thakur@mcs.anl.gov></i></b> wrote:<br><blockquote style="border-left: 2px solid rgb(16, 16, 255); margin-left: 5px; padding-left: 5px;"><br>From: Rajeev Thakur <thakur@mcs.anl.gov><br>Subject: Re: [mpich-discuss] I wonder if my mpdboot is the cause ofproblem...help me!<br>To: mpich-discuss@mcs.anl.gov<br>Date: Saturday, July 18, 2009, 7:02 AM<br><br><div id="yiv189604466">
<div dir="ltr" align="left"><span class="406120114-18072009"><font color="#0000ff" size="2" face="Arial">What are the exact parameters you passed to configure when
building MPICH2? Are the two machines identical?</font></span></div>
<div dir="ltr" align="left"><span class="406120114-18072009"><font color="#0000ff" size="2" face="Arial"></font></span> </div>
<div dir="ltr" align="left"><span class="406120114-18072009"><font color="#0000ff" size="2" face="Arial">Rajeev</font></span></div><br>
<blockquote style="BORDER-LEFT:#0000ff 2px solid;PADDING-LEFT:5px;MARGIN-LEFT:5px;MARGIN-RIGHT:0px;">
<div dir="ltr" lang="en-us" class="OutlookMessageHeader" align="left">
<hr tabindex="-1">
<font size="2" face="Tahoma"><b>From:</b> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <b>On Behalf Of </b>Gra
zeus<br><b>Sent:</b> Saturday, July 18, 2009 12:06 AM<br><b>To:</b>
mpich-discuss@mcs.anl.gov<br><b>Subject:</b> [mpich-discuss] I wonder if my
mpdboot is the cause ofproblem...help me!<br></font><br></div>
<div></div>
<table border="0" cellspacing="0" cellpadding="0">
<tbody>
<tr>
<td valign="top">hello,
<div><br></div>
<div>thx for answer yesterday.</div>
<div>I tested my code in one machine (with"mpiexec -n 2
./myprog"),everything work fine - my program can use MPI_Send,MPI_Recv
without any problems.</div>
<div><br></div>
<div>today, I setup mpich2 on two machines. Both machines can
communicate with others, ssh are tested on both machines, mpd work,
mpdringtest work.</div>
<div><br></div>
<div>however,when i run my program that use MPI_Send and MPI_Recv,
MPI_Recv is blocked forever.</div>
<div>so i write new simple code to test MPI_Send,MPI_Recv like
this</div>
<div><br></div>
<div> int myrank;</div>
<div>
<div> MPI_Status status;</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>MPI_Init( &argc, &argv );</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>MPI_Comm_rank( MPI_COMM_WORLD, &myrank
);</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>if
(myrank == 0) </div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>{</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>int
senddata = 1;</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>MPI_Send(&senddata, 1, MPI_INT, 1, 0,
MPI_COMM_WORLD);</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>}</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>else if
(myrank == 1) </div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>{</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>int
recvdata = 0;</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>MPI_Recv(&recvdata, 1, MPI_INT, 0, 0,
MPI_COMM_WORLD, &status);</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>printf("received :%d:\n", recvdata);</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>}</div>
<div><span style="WHITE-SPACE:pre;" class="Apple-tab-span"></span>MPI_Finalize();</div>
<div><br></div>
<div><br></div>
<div>i got this error</div>
<div><br></div>
<div><br></div>
<div>
<div>Assertion failed in file ch3_progress.c at line 489: pkt->type
>= 0 && pkt->type < MPIDI_NEM_PKT_END</div>
<div>internal ABORT - process 1</div>
<div>Fatal error in MPI_Finalize: Other MPI error, error stack:</div>
<div>MPI_Finalize(315)..................: MPI_Finalize failed</div>
<div>MPI_Finalize(207)..................: </div>
<div>MPID_Finalize(92)..................: </div>
<div>PMPI_Barrier(476)..................: MPI_Barrier(comm=0x44000002)
failed</div>
<div>MPIR_Barrier(82)...................: </div>
<div>MPIC_Sendrecv(164).................: </div>
<div>MPIC_Wait(405).....................: </div>
<div>MPIDI_CH3I_Progress(150)...........: </div>
<div>MPID_nem_mpich2_blocking_recv(1074): </div>
<div>MPID_nem_tcp_connpoll(1667)........: </div>
<div>state_commrdy_handler(1517)........: </div>
<div>MPID_nem_tcp_recv_handler(1413)....: socket closed</div>
<div><br></div>
<div>////////////////////////////////////////////////////////////////</div>
<div><br></div>
<div>I also tried example/cpi that come with install package ->
result is the example program freezed, without any errors.(I assume it
stopped at MPI_Bcast())</div>
<div><br></div>
<div>Can anyone help me with this?</div>
<div>This code and my program can run smoothly when I use 1 machine
(with option , -n 2, -n 4 .... etc) but whenever I start mpdboot
with 2 machines, mpi processes can't communicate with other mpi
processes via MPI_Send,MPI_Recv.</div>
<div><br></div>
<div>thx,</div>
<div>gra</div></div>
<div><br></div></div>
<div><br></div></td></tr></tbody></table><br></blockquote>
</div></blockquote></td></tr></table><br>