<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.18783"></HEAD>
<BODY>
<DIV dir=ltr align=left><SPAN class=031414115-18072009><FONT color=#0000ff
size=2 face=Arial>Are the CPUs identical on them? Is one 32-bit, the other
64-bit?</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=031414115-18072009></SPAN> </DIV><BR>
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px">
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2 face=Tahoma><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>Gra
zeus<BR><B>Sent:</B> Saturday, July 18, 2009 10:27 AM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> Re: [mpich-discuss] I wonder if
my mpdboot is the causeofproblem...help me!<BR></FONT><BR></DIV>
<DIV></DIV>
<TABLE border=0 cellSpacing=0 cellPadding=0>
<TBODY>
<TR>
<TD vAlign=top>one of them is quad core and another one is dual core.
however, OS,account,my password,install path are all the same.
<DIV>I use this configuration "./configure
--prefix=/opt/localhomes/myname/mpich2-install" in both
machines.</DIV><BR>--- On <B>Sat, 7/18/09, Rajeev Thakur
<I><thakur@mcs.anl.gov></I></B> wrote:<BR>
<BLOCKQUOTE
style="BORDER-LEFT: rgb(16,16,255) 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px"><BR>From:
Rajeev Thakur <thakur@mcs.anl.gov><BR>Subject: Re:
[mpich-discuss] I wonder if my mpdboot is the cause ofproblem...help
me!<BR>To: mpich-discuss@mcs.anl.gov<BR>Date: Saturday, July 18, 2009,
7:02 AM<BR><BR>
<DIV id=yiv189604466>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT
color=#0000ff size=2 face=Arial>What are the exact parameters you
passed to configure when building MPICH2? Are the two machines
identical?</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT
color=#0000ff size=2 face=Arial></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=406120114-18072009><FONT
color=#0000ff size=2 face=Arial>Rajeev</FONT></SPAN></DIV><BR>
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px">
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2 face=Tahoma><B>From:</B>
mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>Gra
zeus<BR><B>Sent:</B> Saturday, July 18, 2009 12:06 AM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss] I
wonder if my mpdboot is the cause ofproblem...help
me!<BR></FONT><BR></DIV>
<DIV></DIV>
<TABLE border=0 cellSpacing=0 cellPadding=0>
<TBODY>
<TR>
<TD vAlign=top>hello,
<DIV><BR></DIV>
<DIV>thx for answer yesterday.</DIV>
<DIV>I tested my code in one machine (with"mpiexec -n 2
./myprog"),everything work fine - my program can use
MPI_Send,MPI_Recv without any problems.</DIV>
<DIV><BR></DIV>
<DIV>today, I setup mpich2 on two machines. Both machines can
communicate with others, ssh are tested on both machines, mpd
work, mpdringtest work.</DIV>
<DIV><BR></DIV>
<DIV>however,when i run my program that use MPI_Send and
MPI_Recv, MPI_Recv is blocked forever.</DIV>
<DIV>so i write new simple code to test MPI_Send,MPI_Recv like
this</DIV>
<DIV><BR></DIV>
<DIV> int myrank;</DIV>
<DIV>
<DIV> MPI_Status status;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Init( &argc, &argv
);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Comm_rank( MPI_COMM_WORLD,
&myrank );</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>if (myrank == 0) </DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>{</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>int senddata = 1;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Send(&senddata, 1,
MPI_INT, 1, 0, MPI_COMM_WORLD);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>}</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>else if (myrank == 1) </DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>{</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>int recvdata = 0;</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Recv(&recvdata, 1,
MPI_INT, 0, 0, MPI_COMM_WORLD, &status);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>printf("received :%d:\n",
recvdata);</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>}</DIV>
<DIV><SPAN style="WHITE-SPACE: pre"
class=Apple-tab-span></SPAN>MPI_Finalize();</DIV>
<DIV><BR></DIV>
<DIV><BR></DIV>
<DIV>i got this error</DIV>
<DIV><BR></DIV>
<DIV><BR></DIV>
<DIV>
<DIV>Assertion failed in file ch3_progress.c at line 489:
pkt->type >= 0 && pkt->type <
MPIDI_NEM_PKT_END</DIV>
<DIV>internal ABORT - process 1</DIV>
<DIV>Fatal error in MPI_Finalize: Other MPI error, error
stack:</DIV>
<DIV>MPI_Finalize(315)..................: MPI_Finalize
failed</DIV>
<DIV>MPI_Finalize(207)..................: </DIV>
<DIV>MPID_Finalize(92)..................: </DIV>
<DIV>PMPI_Barrier(476)..................:
MPI_Barrier(comm=0x44000002) failed</DIV>
<DIV>MPIR_Barrier(82)...................: </DIV>
<DIV>MPIC_Sendrecv(164).................: </DIV>
<DIV>MPIC_Wait(405).....................: </DIV>
<DIV>MPIDI_CH3I_Progress(150)...........: </DIV>
<DIV>MPID_nem_mpich2_blocking_recv(1074): </DIV>
<DIV>MPID_nem_tcp_connpoll(1667)........: </DIV>
<DIV>state_commrdy_handler(1517)........: </DIV>
<DIV>MPID_nem_tcp_recv_handler(1413)....: socket closed</DIV>
<DIV><BR></DIV>
<DIV>////////////////////////////////////////////////////////////////</DIV>
<DIV><BR></DIV>
<DIV>I also tried example/cpi that come with install package
-> result is the example program freezed, without any
errors.(I assume it stopped at MPI_Bcast())</DIV>
<DIV><BR></DIV>
<DIV>Can anyone help me with this?</DIV>
<DIV>This code and my program can run smoothly when I use 1
machine (with option , -n 2, -n 4 .... etc) but whenever
I start mpdboot with 2 machines, mpi processes can't
communicate with other mpi processes via
MPI_Send,MPI_Recv.</DIV>
<DIV><BR></DIV>
<DIV>thx,</DIV>
<DIV>gra</DIV></DIV>
<DIV><BR></DIV></DIV>
<DIV><BR></DIV></TD></TR></TBODY></TABLE><BR></BLOCKQUOTE></DIV></BLOCKQUOTE></TD></TR></TBODY></TABLE><BR></BLOCKQUOTE></BODY></HTML>