<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML dir=ltr xmlns:o = "urn:schemas-microsoft-com:office:office"><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<META name=GENERATOR content="MSHTML 8.00.6001.18904"></HEAD>
<BODY>
<DIV dir=ltr align=left><SPAN class=908395118-02042010><FONT color=#0000ff
size=2 face=Arial>Can you try configuring with --with-device=ch3:sock and see if
that works?</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=908395118-02042010><FONT color=#0000ff
size=2 face=Arial></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=908395118-02042010><FONT color=#0000ff
size=2 face=Arial>Rajeev</FONT></SPAN></DIV><BR>
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px"
dir=ltr>
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2 face=Tahoma><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>Costa,
Michael<BR><B>Sent:</B> Friday, April 02, 2010 1:01 PM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss]
mpich2-1.2.1p1<BR></FONT><BR></DIV>
<DIV></DIV>
<DIV dir=ltr id=idOWAReplyText94516>
<DIV dir=ltr><FONT color=#000000 size=2 face=Arial>I have been struggling with
communication errors when ever I run mpiexec. This installation is on PARISC
based cluster. Running mpich2-1.2.1p1, I have configured it with
--with-device=ch3:nemesis.
<DIV dir=ltr><FONT size=2 face=Arial></FONT> </DIV>
<DIV dir=ltr><FONT size=2 face=Arial>Currently only 2 nodes are in the
ring, </FONT><FONT size=2 face=Arial>hp20 and hp14 for testing/setup
purposes.</FONT></DIV>
<DIV dir=ltr><FONT size=2 face=Arial></FONT> </DIV>
<DIV dir=ltr><FONT size=2 face=Arial>The following steps may shed some light
on the problem, which I'm sure is something I have omitted or failed to do
with the initial installation/configuration. It appears that I can run non MPI
programs OK, but MPI code like cpi or hello fail.</FONT></DIV>
<DIV dir=ltr><FONT size=2 face=Arial></FONT> </DIV>
<DIV dir=ltr><FONT size=2 face=Arial></FONT> </DIV>
<DIV dir=ltr><FONT size=2 face=Arial>hp20:~$ mpdallexit</FONT></DIV><FONT
size=2 face=Arial>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$"
target=_blank>mikec@hp20:~$</A> mpdboot -v -n 2 -f /etc/mpd.hosts<BR>running
mpdallexit on hp20<BR>LAUNCHED mpd on hp20 via<BR>RUNNING: mpd on
hp20<BR>LAUNCHED mpd on hp14 via hp20<BR>RUNNING: mpd on
hp14<BR></DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A>
mpdtrace<BR>hp20<BR>hp14</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$"
target=_blank>mikec@hp20:~$</A> mpdtrace -l<BR>hp20_44192
(172.17.81.20)<BR>hp14_51832 (172.17.81.14)</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$"
target=_blank>mikec@hp20:~$</A> mpdringtest 10<BR>time for 10 loops =
0.0491678714752 seconds</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$"
target=_blank>mikec@hp20:~$</A> mpiexec -n 2 uname -a<BR>Linux hp20
2.6.32-trunk-parisc #1 Mon Jan 11 03:07:31 UTC 2010 parisc GNU/Linux<BR>Linux
hp14 2.6.32-trunk-parisc #1 Mon Jan 11 03:07:31 UTC 2010 parisc
GNU/Linux</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A>
mpiexec -n 1 ./cpi<BR>Process 0 of 1 is on hp20<BR>pi is approximately
3.1415926544231341, Error is 0.0000000008333410<BR>wall clock time =
0.003888<BR><BR></DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A>
mpiexec -n 2 ./cpi<BR>Process 0 of 2 is on hp20<BR>Process 1 of 2 is on
hp14<BR>Fatal error in PMPI_Bcast: Other MPI error, error
stack:<BR>PMPI_Bcast(1302)..................: MPI_Bcast(buf=0xc016e33c,
count=1, MPI_INT, root=0, MPI_COMM_WORLD)
failed<BR>MPIR_Bcast(1031)..................:<BR>MPIR_Bcast_binomial(157)..........:<BR>MPIC_Recv(83).....................:<BR>MPIC_Wait(513)....................:<BR>MPIDI_CH3I_Progress(150)..........:<BR>MPID_nem_mpich2_blocking_recv(948):<BR>MPID_nem_tcp_connpoll(1720).......:<BR>state_listening_handler(1787).....:
accept of socket fd failed - Resource temporarily unavailable<BR>rank 1 in job
2 hp20_44192 caused collective abort of all ranks<BR>
exit status of rank 1: return code 1<BR>Fatal error in PMPI_Bcast: Other MPI
error, error stack:<BR>PMPI_Bcast(1302)..................:
MPI_Bcast(buf=0xc067f33c, count=1, MPI_INT, root=0, MPI_COMM_WORLD)
failed<BR>MPIR_Bcast(1031)..................:<BR>MPIR_Bcast_binomial(187)..........:<BR>MPIC_Send(41).....................:<BR>MPIC_Wait(513)....................:<BR>MPIDI_CH3I_Progress(150)..........:<BR>MPID_nem_mpich2_blocking_recv(948):<BR>MPID_nem_tcp_connpoll(1709).......:
Communication error</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr>Any comments and or suggestions are greatly appreciated.</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr>Mike C.</DIV>
<DIV dir=ltr></FONT><FONT size=2 face=Arial></FONT> </DIV></FONT></DIV>
<DIV dir=ltr><FONT color=#000000 size=2 face=Arial></FONT> </DIV></DIV>
<DIV dir=ltr id=idSignature83318>
<DIV><FONT color=#000000 size=2 face=Arial>
<DIV><FONT color=#000000><FONT face=Arial><SPAN
style="COLOR: black; FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial">
<DIV style="LINE-HEIGHT: normal; MARGIN: 0in 0in 0pt" class=MsoNormal><SPAN
style="COLOR: black; FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial">
<DIV style="LINE-HEIGHT: normal; MARGIN: 0in 0in 0pt" class=MsoNormal>
<DIV dir=ltr id=idSignature64848>
<DIV><FONT color=#000000 size=2 face=Arial>Michael A. Costa</FONT></DIV>
<DIV><FONT size=2 face=Arial>SET <EM>(RCC)</EM>, CCAI-CCNA/CCNP
<EM>(Cisco)</EM>, MInfTech <EM>(Griffith)</EM></FONT></DIV>
<DIV><FONT size=2 face=Arial>Professor - Information Technology
Division</FONT></DIV>
<DIV><FONT size=2 face=Arial>Fanshawe College</FONT></DIV>
<DIV><FONT size=2 face=Arial>G3001</FONT></DIV>
<DIV><FONT size=2 face=Arial>1001 Fanshawe College Boulevard</FONT></DIV>
<DIV><FONT size=2 face=Arial>P.O. Box 7005</FONT></DIV>
<DIV><FONT size=2 face=Arial>London, ON </FONT><FONT size=2 face=Arial>N5Y
5R6</FONT></DIV>
<DIV><FONT size=2 face=Arial>Tel: (519) 452-4291 Fax: (519)
452-1801</FONT></DIV></DIV></DIV>
<DIV style="LINE-HEIGHT: normal; MARGIN: 0in 0in 0pt" class=MsoNormal><SPAN
style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN
style="FONT-FAMILY: 'Times New Roman','serif'; FONT-SIZE: 12pt; mso-fareast-font-family: 'Times New Roman'"><o:p></o:p></SPAN> </DIV>
<DIV style="LINE-HEIGHT: normal; MARGIN: 0in 0in 0pt" class=MsoNormal>
<DIV style="LINE-HEIGHT: normal; MARGIN: 0in 0in 0pt" class=MsoNormal><SPAN
style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN
style="FONT-FAMILY: 'Times New Roman','serif'; FONT-SIZE: 12pt; mso-fareast-font-family: 'Times New Roman'"><o:p></o:p></SPAN></DIV><SPAN
style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN
style="FONT-FAMILY: 'Times New Roman','serif'; FONT-SIZE: 12pt; mso-fareast-font-family: 'Times New Roman'"><o:p></o:p></SPAN></DIV></SPAN></DIV></SPAN></FONT></FONT></DIV></FONT></DIV></DIV></BLOCKQUOTE></BODY></HTML>