<HTML dir=ltr><HEAD>
<META http-equiv=Content-Type content="text/html; charset=unicode">
<META content="MSHTML 6.00.6001.18248" name=GENERATOR></HEAD>
<BODY>
<DIV id=idOWAReplyText94516 dir=ltr>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2>I have been struggling with communication errors when ever I run mpiexec. This installation is on PARISC based cluster. Running mpich2-1.2.1p1, I have configured it with --with-device=ch3:nemesis.
<DIV dir=ltr><FONT face=Arial size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial size=2>Currently only 2 nodes are in the ring, </FONT><FONT face=Arial size=2>hp20 and hp14 for testing/setup purposes.</FONT></DIV>
<DIV dir=ltr><FONT face=Arial size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial size=2>The following steps may shed some light on the problem, which I'm sure is something I have omitted or failed to do with the initial installation/configuration. It appears that I can run non MPI programs OK, but MPI code like cpi or hello fail.</FONT></DIV>
<DIV dir=ltr><FONT face=Arial size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial size=2></FONT> </DIV>
<DIV dir=ltr><FONT face=Arial size=2>hp20:~$ mpdallexit</FONT></DIV><FONT face=Arial size=2>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpdboot -v -n 2 -f /etc/mpd.hosts<BR>running mpdallexit on hp20<BR>LAUNCHED mpd on hp20 via<BR>RUNNING: mpd on hp20<BR>LAUNCHED mpd on hp14 via hp20<BR>RUNNING: mpd on hp14<BR></DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpdtrace<BR>hp20<BR>hp14</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpdtrace -l<BR>hp20_44192 (172.17.81.20)<BR>hp14_51832 (172.17.81.14)</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpdringtest 10<BR>time for 10 loops = 0.0491678714752 seconds</DIV>
<DIV dir=ltr><BR><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpiexec -n 2 uname -a<BR>Linux hp20 2.6.32-trunk-parisc #1 Mon Jan 11 03:07:31 UTC 2010 parisc GNU/Linux<BR>Linux hp14 2.6.32-trunk-parisc #1 Mon Jan 11 03:07:31 UTC 2010 parisc GNU/Linux</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpiexec -n 1 ./cpi<BR>Process 0 of 1 is on hp20<BR>pi is approximately 3.1415926544231341, Error is 0.0000000008333410<BR>wall clock time = 0.003888<BR><BR></DIV>
<DIV dir=ltr><A href="mailto:mikec@hp20:~$" target=_blank>mikec@hp20:~$</A> mpiexec -n 2 ./cpi<BR>Process 0 of 2 is on hp20<BR>Process 1 of 2 is on hp14<BR>Fatal error in PMPI_Bcast: Other MPI error, error stack:<BR>PMPI_Bcast(1302)..................: MPI_Bcast(buf=0xc016e33c, count=1, MPI_INT, root=0, MPI_COMM_WORLD) failed<BR>MPIR_Bcast(1031)..................:<BR>MPIR_Bcast_binomial(157)..........:<BR>MPIC_Recv(83).....................:<BR>MPIC_Wait(513)....................:<BR>MPIDI_CH3I_Progress(150)..........:<BR>MPID_nem_mpich2_blocking_recv(948):<BR>MPID_nem_tcp_connpoll(1720).......:<BR>state_listening_handler(1787).....: accept of socket fd failed - Resource temporarily unavailable<BR>rank 1 in job 2 hp20_44192 caused collective abort of all ranks<BR> exit status of rank 1: return code 1<BR>Fatal error in PMPI_Bcast: Other MPI error, error stack:<BR>PMPI_Bcast(1302)..................: MPI_Bcast(buf=0xc067f33c, count=1, MPI_INT, root=0, MPI_COMM_WORLD) failed<BR>MPIR_Bcast(1031)..................:<BR>MPIR_Bcast_binomial(187)..........:<BR>MPIC_Send(41).....................:<BR>MPIC_Wait(513)....................:<BR>MPIDI_CH3I_Progress(150)..........:<BR>MPID_nem_mpich2_blocking_recv(948):<BR>MPID_nem_tcp_connpoll(1709).......: Communication error</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr>Any comments and or suggestions are greatly appreciated.</DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr> </DIV>
<DIV dir=ltr>Mike C.</DIV>
<DIV dir=ltr></FONT><FONT face=Arial size=2></FONT> </DIV></FONT></DIV>
<DIV dir=ltr><FONT face=Arial color=#000000 size=2></FONT> </DIV></DIV>
<DIV id=idSignature83318 dir=ltr>
<DIV><FONT face=Arial color=#000000 size=2>
<DIV><FONT color=#000000><FONT face=Arial><SPAN style="FONT-SIZE: 10pt; COLOR: black; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial">
<DIV class=MsoNormal style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal"><SPAN style="FONT-SIZE: 10pt; COLOR: black; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial">
<DIV class=MsoNormal style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal">
<DIV id=idSignature64848 dir=ltr>
<DIV><FONT face=Arial color=#000000 size=2>Michael A. Costa</FONT></DIV>
<DIV><FONT face=Arial size=2>SET <EM>(RCC)</EM>, CCAI-CCNA/CCNP <EM>(Cisco)</EM>, MInfTech <EM>(Griffith)</EM></FONT></DIV>
<DIV><FONT face=Arial size=2>Professor - Information Technology Division</FONT></DIV>
<DIV><FONT face=Arial size=2>Fanshawe College</FONT></DIV>
<DIV><FONT face=Arial size=2>G3001</FONT></DIV>
<DIV><FONT face=Arial size=2>1001 Fanshawe College Boulevard</FONT></DIV>
<DIV><FONT face=Arial size=2>P.O. Box 7005</FONT></DIV>
<DIV><FONT face=Arial size=2>London, ON </FONT><FONT face=Arial size=2>N5Y 5R6</FONT></DIV>
<DIV><FONT face=Arial size=2>Tel: (519) 452-4291 Fax: (519) 452-1801</FONT></DIV></DIV></DIV>
<DIV class=MsoNormal style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal"><SPAN style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN style="FONT-SIZE: 12pt; FONT-FAMILY: 'Times New Roman','serif'; mso-fareast-font-family: 'Times New Roman'"><?xml:namespace prefix = o ns = "urn:schemas-microsoft-com:office:office" /><o:p></o:p></SPAN> </DIV>
<DIV class=MsoNormal style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal">
<DIV class=MsoNormal style="MARGIN: 0in 0in 0pt; LINE-HEIGHT: normal"><SPAN style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN style="FONT-SIZE: 12pt; FONT-FAMILY: 'Times New Roman','serif'; mso-fareast-font-family: 'Times New Roman'"><o:p></o:p></SPAN></DIV><SPAN style="FONT-SIZE: 10pt; mso-ascii-font-family: Arial; mso-fareast-font-family: 'Times New Roman'; mso-hansi-font-family: Arial; mso-bidi-font-family: Arial"></SPAN><SPAN style="FONT-SIZE: 12pt; FONT-FAMILY: 'Times New Roman','serif'; mso-fareast-font-family: 'Times New Roman'"><o:p></o:p></SPAN></DIV></SPAN></DIV></SPAN></FONT></FONT></DIV></FONT></DIV></DIV></BODY></HTML>