[MPICH] can't set up mpd ring between two nodes

jetspeed ibatis2 at 163.com
Thu Dec 27 06:00:31 CST 2007


Thanks for your reply

the mpdcheck seems right. but the mpd ring  can't be set up. I will check the iptables as Krishna Chaitanya mentioned.

 mpdcheck -s
server listening at INADDR_ANY on: inode01 33441
server has conn on <socket._socketobject object at 0xf7f769c0> from ('10.0.0.12', 32975)
server successfully recvd msg from client: hello_from_client_to_server

 mpdcheck -c inode01 33441
client successfully recvd ack from server: ack_from_server_to_client


On Wed, 26 Dec 2007 23:08:04 -0600
"Rajeev Thakur" <thakur at mcs.anl.gov> wrote:

> The networking environment on the machines may not be set up correctly. To
> debug the problem, you can use the mpdcheck utility as described in the
> installation guide.
> 
> Rajeev
>  
> 
> > -----Original Message-----
> > From: owner-mpich-discuss at mcs.anl.gov 
> > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of jetspeed
> > Sent: Wednesday, December 26, 2007 7:49 AM
> > To: mpi
> > Subject: [MPICH] can't set up mpd ring between two nodes
> > 
> > Hi all:
> > 	
> > 	I installed mvapich2 , which is with the OFED 1.2.5.
> > 	
> > 	1. when I use mpdboot on a machine, I got :
> > 	  mpdboot_inode02 (handle_mpd_output 359): failed to 
> > ping mpd on inode02; recvd output={}
> > 	2.  when I try to use mpd to set up mpd ring, as the 
> > user guide of mpich2:
> > 			mpd &                       on node02
> > 			mpd -h node02 -p port       on node01
> > 	I got:
> > on node01:  (the latter mpd)
> > inode01_33435 (connect_lhs 621): invalid challenge from 
> > inode02 32969: {}
> > inode01_33435 (enter_ring 566): lhs connect failed
> > inode01_33435 (run 233): failed to enter ring
> > 
> > on node02:  (the first mpd )
> > 	
> > inode02_32969: mpd_uncaught_except_tb handling:
> >   exceptions.TypeError: sequence item 0: expected string, int found
> >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  733  
> > handle_ring_listener_connection
> >         newsock.correctChallengeResponse = \
> >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  488  
> > handle_active_streams        handler(stream,*args)
> >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  266  runmainloop
> >         rv = self.streamHandler.handle_active_streams(timeout=8.0)
> >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  240  run
> >         self.runmainloop()
> >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  1344  ?
> >         mpd.run()
> > 
> > 
> > 
> > Has anyone encountered this problem?
> > Thanks in advance.
> > 
> > 
> > 
> 




More information about the mpich-discuss mailing list