[MPICH] can't set up mpd ring between two nodes

Rajeev Thakur thakur at mcs.anl.gov
Fri Dec 28 01:09:47 CST 2007


Did you run all steps of mpdcheck as described in the installation guide? Is
there a firewall running on the machines? 

> -----Original Message-----
> From: jetspeed [mailto:ibatis2 at 163.com] 
> Sent: Thursday, December 27, 2007 6:01 AM
> To: Rajeev Thakur
> Cc: mpich-discuss at mcs.anl.gov
> Subject: Re: [MPICH] can't set up mpd ring between two nodes
> 
> Thanks for your reply
> 
> the mpdcheck seems right. but the mpd ring  can't be set up. 
> I will check the iptables as Krishna Chaitanya mentioned.
> 
>  mpdcheck -s
> server listening at INADDR_ANY on: inode01 33441 server has 
> conn on <socket._socketobject object at 0xf7f769c0> from 
> ('10.0.0.12', 32975) server successfully recvd msg from 
> client: hello_from_client_to_server
> 
>  mpdcheck -c inode01 33441
> client successfully recvd ack from server: ack_from_server_to_client
> 
> 
> On Wed, 26 Dec 2007 23:08:04 -0600
> "Rajeev Thakur" <thakur at mcs.anl.gov> wrote:
> 
> > The networking environment on the machines may not be set up 
> > correctly. To debug the problem, you can use the mpdcheck 
> utility as 
> > described in the installation guide.
> > 
> > Rajeev
> >  
> > 
> > > -----Original Message-----
> > > From: owner-mpich-discuss at mcs.anl.gov 
> > > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of jetspeed
> > > Sent: Wednesday, December 26, 2007 7:49 AM
> > > To: mpi
> > > Subject: [MPICH] can't set up mpd ring between two nodes
> > > 
> > > Hi all:
> > > 	
> > > 	I installed mvapich2 , which is with the OFED 1.2.5.
> > > 	
> > > 	1. when I use mpdboot on a machine, I got :
> > > 	  mpdboot_inode02 (handle_mpd_output 359): failed to 
> ping mpd on 
> > > inode02; recvd output={}
> > > 	2.  when I try to use mpd to set up mpd ring, as the 
> user guide of 
> > > mpich2:
> > > 			mpd &                       on node02
> > > 			mpd -h node02 -p port       on node01
> > > 	I got:
> > > on node01:  (the latter mpd)
> > > inode01_33435 (connect_lhs 621): invalid challenge from
> > > inode02 32969: {}
> > > inode01_33435 (enter_ring 566): lhs connect failed
> > > inode01_33435 (run 233): failed to enter ring
> > > 
> > > on node02:  (the first mpd )
> > > 	
> > > inode02_32969: mpd_uncaught_except_tb handling:
> > >   exceptions.TypeError: sequence item 0: expected string, 
> int found
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  733 
> > > handle_ring_listener_connection
> > >         newsock.correctChallengeResponse = \
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  488  
> > > handle_active_streams        handler(stream,*args)
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  266  runmainloop
> > >         rv = self.streamHandler.handle_active_streams(timeout=8.0)
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  240  run
> > >         self.runmainloop()
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  1344  ?
> > >         mpd.run()
> > > 
> > > 
> > > 
> > > Has anyone encountered this problem?
> > > Thanks in advance.
> > > 
> > > 
> > > 
> > 
> 
> 




More information about the mpich-discuss mailing list