[MPICH] can't set up mpd ring between two nodes
jetspeed
ibatis2 at 163.com
Thu Dec 27 06:00:31 CST 2007
Thanks for your reply
the mpdcheck seems right. but the mpd ring can't be set up. I will check the iptables as Krishna Chaitanya mentioned.
mpdcheck -s
server listening at INADDR_ANY on: inode01 33441
server has conn on <socket._socketobject object at 0xf7f769c0> from ('10.0.0.12', 32975)
server successfully recvd msg from client: hello_from_client_to_server
mpdcheck -c inode01 33441
client successfully recvd ack from server: ack_from_server_to_client
On Wed, 26 Dec 2007 23:08:04 -0600
"Rajeev Thakur" <thakur at mcs.anl.gov> wrote:
> The networking environment on the machines may not be set up correctly. To
> debug the problem, you can use the mpdcheck utility as described in the
> installation guide.
>
> Rajeev
>
>
> > -----Original Message-----
> > From: owner-mpich-discuss at mcs.anl.gov
> > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of jetspeed
> > Sent: Wednesday, December 26, 2007 7:49 AM
> > To: mpi
> > Subject: [MPICH] can't set up mpd ring between two nodes
> >
> > Hi all:
> >
> > I installed mvapich2 , which is with the OFED 1.2.5.
> >
> > 1. when I use mpdboot on a machine, I got :
> > mpdboot_inode02 (handle_mpd_output 359): failed to
> > ping mpd on inode02; recvd output={}
> > 2. when I try to use mpd to set up mpd ring, as the
> > user guide of mpich2:
> > mpd & on node02
> > mpd -h node02 -p port on node01
> > I got:
> > on node01: (the latter mpd)
> > inode01_33435 (connect_lhs 621): invalid challenge from
> > inode02 32969: {}
> > inode01_33435 (enter_ring 566): lhs connect failed
> > inode01_33435 (run 233): failed to enter ring
> >
> > on node02: (the first mpd )
> >
> > inode02_32969: mpd_uncaught_except_tb handling:
> > exceptions.TypeError: sequence item 0: expected string, int found
> > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py 733
> > handle_ring_listener_connection
> > newsock.correctChallengeResponse = \
> > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py 488
> > handle_active_streams handler(stream,*args)
> > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 266 runmainloop
> > rv = self.streamHandler.handle_active_streams(timeout=8.0)
> > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 240 run
> > self.runmainloop()
> > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 1344 ?
> > mpd.run()
> >
> >
> >
> > Has anyone encountered this problem?
> > Thanks in advance.
> >
> >
> >
>
More information about the mpich-discuss
mailing list