[MPICH] can't set up mpd ring between two nodes
Rajeev Thakur
thakur at mcs.anl.gov
Fri Dec 28 01:09:47 CST 2007
Did you run all steps of mpdcheck as described in the installation guide? Is
there a firewall running on the machines?
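
One rough way to test for a firewall, independent of mpdcheck, is to attempt a plain TCP connection from one node to the port the other node's mpd is listening on. The following is a minimal sketch only; `can_connect` is a hypothetical helper name, not part of MPICH2 or mpd, and the host and port must be taken from your own mpd output.

```python
import socket


def can_connect(host, port, timeout=3.0):
    """Return True if a TCP connection to (host, port) succeeds in time.

    Hypothetical helper for illustration; not part of mpd or mpdcheck.
    """
    try:
        sock = socket.create_connection((host, port), timeout=timeout)
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
    sock.close()
    return True
```

For example, calling can_connect("inode02", 32969) from inode01 would try the host and port that appear in the output quoted below. A False result while mpd is running on the other node suggests that something, possibly iptables, is blocking the port. A True result shows only that the port is reachable, not that the ring handshake will succeed.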
> -----Original Message-----
> From: jetspeed [mailto:ibatis2 at 163.com]
> Sent: Thursday, December 27, 2007 6:01 AM
> To: Rajeev Thakur
> Cc: mpich-discuss at mcs.anl.gov
> Subject: Re: [MPICH] can't set up mpd ring between two nodes
>
> Thanks for your reply
>
> The mpdcheck output looks right, but the mpd ring still can't be set up.
> I will check iptables, as Krishna Chaitanya suggested.
>
> mpdcheck -s
> server listening at INADDR_ANY on: inode01 33441
> server has conn on <socket._socketobject object at 0xf7f769c0> from ('10.0.0.12', 32975)
> server successfully recvd msg from client: hello_from_client_to_server
>
> mpdcheck -c inode01 33441
> client successfully recvd ack from server: ack_from_server_to_client
>
>
> On Wed, 26 Dec 2007 23:08:04 -0600
> "Rajeev Thakur" <thakur at mcs.anl.gov> wrote:
>
> > The networking environment on the machines may not be set up
> > correctly. To debug the problem, you can use the mpdcheck utility as
> > described in the installation guide.
> >
> > Rajeev
> >
> >
> > > -----Original Message-----
> > > From: owner-mpich-discuss at mcs.anl.gov
> > > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of jetspeed
> > > Sent: Wednesday, December 26, 2007 7:49 AM
> > > To: mpi
> > > Subject: [MPICH] can't set up mpd ring between two nodes
> > >
> > > Hi all:
> > >
> > > I installed mvapich2, the version that comes with OFED 1.2.5.
> > >
> > > 1. When I run mpdboot on one machine, I get:
> > > mpdboot_inode02 (handle_mpd_output 359): failed to ping mpd on inode02; recvd output={}
> > > 2. When I try to set up the mpd ring by hand, as described in the mpich2 user guide:
> > > mpd & on node02
> > > mpd -h node02 -p port on node01
> > > I get:
> > > on node01 (the second mpd):
> > > inode01_33435 (connect_lhs 621): invalid challenge from inode02 32969: {}
> > > inode01_33435 (enter_ring 566): lhs connect failed
> > > inode01_33435 (run 233): failed to enter ring
> > >
> > > on node02 (the first mpd):
> > >
> > > inode02_32969: mpd_uncaught_except_tb handling:
> > > exceptions.TypeError: sequence item 0: expected string, int found
> > > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py 733 handle_ring_listener_connection
> > >     newsock.correctChallengeResponse = \
> > > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py 488 handle_active_streams
> > >     handler(stream,*args)
> > > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 266 runmainloop
> > >     rv = self.streamHandler.handle_active_streams(timeout=8.0)
> > > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 240 run
> > >     self.runmainloop()
> > > /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd 1344 ?
> > >     mpd.run()
> > >
> > >
> > >
> > > Has anyone encountered this problem?
> > > Thanks in advance.
> > >
> > >
> > >
> >
>
>
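
A note on the traceback quoted above: "sequence item 0: expected string, int found" is the message Python 2 gives when a string join receives a sequence whose first item is an integer. The full statement at mpdlib.py line 733 is cut off in the traceback, so the following is only an illustration of that class of error, not the actual mpd code; `naive_join` and `join_fields` are hypothetical names.

```python
def naive_join(fields):
    """Join fields as-is; raises TypeError if any item is not a string."""
    return "".join(fields)


def join_fields(fields):
    """Join fields after converting every item to a string first."""
    return "".join(str(f) for f in fields)


if __name__ == "__main__":
    values = [12345, "secret"]  # made-up values: an int mixed with a string
    try:
        naive_join(values)
    except TypeError as exc:
        print("naive_join failed:", exc)
    print("join_fields gives:", join_fields(values))
```

If the failing line in mpdlib.py does build a string this way, the integer may be coming from configuration or from a value computed at runtime. That is an assumption to verify against the source file, not a diagnosis.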