[MPICH] can't set up mpd ring between two nodes

Krishna Chaitanya kris.c1986 at gmail.com
Thu Dec 27 07:31:53 CST 2007


Hi,
     Have you tried the same check with the roles reversed, i.e., with
"inode01" as the client and the other machine as the server?
     The output does indicate that the mpds are up on both machines and
that they can communicate with each other. It would help if you could post
the output you get when you try to run a sample program.
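
     In case it is useful, here is a rough Python sketch of the kind of
one-shot handshake that the mpdcheck output above reflects. It is not
mpdcheck's actual code: the function names are made up for illustration, it
is written for a current Python 3 rather than the Python that mpd runs
under, and only the two message strings are taken from the posted output.
The demo runs both ends on 127.0.0.1; to test in both directions between
two nodes, you would run the server part on one machine and the client part
on the other, and then swap them.

```python
import socket
import threading

# Message strings copied from the mpdcheck output quoted in this thread.
HELLO = b"hello_from_client_to_server"
ACK = b"ack_from_server_to_client"


def serve_once(listener, result):
    """Accept one connection, record the client's message, reply with an ack."""
    conn, _addr = listener.accept()
    with conn:
        result["server_recvd"] = conn.recv(1024)
        conn.sendall(ACK)


def check_client(host, port, timeout=5.0):
    """Connect to host:port, send the hello message, return the server's reply."""
    with socket.create_connection((host, port), timeout=timeout) as sock:
        sock.sendall(HELLO)
        return sock.recv(1024)


def run_local_demo():
    """Run server and client on the loopback interface (hypothetical helper)."""
    listener = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    listener.bind(("127.0.0.1", 0))  # port 0: let the OS pick a free port
    listener.listen(1)
    port = listener.getsockname()[1]

    result = {}
    server = threading.Thread(target=serve_once, args=(listener, result))
    server.start()
    reply = check_client("127.0.0.1", port)
    server.join()
    listener.close()
    return result["server_recvd"], reply


if __name__ == "__main__":
    server_recvd, client_recvd = run_local_demo()
    print("server recvd:", server_recvd.decode())
    print("client recvd:", client_recvd.decode())
```

If a check like this works in one direction but not the other, a firewall
rule or a hostname resolution problem on one of the nodes would be my first
guess.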

Krishna Chaitanya K

On Dec 27, 2007 5:30 PM, jetspeed <ibatis2 at 163.com> wrote:

> Thanks for your reply.
>
> The mpdcheck output looks right, but the mpd ring still can't be set up.
> I will check the iptables settings, as Krishna Chaitanya mentioned.
>
>  mpdcheck -s
> server listening at INADDR_ANY on: inode01 33441
> server has conn on <socket._socketobject object at 0xf7f769c0> from ('10.0.0.12', 32975)
> server successfully recvd msg from client: hello_from_client_to_server
>
>  mpdcheck -c inode01 33441
> client successfully recvd ack from server: ack_from_server_to_client
>
>
> On Wed, 26 Dec 2007 23:08:04 -0600
> "Rajeev Thakur" <thakur at mcs.anl.gov> wrote:
>
> > The networking environment on the machines may not be set up correctly.
> > To debug the problem, you can use the mpdcheck utility as described in
> > the installation guide.
> >
> > Rajeev
> >
> >
> > > -----Original Message-----
> > > From: owner-mpich-discuss at mcs.anl.gov
> > > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of jetspeed
> > > Sent: Wednesday, December 26, 2007 7:49 AM
> > > To: mpi
> > > Subject: [MPICH] can't set up mpd ring between two nodes
> > >
> > > Hi all:
> > >
> > >     I installed mvapich2, which came with OFED 1.2.5.
> > >
> > >     1. When I use mpdboot on one machine, I get:
> > >       mpdboot_inode02 (handle_mpd_output 359): failed to ping mpd on inode02; recvd output={}
> > >     2. When I try to set up the mpd ring by hand with mpd, as
> > > described in the mpich2 user guide:
> > >                     mpd &                       on node02
> > >                     mpd -h node02 -p port       on node01
> > >     I get:
> > > on node01:  (the latter mpd)
> > > inode01_33435 (connect_lhs 621): invalid challenge from inode02 32969: {}
> > > inode01_33435 (enter_ring 566): lhs connect failed
> > > inode01_33435 (run 233): failed to enter ring
> > >
> > > on node02:  (the first mpd )
> > >
> > > inode02_32969: mpd_uncaught_except_tb handling:
> > >   exceptions.TypeError: sequence item 0: expected string, int found
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  733  handle_ring_listener_connection
> > >         newsock.correctChallengeResponse = \
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpdlib.py  488  handle_active_streams
> > >         handler(stream,*args)
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  266  runmainloop
> > >         rv = self.streamHandler.handle_active_streams(timeout=8.0)
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  240  run
> > >         self.runmainloop()
> > >     /usr/mpi/gcc/mvapich2-0.9.8-15/bin/mpd  1344  ?
> > >         mpd.run()
> > >
> > >
> > >
> > > Has anyone encountered this problem?
> > > Thanks in advance.
> > >
> > >
> > >
> >
>
>


-- 
In the middle of difficulty, lies opportunity