[MPICH] No route to host problem

Ralph M. Butler rbutler at mtsu.edu
Thu Jun 30 16:16:28 CDT 2005


Hi:

I cannot say for sure what the problem is here.  Typically, a
'no route to host' error is a network problem outside the
control of mpd/mpich2.  For configuration problems, we generally
suggest that folks begin debugging with the mpdcheck program,
as described in the troubleshooting section.
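
To separate a plain network fault from an mpd configuration issue, it can
help to try a bare TCP connection between the two hosts with no MPI code
involved.  The following is only a rough sketch, not part of mpd or
mpdcheck: the probe() helper and its hint strings are made up for
illustration, and the hints are general TCP/IP rules of thumb.

```python
import errno
import socket

# General TCP/IP rules of thumb (assumptions, not mpd-specific diagnostics).
HINTS = {
    errno.EHOSTUNREACH: "no route to host: check routing, interface "
                        "addresses, and firewall REJECT rules",
    errno.ENETUNREACH: "network unreachable: check the local routing table",
    errno.ECONNREFUSED: "host reachable, but nothing is listening on that port",
    errno.ETIMEDOUT: "no answer: packets may be silently dropped",
}


def probe(host, port, timeout=5.0):
    """Try one TCP connection and return (ok, description)."""
    try:
        # The connection is closed again as soon as it succeeds.
        with socket.create_connection((host, port), timeout=timeout):
            return True, "connected"
    except socket.timeout:
        # No reply within the timeout.
        return False, HINTS[errno.ETIMEDOUT]
    except OSError as exc:
        # Map the errno to a hint, or fall back to the raw error text.
        return False, HINTS.get(exc.errno, f"unexpected error: {exc}")
```

Run from the node that reports the error, something like
probe("192.168.9.3", 32805) would repeat the failing connection from the
trace quoted below.  That port is assigned per run, so a current one would
have to be substituted.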

--ralph

> > Date: Thu, 30 Jun 2005 16:05:34 -0500
> > From: Rajeev Thakur <thakur at mcs.anl.gov>
> > To: 'Ralph M. Butler' <rbutler at mtsu.edu>
> > Cc: 'Rusty Lusk' <lusk at mcs.anl.gov>
> > Subject: FW: [MPICH] No route to host problem
> >
> > Ralph,
> >       Any idea what this is? You can reply directly to him
> > with a cc to the list.
> >
> > Thanks,
> > Rajeev
> >
> > -----Original Message-----
> > From: owner-mpich-discuss at mcs.anl.gov
> > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Patrick E. Kane
> > Sent: Thursday, June 30, 2005 9:52 AM
> > To: mpich-discuss at mcs.anl.gov
> > Subject: Re: [MPICH] No route to host problem
> >
> > Below is more info on the "No route to host" problem.
> > I did an "strace" of the python process as it gave this error msg:
> >
> >    0: aborting job:
> >    0: Fatal error in MPI_Bcast: Other MPI error, error stack:
> >    0: MPI_Bcast(827): MPI_Bcast(buf=0xbffff88c, count=1, MPI_INT, root=0,
> >
> > Here is an interesting part of the Linux system call trace log:
> >  ...
> >  connect(7, {sa_family=AF_INET, sin_port=htons(32805),
> >          sin_addr=inet_addr("192.168.9.3")},
> >          16) = -1 EINPROGRESS (Operation now in progress)
> >  poll([{fd=4, events=POLLIN}, {fd=7, events=POLLOUT,
> >       revents=POLLERR|POLLHUP}], 2, -1) = 1
> >  getpeername(7, 0xbffff480,
> >              [16])  = -1 ENOTCONN (Transport endpoint is not connected)
> >  getsockopt(7, SOL_SOCKET, SO_ERROR, [113], [4]) = 0
> >  write(2, "aborting job:\nFatal error in MPI_Bcast: Other MPI error"...
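
For reference, the value 113 returned by getsockopt(SO_ERROR) in the trace
above is EHOSTUNREACH on Linux, whose message is "No route to host".  The
sequence in the trace (non-blocking connect, wait for the socket to become
writable, then read SO_ERROR) can be repeated outside of MPI with a short
script.  This is a sketch under assumptions: the function names are
invented for illustration, and select() stands in for the poll() call seen
in the trace.

```python
import errno
import os
import select
import socket


def nonblocking_connect_error(host, port, timeout=5.0):
    """Return the pending socket error after a non-blocking connect.

    0 means the connection succeeded, None means nothing happened within
    the timeout, and any other value is an errno such as EHOSTUNREACH.
    """
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    try:
        sock.setblocking(False)
        # Normally returns EINPROGRESS right away, like connect() in the trace.
        rc = sock.connect_ex((host, port))
        if rc not in (0, errno.EINPROGRESS):
            return rc
        # A finished connect, successful or failed, marks the socket writable.
        _, writable, _ = select.select([], [sock], [], timeout)
        if not writable:
            return None
        # Same call as getsockopt(7, SOL_SOCKET, SO_ERROR, ...) in the trace.
        return sock.getsockopt(socket.SOL_SOCKET, socket.SO_ERROR)
    finally:
        sock.close()


def describe(err):
    """Turn the result of nonblocking_connect_error() into readable text."""
    if err is None:
        return "timed out"
    if err == 0:
        return "connected"
    return f"{errno.errorcode.get(err, 'unknown')} ({os.strerror(err)})"
```

If describe(nonblocking_connect_error("192.168.9.3", port)) reports
EHOSTUNREACH for a port that a process on that host is known to be
listening on, the failure is reproduced without mpd or MPICH2 in the
picture, which points at routing or firewall settings rather than the MPI
installation.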



