[mpich-discuss] mpdboot error message
Cezary Sliwa
sliwa at cft.edu.pl
Wed Jan 20 15:25:58 CST 2010
On Fri, Jan 15, 2010 at 01:37:02PM -0600, mpich-discuss-request at mcs.anl.gov wrote:
> Message: 4
> Date: Fri, 15 Jan 2010 13:36:56 -0600
> From: Dave Goodell <goodell at mcs.anl.gov>
> Subject: Re: [mpich-discuss] mpdboot error message
> To: mpich-discuss at mcs.anl.gov
> Message-ID: <9B4ADB37-A1EC-4F48-A68F-EF2B9F827161 at mcs.anl.gov>
> Content-Type: text/plain; charset=UTF-8; format=flowed; delsp=yes
>
> I suspect you are hitting this bug: https://trac.mcs.anl.gov/projects/mpich2/ticket/974
>
> "mpdboot" is normally issued as a foreground command, not in the
> background. It should exit immediately after setting up the ring. If
> your mpdboot is hanging then that is the real bug. My access to some
> CentOS machines to reproduce/fix always seems to be a few days away,
> so I haven't been able to fix it yet.
Thank you very much for this explanation.
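For reference, the foreground usage described above could be sketched roughly as follows. This is only an illustration, not a fix for ticket 974. The helper names ring_size and boot_ring are made up for this example. It assumes that every ring member, including the local host, is listed one per line in the hosts file, as in the mpd.hosts quoted further down.

```shell
#!/bin/sh
# Sketch only: helper names are hypothetical, not part of MPICH2.

# Count the non-blank lines in an mpd.hosts-style file (one host per line,
# optionally followed by ":ncpus").
ring_size() {
    grep -c '[^[:space:]]' "$1"
}

# Start the ring in the foreground (no trailing '&'), list its members,
# then shut it down. mpdallexit is reached only after mpdboot has returned.
boot_ring() {
    hosts_file=$1
    n=$(ring_size "$hosts_file")
    mpdboot -v -n "$n" -f "$hosts_file" || return 1
    mpdtrace
    mpdallexit
}
```

With the two-line mpd.hosts from the original report, "boot_ring mpd.hosts" would request a ring of 2 mpds. If mpdboot hangs rather than returning, the teardown is never reached, which matches the behaviour Dave calls the real bug.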
>
> In the meantime, my recommendation is to use hydra unless you need one
> of the very few mpd-specific features that still exist:
>
> http://wiki.mcs.anl.gov/mpich2/index.php/Using_the_Hydra_Process_Manager
>
> Out of curiosity, what operating system are you using? CentOS/RHEL?
It is CentOS 4 x86_64.
Cezary Sliwa
>
> -Dave
>
> On Jan 15, 2010, at 11:02 AM, Cezary Śliwa wrote:
>
> >
> > Hello,
> >
> > mpdboot in mpich2-1.2.1 displays an error message when mpdallexit is
> > used:
> >
> > sl2klast2-n5 ~$ mpdboot -v -n 2 &
> > [1] 6793
> > sl2klast2-n5 ~$ running mpdallexit on sl2klast2-n5
> > LAUNCHED mpd on sl2klast2-n5 via
> > RUNNING: mpd on sl2klast2-n5
> > LAUNCHED mpd on sl2klast2-n6 via sl2klast2-n5
> >
> > sl2klast2-n5 ~$ mpdallexit
> > sl2klast2-n5 ~$ mpdboot_sl2klast2-n5 (handle_mpd_output 415): failed
> > to connect to mpd on sl2klast2-n6
> >
> > [1]+ Exit 255 mpdboot -v -n 2
> > sl2klast2-n5 ~$ cat mpd.hosts
> > sl2klast2-n5:4
> > sl2klast2-n6:4
> >
> >
> > Cezary Sliwa
> >
> >