[mpich-discuss] mpdboot fails

Rajeev Thakur thakur at mcs.anl.gov
Tue Oct 26 05:59:55 CDT 2010


Can you try the latest release of MPICH2, 1.3? It uses a different process manager, called Hydra, by default, so you don't need to run mpdboot.
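
A minimal sketch of a launch without mpdboot, assuming a host file named `hosts` (a hypothetical name) that lists the two machines from the quoted output, and using `hostname` as a stand-in for a real MPI program:

```shell
# Hypothetical host file: one machine per line (names taken from the quoted output).
printf '%s\n' foehammer halo.mthiffau.ca > hosts

# Hydra's mpiexec launches the remote processes itself, so no mpdboot step
# is needed beforehand. Here it just runs `hostname` once per process.
if command -v mpiexec >/dev/null 2>&1; then
    mpiexec -f hosts -n 2 hostname
else
    echo "mpiexec not found on PATH" >&2
fi
```

If both host names are printed, the launch itself works and any remaining problem is likely in the MPI communication between the processes.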

Rajeev

On Oct 25, 2010, at 10:55 AM, Matt Thiffault wrote:

> So, I have a number of machines I'm trying to set up with MPICH2, but I'm starting off with just 2. One is behind a NAT router and the other has a public IP, but is protected by an iptables firewall.
> 
> I can ssh back and forth between them, and I have them keyed so that no password is needed once an ssh-agent is running and has been given the passphrase for the RSA key.
> 
> I have MPICH_PORT_RANGE and MPIEXEC_PORT_RANGE set in /etc/profile on both systems to 10000:10100 and both firewalls allow traffic through on those ports (I've tested this with netcat). 
> 
> mpdboot fails with the following output:
> mthiffau@foehammer ~ (255)% mpdboot -n 2 -v -f mpd.hosts
> running mpdallexit on foehammer
> LAUNCHED mpd on foehammer  via  
> RUNNING: mpd on foehammer
> LAUNCHED mpd on halo.mthiffau.ca  via  foehammer
> mpdboot_foehammer (handle_mpd_output 415): failed to connect to mpd on halo.mthiffau.ca
> 
> So, mpd starts locally and on the other machine, but afterwards communication seems to break down. Does the ssh-ing back and forth have to be completely passwordless? Is there something else I have to do to ensure the correct port range gets used?
> 
> I suspect it might be the latter, as even with the environment variables set, mpdcheck -s starts listening on random ports that are outside the range specified.
> 
> Any help would be appreciated, thanks a bunch.
> 
> Matt Thiffault
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss


