[mpich-discuss] mpdboot beheviour
    Dave Goodell 
    goodell at mcs.anl.gov
       
    Fri Jan 15 11:31:03 CST 2010
    
    
  
On Jan 15, 2010, at 10:57 AM, Cezary Śliwa wrote:
> Regarding mpdboot in mpich2-1.2.1. The default is to use --ncpus=1  
> rather than the value from  mpd.hosts. Does it make sense? Why not  
> use the value from mpd.hosts as the default?
This is a longstanding known user interface problem.  It trips people  
up all the time, including me on occasion.  Unfortunately, we are  
unlikely to change the behavior for two reasons: (1) it will break  
compatibility with the thousands of scripts out there that invoke  
mpdboot assuming the old behavior and (2) mpd is receiving a bare  
minimum of maintenance and development at this point because it is  
being replaced by hydra.
> In case of running a job under PBS or SGE, the correct number is in  
> the file. At present, one has to extract this information from the  
> file and put it on the mpdboot command line. This is cumbersome.
I agree wholeheartedly.
> Moreover, if the host running mpdboot is not in mpd.hosts, it makes  
> sense to use --ncpus=0 as the default.
Just FYI, this doesn't actually work the way you would expect.   
mpdboot still basically sets up an mpd as though you had specified -- 
ncpus=1.  You can't use mpd to have a "head node" in a straightforward  
fashion.  The best you can do is use the "-1" option to mpiexec to  
avoid placing processes locally first, but that is a pretty weak  
option too.
Hydra supports running mpiexec on a node that isn't in the hostfile  
(at least with all bootstrap servers that support remote process  
creation).
-Dave
    
    
More information about the mpich-discuss
mailing list