[mpich-discuss] SMPD, Problem launching when using -host

Jayesh Krishna jayesh at mcs.anl.gov
Fri Oct 17 12:22:54 CDT 2008


 Hi,
  Yes, if you need to set the export PATH to all the executables you
should use the "-genvlist" (& friends).

Regards,
Jayesh

-----Original Message-----
From: James S Perrin [mailto:james.s.perrin at manchester.ac.uk] 
Sent: Friday, October 17, 2008 12:19 PM
To: Jayesh Krishna
Cc: mpich-discuss at mcs.anl.gov
Subject: Re: [mpich-discuss] SMPD, Problem launching when using -host

Hi,

	The -path option is of no use as I think mpiexec only uses this
path to find the executable. I don't think that it is then set in the
environment that the executable is launched in, hence the executable still
can't find DLLs.

	Using -genvlist PATH or similar seems to work however.

Regards
James

Jayesh Krishna wrote:
>  Hi,
>   It is recommended that you use the "-path" option available with 
> mpiexec to specify the path to the executable.
> 
> Regards,
> Jayesh
> 
> -----Original Message-----
> From: James S Perrin [mailto:james.s.perrin at manchester.ac.uk]
> Sent: Friday, October 10, 2008 4:57 AM
> To: Jayesh Krishna
> Cc: mpich-discuss at mcs.anl.gov
> Subject: Re: [mpich-discuss] SMPD, Problem launching when using -host
> 
> Hi,
> 
> I have found the reason why my executable is failing to start, however I
>   think -host is not behaving as it should or at least the 
> documentation needs clarifying.
> 
> I guessed that using -host was somehow changing the executable's 
> environment and so it is failing to start correctly because it 
> couldn't find a dll.
> 
> On windows the PATH variable should be made up of the system wide 
> settings and the user specific additions:
> 
> ie echo %PATH% => <system settings>;<user settings>
> 
> The user settings are required to launch the process. When I launch as
> follows:
> 
> mpiexec -localroot -n 1 master : -n 1 slave
> 
> both get the path setting as above, however if I use
> 
> mpiexec -localroot -n 1 master : -host roobarb -n 1 slave
> 
> process 1 has PATH=<system settings>;<user settings> but process 2 has 
> PATH=<system settings> only
> 
> I have no idea why the following works but it does, if I add -host 
> roobarb to the process 1 process 2 now gets the full PATH variable
> 
> mpiexec -localroot -host roobarb -n 1 master : -host roobarb -n 1 
> slave
> 
> Final permutation, if I now don't specify -localroot both processes 
> only get the only the system settings for PATH:
> 
> mpiexec -host roobarb -n 1 master : -host roobarb -n 1 slave
> 
> In summary using -host only the system path settings are used and not 
> the user specific settings. Is this a security feature or a 
> non-iteractive login issue c.f bash under linux the .bashrc is not 
> executed for processes started remotely?
> 
> A little extra testing confirmed that when process gets both the 
> system and user path settings it is getting this from the current cmd
shell.
> 
> The solution is to either make sure paths are added to the system path 
> variable or launch via a script that sets up the environment for each 
> processes though I would have like to avoid this if possible. The 
> first is a pain for development and the later a pain for user
installations.
> 
> FYI I was examining the PATH variable using:
> 
> mpiexec -l -host roobarb -n 1 env : -host roobarb -n 1 env | grep 
> \]PATH=
> 
> I have the UNIX commands env and grep in my PATH.
> 
> Regards
> James
> 
> Jayesh Krishna wrote:
>  >  Hi,
>  >   Can you send us the debug output of mpiexec and smpd ? Please
follow
>  > the instructions below to send us the debug output,  >  > # Stop 
> any instances of smpd using the command, smpd -stop # Start  > smpd in 
> the debug mode using the command, smpd -d # Run a non-MPI  > program 
> with mpiexec in the verbose mode using the command, mpiexec  > 
> -verbose -n 1 hostname : -host IPADDRESS_OF_roobarb -n  > 1 hostname  
> >  > # Run an MPI program (cpi.exe provided with MPICH2) with mpiexec 
> in  > the verbose mode using the command, mpiexec -verbose -n 1 
> cpi.exe :
>  > -host IPADDRESS_OF_roobarb -n 1 cpi.exe  >  > # Send us the 
> debug/verbose outputs of mpiexec and smpd.
>  >
>  >   Let us know the results.
>  >
>  > Regards,
>  > Jayesh
>  >
>  > -----Original Message-----
>  > From: owner-mpich-discuss at mcs.anl.gov  > 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of James S Perrin  
> > Sent: Tuesday, October 07, 2008 5:25 AM  > Cc: 
> mpich-discuss at mcs.anl.gov  > Subject: Re: [mpich-discuss] SMPD, 
> Problem launching when using -host  >  > Hi,  >
>  >      No I get the same error if I use the ipaddress.
>  >
>  > Regards
>  > James
>  >
>  >
>  > Jayesh Krishna wrote:
>  >  >  Hi,
>  >  >   Does it work if you specify the ipaddress of the machine instead
of
>  >  > hostname (mpiexec -n 1 master : -host IPADDRESS_OF_roobarb -n 1
> slave) ?
>  >  >
>  >  > Regards,
>  >  > Jayesh
>  >  >
>  >  > -----Original Message-----
>  >  > From: James S Perrin [mailto:james.s.perrin at manchester.ac.uk]
>  >  > Sent: Monday, October 06, 2008 5:18 AM  > To: Jayesh Krishna  >
Cc:
>  > mpich-discuss at mcs.anl.gov  > Subject: Re: [mpich-discuss] SMPD,  > 
> Problem launching when using -host  >  > Hi,  >  > Jayesh Krishna  > 
> wrote:
>  >  >  >  Hi,
>  >  >  >
>  >  >  >  >> mpiexec -n 1 -host roobarb master : -n 1 slave
>  >  >  >         The command above("-host" option specified for only one
>  >  >  > executable) works for me. What is the error message that you 
> get  > >  > (Provide us with the snapshot of your command and the 
> error  > output. It  > > would also help us if you provide more 
> details - Is  > roobarb a remote  > > machine ? etc) ?
>  >  >
>  >  > The error is:
>  >  >
>  >  > [0] PMI_Init failed: FAIL - init called when another process has  
> >  > exited without calling init Fatal error in MPI_Init_thread: Other 
> MPI  > > error, error stack:
>  >  > MPIR_Init_thread(294): Initialization failed  >  > 
> MPID_Init(82)........: channel initialization failed  >  > 
> MPID_Init(333).......: PMI_Init returned -1unable to read the cmd  >  
> > header on the pmi context, generic socket failure, error stack:
>  >  > MPIDU_Sock_wait(2603): The specified network name is no longer  
> >  > available. (errno 64).
>  >  >
>  >  > job aborted:
>  >  > rank: node: exit code[: error message]  > 0: ROOBARB: 3: Fatal  
> > error in MPI_Init_thread: Other MPI error, error  > stack:
>  >  > MPIR_Init_thread(294): Initialization failed  >  > 
> MPID_Init(82)........: channel initialization failed  >  > 
> MPID_Init(333).......: PMI_Init returned -1  > 1: roobarb: -1073741515  
> > >  > The second process is not starting for some reason.
>  >  >
>  >  > roobarb happens to be the local machine in this case but the  > 
> problem  > also occurs on a cluster.
>  >  >
>  >  > It will launch correctly if I use:
>  >  >
>  >  > mpiexec -n 1 master : -n 1 slave - SUCCESS  >  > which should be 
> no  > different from:
>  >  >
>  >  > mpiexec -n 1 master : -host roobarb -n 1 slave - FAILS  >  > 
> when  > everything is running on roobarb.
>  >  >
>  >  >  >  >> mpiexec -localroot -n 1 roobarb master : -host roobarb -n 
> 1  > > slave  >
>  >  >  >         When using the "-localroot" option you should not 
> specify the
>  >  >  > hostname for the 1st executable. The command should be,  >  >  
> > >>  > mpiexec -localroot -n 1 master : -host roobarb -n 1 slave  >  
> >  > sorry typo I meant if would work I used:
>  >  >
>  >  > mpiexec -localroot -host roobarb -n 1  master : -host roobarb -n 
> 1  > > slave  >  > Regards  > James  >  >  >  >  > -----Original  > 
> Message-----  >  > From: owner-mpich-discuss at mcs.anl.gov  >  >  > 
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of James S Perrin  
> > > > Sent: Friday, October 03, 2008 12:13 PM  > To: mpich  > Subject:
>  >  > [mpich-discuss] SMPD, Problem launching when using -host  >  > Hi,
>  >  >  >      Processes fail to start if -host is used for only some but
not
>  >  >  > all processes when launching. ie the machines that some  > 
> processes  > > launch on is left up to the smpd to allocate.
>  >  >  >
>  >  >  > eg
>  >  >  >
>  >  >  > mpiexec -n 1 -host roobarb master : -n 1 slave  >  > when  >  
> > -localroot is used the following fails unless -host is also  >  >  > 
> specified for the master.
>  >  >  >
>  >  >  > mpiexc -localroot -n 1 roobarb master : -host roobarb -n 1 
> slave  > >  > > Using MPICH2 1.0.7 on WinXP ia32.
>  >  >  >
>  >  >  > Regards
>  >  >  > James
>  >  >  > --
>  >  >  >
>  >
------------------------------------------------------------------------
>  >  >  >    James S. Perrin
>  >  >  >    Visualization
>  >  >  >
>  >  >  >    Research Computing Services
>  >  >  >    The University of Manchester
>  >  >  >    Kilburn Building, Oxford Road
>  >  >  >    Manchester, M13 9PL
>  >  >  >
>  >  >  >    t: +44 (0) 161 275 6945
>  >  >  >    e: james.perrin at manchester.ac.uk
>  >  >  >    w: www.manchester.ac.uk/researchcomputing
>  >  >  >
>  >
------------------------------------------------------------------------
>  >  >  >   "The test of intellect is the refusal to belabour the
obvious"
>  >  >  >   - Alfred Bester
>  >  >  >
>  >  >
>  > 
> ----------------------------------------------------------------------
>  >  >  > --
>  >  >  >
>  >  >
>  >  > --
>  >  >
> ------------------------------------------------------------------------
>  >  >    James S. Perrin
>  >  >    Visualization
>  >  >
>  >  >    Research Computing Services
>  >  >    The University of Manchester
>  >  >    Kilburn Building, Oxford Road
>  >  >    Manchester, M13 9PL
>  >  >
>  >  >    t: +44 (0) 161 275 6945
>  >  >    e: james.perrin at manchester.ac.uk
>  >  >    w: www.manchester.ac.uk/researchcomputing
>  >  >
> ------------------------------------------------------------------------
>  >  >   "The test of intellect is the refusal to belabour the obvious"
>  >  >   - Alfred Bester
>  >  >
>  > 
> ----------------------------------------------------------------------
>  >  > --
>  >  >
>  >
>  > --
>  >
------------------------------------------------------------------------
>  >    James S. Perrin
>  >    Visualization
>  >
>  >    Research Computing Services
>  >    The University of Manchester
>  >    Kilburn Building, Oxford Road
>  >    Manchester, M13 9PL
>  >
>  >    t: +44 (0) 161 275 6945
>  >    e: james.perrin at manchester.ac.uk
>  >    w: www.manchester.ac.uk/researchcomputing
>  >
------------------------------------------------------------------------
>  >   "The test of intellect is the refusal to belabour the obvious"
>  >   - Alfred Bester
>  > 
> ----------------------------------------------------------------------
>  > --
>  >
> 
> --
> ------------------------------------------------------------------------
>    James S. Perrin
>    Visualization
> 
>    Research Computing Services
>    The University of Manchester
>    Kilburn Building, Oxford Road
>    Manchester, M13 9PL
> 
>    t: +44 (0) 161 275 6945
>    e: james.perrin at manchester.ac.uk
>    w: www.manchester.ac.uk/researchcomputing
> ------------------------------------------------------------------------
>   "The test of intellect is the refusal to belabour the obvious"
>   - Alfred Bester
> ----------------------------------------------------------------------
> --
> 

--
------------------------------------------------------------------------
   James S. Perrin
   Visualization

   Research Computing Services
   Rm 074B, Devonshire House, University Precinct
   The University of Manchester
   Oxford Road, Manchester, M13 9PL

   t: +44 (0) 161 275 6945
   e: james.perrin at manchester.ac.uk
   w: www.manchester.ac.uk/researchcomputing
------------------------------------------------------------------------
  "The test of intellect is the refusal to belabour the obvious"
  - Alfred Bester
------------------------------------------------------------------------
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20081017/a15b2a67/attachment.htm>


More information about the mpich-discuss mailing list