[mpich-discuss] hydra_pmi_proxy: No such file
Dave Goodell
goodell at mcs.anl.gov
Fri Oct 5 17:25:29 CDT 2012
MPICH2 does not support running in a heterogeneous configuration (some 32-bit binaries and some 64-bit binaries). However, on most Linux distributions you can run 32-bit binaries on 64-bit platforms. Just configure your MPICH2 with "-m32" in your CFLAGS/FFLAGS/FCFLAGS/CXXFLAGS and you should get a working installation for both machines that can be shared over NFS.
-Dave
On Oct 5, 2012, at 4:47 PM CDT, Martin Gilchrist wrote:
> I did a uname -a on the two nodes and one is 64 and the other is 32 bit.
>
> I guess I'll have to install and build mpich2 on the nodes separately, rather than have it on an nfs shared directory.
>
> If anyone has a simpler solution please let me know.
>
> Thanks all.
>
> Martin.
>
> From: Dave Goodell <goodell at mcs.anl.gov>
> To: Martin Gilchrist <gilchrist.martin at yahoo.com>; mpich-discuss at mcs.anl.gov
> Cc: Pavan Balaji <balaji at mcs.anl.gov>
> Sent: Friday, October 5, 2012 2:34 PM
> Subject: Re: [mpich-discuss] hydra_pmi_proxy: No such file
>
> What is the output of "uname -a" on each host?
>
> -Dave
>
> On Oct 5, 2012, at 4:04 PM CDT, Martin Gilchrist wrote:
>
> > Hi Pavan.
> >
> > I've got some more information that might give some insight into what is happening on my system.
> >
> > When I do ldd ./cpi on the base node where nfs is installed and mpich was compiled I get some paths to libraries, but when I do it on the other node I get the message that ./cpi is not a dynamic executable.
> >
> > When I do mpicc ./child.c, an a.out is produced, when I do the same thing on the other node I get messages like:
> >
> > /usr/bin/ld: skipping incompatible /mirror/mpich2/lib/libmpich.a when searching for -lmpich.
> >
> > Also, a simple invocation of ./cpi on the main node works, but gives the error message: bash: ./cpi: No such file or directory when run on the other node.
> >
> > I have mpich and the user mpiu installed on nfs. Could it be that the binaries compiled on one node are not compatible with the processor on the other node?
> >
> > Martin.
> >
> > From: Pavan Balaji <balaji at mcs.anl.gov>
> > To: Martin Gilchrist <gilchrist.martin at yahoo.com>
> > Cc: mpich-discuss at mcs.anl.gov
> > Sent: Wednesday, October 3, 2012 8:29 PM
> > Subject: Re: [mpich-discuss] hydra_pmi_proxy: No such file
> >
> >
> > This is weird. Can you run mpiexec with the -verbose option and send
> > the output?
> >
> > -- Pavan
> >
> > On 10/03/2012 05:09 PM, Martin Gilchrist wrote:
> > > Yes.
> > >
> > > The first is the machine where nfs host machine and the second is the
> > > other node.
> > >
> > >
> > > mpiu at martinlaptop-HP-Pavilion-dm4-Notebook-PC:~$ hostname
> > > martinlaptop-HP-Pavilion-dm4-Notebook-PC
> > > mpiu at martinlaptop-HP-Pavilion-dm4-Notebook-PC:~$ ls -l
> > > /mirror/mpich2/bin/mpiexec
> > > lrwxrwxrwx 1 mpiu mpiu 13 2012-10-03 14:32 /mirror/mpich2/bin/mpiexec ->
> > > mpiexec.hydra
> > > mpiu at martinlaptop-HP-Pavilion-dm4-Notebook-PC:~$ ls -l
> > > /mirror/mpich2/bin/mpiexec.hydra
> > > -rwxr-xr-x 1 mpiu mpiu 1065207 2012-10-03 14:32
> > > /mirror/mpich2/bin/mpiexec.hydra
> > > mpiu at martinlaptop-HP-Pavilion-dm4-Notebook-PC:~$ ls -l
> > > /mirror/mpich2/bin/hydra_pmi_proxy
> > > -rwxr-xr-x 1 mpiu mpiu 851803 2012-10-03 14:32
> > > /mirror/mpich2/bin/hydra_pmi_proxy
> > > mpiu at martinlaptop-HP-Pavilion-dm4-Notebook-PC:~$
> > >
> > >
> > >
> > >
> > > mpiu at martin-desktop:~$ ls -l /mirror/mpich2/bin/mpiexec
> > > lrwxrwxrwx 1 mpiu mpiu 13 2012-10-03 14:32 /mirror/mpich2/bin/mpiexec ->
> > > mpiexec.hydra
> > > mpiu at martin-desktop:~$ ls -lat /mirror/mpich2/bin/mpiexec.hydra
> > > -rwxr-xr-x 1 mpiu mpiu 1065207 2012-10-03 14:32
> > > /mirror/mpich2/bin/mpiexec.hydra
> > > mpiu at martin-desktop:~$ ls -l /mirror/mpich2/bin/hydra_pmi_proxy
> > > -rwxr-xr-x 1 mpiu mpiu 851803 2012-10-03 14:32
> > > /mirror/mpich2/bin/hydra_pmi_proxy
> > > mpiu at martin-desktop:~$
> > >
> > >
> > > ------------------------------------------------------------------------
> > > *From:* Pavan Balaji <balaji at mcs.anl.gov>
> > > *To:* Martin Gilchrist <gilchrist.martin at yahoo.com>
> > > *Cc:* "mpich-discuss at mcs.anl.gov" <mpich-discuss at mcs.anl.gov>
> > > *Sent:* Wednesday, October 3, 2012 3:46 PM
> > > *Subject:* Re: [mpich-discuss] hydra_pmi_proxy: No such file
> > >
> > > Hi Martin,
> > >
> > > Can you please stick to one configuration? It's hard for me to tell
> > > when you are running in which configuration.
> > >
> > > Can you please run the following on each of the two machines:
> > >
> > > % hostname
> > >
> > > % ls -l /mirror/mpich2/bin/mpiexec
> > >
> > > % ls -l /mirror/mpich2/bin/hydra_pmi_proxy
> > >
> > > -- Pavan
> > >
> > > On 10/03/2012 04:44 PM, Martin Gilchrist wrote:
> > > > Yes the two are different.
> > > > The one where the proxy is not found is produced when mpiexec is invoked
> > > > on the main computer where I have installed nfs and done all the make
> > > > and make install of mpich2. The one with the mpiexec not found is
> > > > produced when mpiexec is invoked on the node.
> > > > Martin.
> > > >
> > > > *From:* Pavan Balaji <balaji at mcs.anl.gov <mailto:balaji at mcs.anl.gov>>
> > > > *To:* Martin Gilchrist <gilchrist.martin at yahoo.com
> > > <mailto:gilchrist.martin at yahoo.com>>
> > > > *Cc:* mpich-discuss at mcs.anl.gov <mailto:mpich-discuss at mcs.anl.gov>;
> > > Anthony Chan <chan at mcs.anl.gov <mailto:chan at mcs.anl.gov>>
> > > > *Sent:* Tuesday, October 2, 2012 1:39 PM
> > > > *Subject:* Re: [mpich-discuss] hydra_pmi_proxy: No such file
> > > >
> > > >
> > > > On 10/02/2012 03:27 PM, Martin Gilchrist wrote:
> > > > > I can ssh into the node and hydra_pmi_proxy is visible and executable.
> > > > > And when I am logged into the node directly it is visible and
> > > executable.
> > > >
> > > > Both those statement sound identical. Did you mean to add a "not"
> > > > somewhere there?
> > > >
> > > > > One more thing, when I am directly logged into the node and do
> > > > > mpiexec -n 2 -f machinefile ./cpi
> > > > > I get the message:
> > > > > bash:/mirror/mpich2/bin/mpiexec:No such file or directory.
> > > > > It looks like it might be an issue with permissions since bash can't
> > > > > find the executable, but it is visible via which and has executable
> > > > > permission under ls -lat.
> > > >
> > > > This is not an executable permission issue. It should say "Permission
> > > > denied" for that, not "No such file or directory".
> > > >
> > > > Can you send me the following output?
> > > >
> > > > % ls -l /mirror/mpich2/bin/mpiexec
> > > >
> > > > % /mirror/mpich2/bin/mpiexec -n 2 -f machinefile ./cpi
> > > >
> > > > I'm having trouble believing that you can see the file, but are getting
> > > > the following error, so I'd like to see that myself :-). Of course,
> > > > it's always possible that it's some weird NFS-ness that I'm not aware of.
> > > >
> > > > -- Pavan
> > > >
> > > > -- Pavan Balaji
> > > > http://www.mcs.anl.gov/~balaji
> > > >
> > > >
> > >
> > > --
> > > Pavan Balaji
> > > http://www.mcs.anl.gov/~balaji <http://www.mcs.anl.gov/~balaji>
> > >
> > >
> >
> > --
> > Pavan Balaji
> > http://www.mcs.anl.gov/~balaji
> >
> >
> > _______________________________________________
> > mpich-discuss mailing list mpich-discuss at mcs.anl.gov
> > To manage subscription options or unsubscribe:
> > https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
>
>
>
More information about the mpich-discuss
mailing list