[mpich-discuss] Error in the mpdboot step
Pavan Balaji
balaji at mcs.anl.gov
Tue Feb 2 07:43:31 CST 2010
On 02/02/2010 04:52 AM, Prashantha Hebbar wrote:
> mlscub1 at mlscub1-desktop:~$ mpdboot -n 2 -f mpd.hosts --ncpus=1
> unable to obtain IP for host: mlscub1 at mlscub1-desktop.local
> unable to obtain IP for host: mlscub2 at mlscub2-desktop.local
> totalnum=2 numhosts=1
> there are not enough hosts on which to start all processes
The /etc/hosts file on each machine needs to have information on all the
nodes in the system.
> So, can you please tell me what might have gone wrong? I do not think
it is
> a problem of my /etc/hosts setting.
> mlscub1 at mlscub1-desktop:~$ cat /etc/hosts
> 127.0.0.1 localhost
> 172.16.17.24 mlscub1-desktop
The file doesn't contain information on mlscub2-desktop
You can use the mpdcheck utility to look for such errors in your setup.
> I tried it in other way round with specifying hostname of mpd master in
> slave system. That works fine.
>
> mlscub1 at mlscub1-desktop:~$ mpd &
> [1] 22750
>
> mlscub1 at mlscub1-desktop:~$ mpdtrace -l
> mlscub1-desktop_50100 (172.16.17.24)
>
> mlscub2 at mlscub2-desktop:~$ mpd -h mlscub1-desktop.local -p 50100 &
> [1] 11418
>
> mlscub2 at mlscub2-desktop:~$ mpdtrace -l
> mlscub2-desktop_50514 (172.16.17.93)
> mlscub1-desktop_50100 (172.16.17.24)
>
> mlscub1 at mlscub1-desktop:~$ mpdtrace -l
> mlscub1-desktop_50100 (172.16.17.24)
> mlscub2-desktop_50514 (172.16.17.93)
Good to know that this works. This is another option, but is more
cumbersome to use, so we don't usually prefer it.
> I have another problem with executing programs. I find something like
> permission denied error messages.
>
> mlscub1 at mlscub1-desktop:~$ mpiexec -n 5
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c
> problem with execution of
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c on mlscub1-desktop:
> [Errno 13] Permission denied
> problem with execution of
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c on mlscub2-desktop:
> [Errno 2] No such file or directory
> problem with execution of
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c on mlscub2-desktop:
> [Errno 2] No such file or directory
> problem with execution of
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c on mlscub1-desktop:
> [Errno 13] Permission denied
> problem with execution of
> /home/mlscub1/libraries/mpich2-1.2.1/examples/cpi.c on mlscub1-desktop:
> [Errno 13] Permission denied
Why are you trying to execute cpi.c ? Do you mean to compile it and
execute cpi?
% mpicc examples/cpi.c -o examples/cpi
% mpiexec -n 5 ./examples/cpi
-- Pavan
--
Pavan Balaji
http://www.mcs.anl.gov/~balaji
More information about the mpich-discuss
mailing list