[mpich-discuss] novice question
Jayesh Krishna
jayesh at mcs.anl.gov
Thu Jun 26 16:09:58 CDT 2008
Hi,
To run your MPI program across multiple hosts use the "-machinefile"
option of mpiexec (See the window's developer's guide for more information
about the various options available with mpiexec. The guide is available
at
http://www.mcs.anl.gov/research/projects/mpich2/documentation/index.php?s=
docs).
(eg: mpiexec -n 2 -machinefile mf.txt a.exe)
Please try providing the ip addresses of the machines in your
machinefile so that you don't have problems with the host name resolution.
Also make sure that the executable is available (or accessible) at both
the machines. If you are sharing the executable using the network mapped
drive, make sure that you map the drive using the "-map" option of mpiexec
(eg: mpiexec -n 2 -machinefile mf.txt -map
j:\\sharecompname\sharedFolderName j:\a.exe). Instead of sharing the
executable using the network shared folder you can also copy the
executable to both the machines.
>>> First question - do I need Windows server (or server computer cluster)
on the other machine?
No (eg: You can run your MPI program across 2 WinXP machines)
>>> Second - Do I need any of the mpich stuff on the other machine?
Yes, You need to install MPICH2 on all the machines which are part
of your MPI job.
>>> Third - does the MPICH library have a way to verify that other hosts
are reachable?
You can check the status of the process manager, smpd, on each
machine using the "-status" option of smpd command (eg: "smpd -status
10.1.1.1")
Let us know if you need further help.
Regards,
Jayesh
-----Original Message-----
From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Mark Sale
Sent: Thursday, June 26, 2008 3:51 PM
To: mpich-discuss at mcs.anl.gov
Subject: [mpich-discuss] novice question
I'm working on a parallel/MPI Fortran application for windows (XP and/or
Vista). Runs fine on one machine with mpiexec -n X a.exe but I'm having
trouble getting it to run on other hosts, e.g., mpiexe -n X -host slave11
a.exe I get this:
Unable to connect to 'slave11:8676',
sock error: generic socket failure, error stack:
MPIDU_Sock_post_connect(1228): unable to connect to slave11 on port 8676,
exhausted all endpoints (errno -1)
MPIDU_Sock_post_connect(1244): gethostbyname failed, No such host is
known. (errno 11001)
First question - do I need Windows server (or server computer cluster) on
the other machine?
Second - Do I need any of the mpich stuff on the other machine?
Third - does the MPICH library have a way to verify that other hosts are
reachable?
I'm using Intel Fortran (version 9), with Visual Studio.
thanks
Mark
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20080626/29609574/attachment.htm>
More information about the mpich-discuss
mailing list