<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META http-equiv=Content-Type content="text/html; charset=us-ascii">
<META content="MSHTML 6.00.6000.16735" name=GENERATOR></HEAD>
<BODY>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009>Hi,</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009># Do you get any error messages related to mapping
network drives when you run your job?</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> Please send us the complete mpiexec command for your
MPI job together with its output (copy and paste both into your
email).</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009># Can you run a command like the following (note that I
have removed the "-noprompt" option)?</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> mpiexec -map
x:\\computer1\MPI -wdir x:\ -n 8 -machinefile testallnamesmf.txt
hostname</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> with the following contents in the machinefile
(testallnamesmf.txt), which lists all the computer/host names. Note that the
"hostname:1" syntax specifies that only 1 MPI process be launched on each
host:</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT><SPAN class=241351619-13012009>computer1:1 -ifhn
192.168.1.1<BR>computer2:1 -ifhn 192.168.1.2<BR>...<BR>computer8:1 -ifhn
192.168.1.8</SPAN></FONT></DIV>
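<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> If typing the machinefile by hand is error-prone, a
small script can generate it. This is only a sketch in Python: the helper names
(machinefile_lines, write_machinefile) are hypothetical, and it lists only the
hosts spelled out in this thread, so add your remaining hosts
yourself.</SPAN></FONT></DIV>
<PRE>
```python
def machinefile_lines(hosts, procs_per_host=1):
    """Return one 'name:count -ifhn address' line per (name, address) pair."""
    return [f"{name}:{procs_per_host} -ifhn {address}" for name, address in hosts]


def write_machinefile(path, hosts, procs_per_host=1):
    """Write the machinefile, one host per line."""
    with open(path, "w") as handle:
        for line in machinefile_lines(hosts, procs_per_host):
            handle.write(line + "\n")


# Only the hosts named explicitly in this thread; the elided ones are
# deliberately left out and must be added by whoever runs this.
HOSTS = [
    ("computer1", "192.168.1.1"),
    ("computer2", "192.168.1.2"),
    ("computer8", "192.168.1.8"),
]

if __name__ == "__main__":
    # Print the lines for inspection instead of touching the file system.
    for line in machinefile_lines(HOSTS):
        print(line)
```
</PRE>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> Calling write_machinefile("testallnamesmf.txt", HOSTS)
would then produce the file used in the command
above.</SPAN></FONT></DIV>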
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009># Does your program fail consistently on certain
computers? Try running a simple job (mpiexec -map x:\\computer1\MPI -wdir x:\
-n 1 -machinefile testmf.txt hostname) with only 1 computer/host listed in the
machinefile at a time.</SPAN></FONT></DIV>
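<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> The one-host-at-a-time test can also be scripted.
The following Python sketch assumes mpiexec is on the PATH and reuses the -map
and -wdir values from the command above; the function names (build_command,
check_hosts) are hypothetical. It writes a temporary one-line machinefile per
host and records the exit code and output of each
run.</SPAN></FONT></DIV>
<PRE>
```python
import os
import subprocess
import tempfile

MAP_ARG = r"x:\\computer1\MPI"  # drive mapping taken from the command above
WDIR = "x:\\"                   # working directory x:\ on the mapped drive


def build_command(machinefile):
    """Build the single-process 'hostname' test command for one machinefile."""
    return ["mpiexec", "-map", MAP_ARG, "-wdir", WDIR,
            "-n", "1", "-machinefile", machinefile, "hostname"]


def check_hosts(host_lines, run=subprocess.run):
    """Run the test once per machinefile line; return results keyed by line."""
    results = {}
    for line in host_lines:
        # Write a temporary machinefile that contains just this one host.
        with tempfile.NamedTemporaryFile("w", suffix=".txt", delete=False) as f:
            f.write(line + "\n")
            path = f.name
        try:
            proc = run(build_command(path), capture_output=True, text=True)
            results[line] = (proc.returncode, proc.stdout, proc.stderr)
        finally:
            os.remove(path)
    return results
```
</PRE>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009> For example, check_hosts(["computer1:1 -ifhn
192.168.1.1", "computer2:1 -ifhn 192.168.1.2"]) returns one entry per host, so a
host that fails consistently stands out by its non-zero exit code or error
text.</SPAN></FONT></DIV>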
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009># Try removing "-noprompt" from the mpiexec command and
see if mpiexec prompts you for anything (a password, other input,
etc.).</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009></SPAN></FONT> </DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009>Regards,</SPAN></FONT></DIV>
<DIV dir=ltr align=left><FONT face=Arial color=#0000ff size=2><SPAN
class=241351619-13012009>Jayesh</SPAN></FONT></DIV><BR>
<DIV class=OutlookMessageHeader lang=en-us dir=ltr align=left>
<HR tabIndex=-1>
<FONT face=Tahoma size=2><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>Tina
Tina<BR><B>Sent:</B> Tuesday, January 13, 2009 12:01 PM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss] MPICH2 1.1a2 -
problems with more than 4 computers<BR></FONT><BR></DIV>
<DIV></DIV>Dear Community!<BR><BR>I am using the latest version of MPICH2 for
Windows (the problem also occurs on 1.0.8). I have 8 computers connected over a
gigabit switch. I have written a program that uses MPI for parallelization. When
I run the program on one or two computers, everything works OK (let's say most of
the time). When I run it on 4 computers, sometimes it works and sometimes it
does not. The error that I get is:<BR>launch failed:
CreateProcess(X:\mpi_program.exe) on 'computerX' failed, error 3 - The system
cannot find the path specified.<BR><BR>Most of the time I get this error for one
computer in the machine list, but it can also happen for 2 or more
computers.<BR><BR>If I increase the number of computers beyond 4, I get this
error almost every time. With 6 or more it happens every time. It looks like the
higher the number, the worse it gets. I would really like to make this work. Has
anybody had a similar experience, and what was the solution?<BR><BR>It looks like
the computer tries to start the program before the mapped drive is operational.
Is there any way to increase this delay? Or are there any other settings that
need to be set?<BR><BR>There are some other errors that I occasionally get, but this
is the most important one (for now).<BR><BR>Systems:<BR>Windows XP SP3 (on all
computers)<BR>Latest MPICH2 installed<BR>Connection: gigabit NICs (local
network) over a switch<BR><BR>Example of run command: "C:\Program
Files\MPICH2\bin\mpiexec.exe" -map X:\\computer1\MPI -wdir X:\ -n 4 -machinefile
"C:\Program Files\MPICH2\bin\machines.txt" -noprompt
X:\mpi_program.exe<BR><BR>\\computer1\MPI is a shared folder on computer1, from
which the command is run.<BR><BR>machines.txt consists of the following
lines:<BR>computer1 -ifhn 192.168.1.1<BR>computer2 -ifhn
192.168.1.2<BR>...<BR>computer8 -ifhn 192.168.1.8<BR><BR>These are the NICs I
would like MPI to use for communication. The order of computers in
machines.txt is irrelevant (it happens with every
combination).<BR><BR>Regards<BR></BODY></HTML>