Hi,<br> <br>After that, I put all 4 nodes in the same domain (originally they were in the same workgroup).<br>Now MPI_Barrier(...) returns at once with both 1.0.6p1 and 1.0.8.<br>But another simple MPI program (a Laplace equation solver) runs very slowly with both 1.0.6p1 and 1.0.8.<br>
<br>I have noticed that a command like<br>mpiexec -hosts 2 192.168.1.1 192.168.1.2 \\192.168.1.1\shared\LaplaceSolver.exe<br>
runs very slowly (the processes are located on different machines).<br> <br>A command like<br>mpiexec -hosts 2 192.168.1.1 192.168.1.1 \\192.168.1.1\shared\LaplaceSolver.exe<br>
runs at normal speed (the processes are located on the SAME machine).<br> <br> <br>I think this may be due to the strict network transfer policies of Windows HPC Server 2008 (and also Vista).<br>Do you have any solution to this? Thank you!<br>
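<br>To put numbers on "slow" versus "fast", the average time per call could be measured with a small helper like the sketch below. It is only an illustration: average_seconds is a hypothetical name, not part of MPICH2, and it uses std::chrono instead of MPI_Wtime so that it compiles without MPI. The trailing comment shows how it might wrap MPI_Barrier.<br>
<pre>
```cpp
#include <cassert>
#include <chrono>
#include <functional>

// Hypothetical helper (not part of MPICH2): call `op` `reps` times and
// return the average wall-clock time per call, in seconds.
double average_seconds(const std::function<void()>& op, int reps)
{
    assert(reps > 0);  // averaging over zero calls is undefined
    const auto start = std::chrono::steady_clock::now();
    for (int i = 0; i < reps; ++i) {
        op();
    }
    const std::chrono::duration<double> elapsed =
        std::chrono::steady_clock::now() - start;
    return elapsed.count() / reps;
}

// Possible use inside an MPI program, after MPI_Init (assumes "mpi.h"
// and <cstdio> are included and `cpuid` holds the rank):
//   double t = average_seconds([] { MPI_Barrier(MPI_COMM_WORLD); }, 10);
//   printf("rank %d: %.6f s per barrier\n", cpuid, t);
```
</pre>
Comparing the figure from a run with both processes on one host against a run across two hosts should show how much of the time goes to inter-node communication.<br>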
<br>regards,<br> <br>Seifer Lin<br><br>
<div class="gmail_quote">2008/12/8 Jayesh Krishna <span dir="ltr"><<a href="mailto:jayesh@mcs.anl.gov">jayesh@mcs.anl.gov</a>></span><br>
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">
<div>
<div dir="ltr" align="left"><font face="Arial" color="#0000ff" size="2"><span>Hi,</span></font></div>
<div dir="ltr" align="left"><font face="Arial" color="#0000ff" size="2"><span> Can you try out the latest stable version (1.0.8) of MPICH2 (<a href="http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=downloads" target="_blank">http://www.mcs.anl.gov/research/projects/mpich2/downloads/index.php?s=downloads</a>)?</span></font></div>
<div dir="ltr" align="left"><font face="Arial" color="#0000ff" size="2"><span></span></font> </div>
<div dir="ltr" align="left"><font face="Arial" color="#0000ff" size="2"><span>Regards,</span></font></div>
<div dir="ltr" align="left"><font face="Arial" color="#0000ff" size="2"><span>Jayesh</span></font></div><br>
<div lang="en-us" dir="ltr" align="left">
<hr>
<font face="Tahoma" size="2"><b>From:</b> <a href="mailto:mpich-discuss-bounces@mcs.anl.gov" target="_blank">mpich-discuss-bounces@mcs.anl.gov</a> [mailto:<a href="mailto:mpich-discuss-bounces@mcs.anl.gov" target="_blank">mpich-discuss-bounces@mcs.anl.gov</a>] <b>On Behalf Of </b>Seifer Lin<br>
<b>Sent:</b> Monday, December 08, 2008 12:47 AM<br><b>To:</b> <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br><b>Subject:</b> [mpich-discuss] MPICH2 1.0.6p1 & Windows HPC Server 2008 (bad performance)<br>
</font><br></div>
<div>
<div></div>
<div class="Wj3C7c">
<div></div>
<div>Hi everyone:</div>
<div> </div>
<div>I have a cluster of 4 nodes, all running Windows HPC Server 2008.</div>
<div>All 4 nodes are in the same workgroup. I use MPICH2 1.0.6p1 from Argonne National Laboratory.</div>
<div>The setup is as follows:</div>
<div>1. The firewall is turned off on all 4 nodes.</div>
<div>2. UAC (User Account Control) is turned off on all 4 nodes.</div>
<div>3. smpd.exe (1.0.6p1 x64) is started on all 4 nodes.</div>
<div> </div>
<div>Then I run a very simple MPI program (test_mpich2.exe):</div>
<div> </div>
<div>#include "mpi.h"<br>#include <cstdio>  // printf, fflush</div>
<div>int main(int argc, char **argv)<br>{<br> int cpuid, ncpu;<br> MPI_Init(&argc, &argv);<br> MPI_Comm_size(MPI_COMM_WORLD, &ncpu);<br> MPI_Comm_rank(MPI_COMM_WORLD, &cpuid);</div>
<div> printf("NCPU:%d, CPUID:%d\n", ncpu, cpuid);<br> fflush(stdout);</div>
<div> printf("start barrier\n"); fflush(stdout);<br> MPI_Barrier(MPI_COMM_WORLD);<br> printf("end barrier\n"); fflush(stdout);</div>
<div> MPI_Finalize();</div>
<div> return 0;</div>
<div>}</div>
<div> </div>
<div>The command is:</div>
<div>mpiexec -hosts 2 192.168.1.1 192.168.1.2 \\192.168.1.1\shared\test_mpich2.exe</div>
<div> </div>
<div>The MPI_Barrier(...) call takes 10 seconds to return!</div>
<div> </div>
<div>When the same code runs on a Windows XP cluster, MPI_Barrier(...) returns at once!</div>
<div> </div>
<div> </div>
<div>Does anyone know how to solve this problem on Windows HPC Server 2008? (Windows Vista has the same problem.)</div>
<div> </div>
<div>regards,</div>
<div> </div>
<div>Seifer Lin</div>
<div> </div></div></div></div></blockquote></div><br>