[mpich-discuss] command line ordering of hosts matters?

David_Lowinger at ea.epson.com David_Lowinger at ea.epson.com
Mon Jul 5 17:23:33 CDT 2010


I'm trying to compile the MPICH2 code package for Windows, following your 
"windevguide", with the hopes of tracing through the MPI code and finding 
where and why MPI_Bcast is failing.  When I try to run 
"./maint/updatefies" from cygwin, it tells me I need autoconf version 2.63 
(exactly).  Unfortunately, it doesn't allow a newer version, and I'm 
running 2.65.  How can I downgrade to 2.63?
Thanks,
David




David_Lowinger at ea.epson.com 
Sent by: mpich-discuss-bounces at mcs.anl.gov
07/05/2010 12:11 PM
Please respond to
mpich-discuss at mcs.anl.gov
Expire Date: 10/27/2010


To
mpich-discuss at mcs.anl.gov
cc

Subject
Re: [mpich-discuss] command line ordering of hosts matters?







Hi Jayesh, 

Yes, I can run a simple non-MPI program (ie. the program "hostname") 
successfully with "mpiexec -hosts 2 10.0.0.101 1 10.0.0.6 1 hostname".  I 
see both hostnames get printed out, and then the program exits.  The 
ordering of hosts does not matter in this case. 
I see the exact same behavior as before when I provide the complete path 
to the executable; ie. the MPI_Bcase never completes if I put 10.0.0.101 
before 10.0.0.6. 
I see the exact same behavior as before when I use the machinefile option. 


With regards to checking the control panel, I assume you mean the 
"Processes" tab of the Windows Task Manager?  If so, which processes 
should I be looking for as the program runs?  I changed the MPI "hello 
world" app to MPI_Bcast a very large buffer over and over, so that I can 
look for some kind of process in the Windows Task Manager.  What is the 
name of the process I should be looking for?  Thanks, 

David 



Jayesh Krishna <jayesh at mcs.anl.gov> 
07/01/2010 11:54 AM 
Expire Date: 07/04/2012 


To
mpich-discuss at mcs.anl.gov 
cc
David_Lowinger at ea.epson.com 
Subject
Re: [mpich-discuss] command line ordering of hosts matters?








Hi,
Along with checking whether the processes are launched, please provide us 
with the information below,

# Are you able to run a simple non-MPI program (mpiexec -hosts 2 
10.0.0.101 1 10.0.0.6 1 hostname)?
# Try providing the complete path to the executable (mpiexec -hosts 2 
10.0.0.101 1 10.0.0.6 1 c:\temp\helloworld.exe). Without the complete path 
you might actually be executing two different MPI programs on the hosts.
# Does the machinefile option work (mpiexec -n 2 -machinefile mf.txt 
c:\temp\helloworld.exe ; Where the file mf.txt contains the host 
ipaddresses)?

Regards,
Jayesh
----- Original Message -----
From: "David Lowinger" <David_Lowinger at ea.epson.com>
To: mpich-discuss at mcs.anl.gov
Sent: Wednesday, June 30, 2010 4:26:16 PM GMT -06:00 US/Canada Central
Subject: Re: [mpich-discuss] command line ordering of hosts matters?



Hi Jayesh, 
On both machines, when I type, "smpd -version", both display "1.2.1p1". 
When I call "MPI_Get_version()" on both machines, they both show version 
2, subversion 2. 

Both machines are running 64-bit Windows Vista. 

I'll check the control panel to see if the MPI processes are being 
launched. 

David 



                Jayesh Krishna <jayesh at mcs.anl.gov> 

06/30/2010 04:53 PM 
Expire Date: 06/29/2012 
 
To                  mpich-discuss at mcs.anl.gov 
 
cc                  David_Lowinger at ea.epson.com 
 
Subject                  Re: [mpich-discuss] command line ordering of 
hosts matters? 
 



Hi, 
Which version of MPICH2 are you using ? 
Do the two machines have the same underlying architecture (MPICH2 
currently does not support heterogeneous systems - So you cannot run your 
job across 32-bit and 64-bit machines/MPICH2_libs)? 
Also try checking the control panel to see if the MPI processes are being 
launched on the machines. 

Regards, 
Jayesh 

----- Original Message ----- 
From: "David Lowinger" <David_Lowinger at ea.epson.com> 
To: mpich-discuss at mcs.anl.gov 
Sent: Wednesday, June 30, 2010 3:42:22 PM GMT -06:00 US/Canada Central 
Subject: Re: [mpich-discuss] command line ordering of hosts matters? 



Firewall is turned off on both machines. 
There is no error message... the MPI_Bcase simply never completes. I've 
left it running for 10 minutes, and the second printf ("completed 
MPI_Bcast") never appears when I use the second host ordering below 
("10.0.0.101 1 10.0.0.6 1"). 
When I run "smpd -status 10.0.0.6" from 10.0.0.101, I see the message 
"smpd running on 10.0.0.6". When I run "smpd -status 10.0.0.101" from 
10.0.0.6, I see the message "smpd running on 10.0.0.101". 



jayesh at mcs.anl.gov 

06/30/2010 10:50 AM 
Expire Date: 06/29/2012 

To mpich-discuss at mcs.anl.gov 

cc David_Lowinger at ea.epson.com 

Subject Re: [mpich-discuss] command line ordering of hosts matters? 




Hi, 
Do you have a firewall running on any of these machines (If so, can you 
try running your job after turning off the firewall)? 
What is the error message that you get when you run your job ? 
Can you try running "smpd -status REMOTE_MACHINE" from each of the 
machines and let us know the results ("smpd -status 10.0.0.6" from 
10.0.0.101 & "smpd -status 10.0.0.101" from 10.0.0.6)? 

Regards, 
Jayesh 
----- Original Message ----- 
From: "David Lowinger" <David_Lowinger at ea.epson.com> 
To: mpich-discuss at mcs.anl.gov 
Sent: Tuesday, June 29, 2010 5:53:07 PM GMT -06:00 US/Canada Central 
Subject: [mpich-discuss] command line ordering of hosts matters? 



Hi, 
When running a very basic "hello world" app, I've found that the app's 
behavior depends on the order I use for hosts in the command line. For 
example, if I use: 

mpiexec -hosts 2 10.0.0.6 1 10.0.0.101 1 helloworld.exe 

The program executes flawlessly. But, if I use: 

mpiexec -hosts 2 10.0.0.101 1 10.0.0.6 1 helloworld.exe 

then the program never gets past the call to "MPI_Bcast()". Here is my 
code: 

------------------- 

#include "mpi.h" 

#define MPI_FLUSH() fflush(stdout) 

int main( int argc, char* argv[] ) 
{ 
int g_Thread_ID, g_Num_Threads; 
int test = 0; 

/**************************************************\ 
* MPI Initialization * 
\**************************************************/ 
MPI_Init(&argc, &argv); 
MPI_Comm_rank(MPI_COMM_WORLD, &g_Thread_ID); 
MPI_Comm_size(MPI_COMM_WORLD, &g_Num_Threads); 

printf("thread %d: main: About to execute MPI_Bcast\n", g_Thread_ID); 
MPI_FLUSH(); 

// Broadcast integer 
int err = MPI_Bcast(&test, 1, MPI_INT, 0, MPI_COMM_WORLD); 

printf("thread %d: completed MPI_Bcast\n", g_Thread_ID); 
MPI_FLUSH(); 

MPI_Finalize(); 
} 

------------------ 

I am running Windows Vista on both machines. Has anyone seen this before? 
Thanks, 
David 

_______________________________________________ 
mpich-discuss mailing list 
mpich-discuss at mcs.anl.gov 
https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss 


_______________________________________________ 
mpich-discuss mailing list 
mpich-discuss at mcs.anl.gov 
https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss 


_______________________________________________
mpich-discuss mailing list
mpich-discuss at mcs.anl.gov
https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
_______________________________________________
mpich-discuss mailing list
mpich-discuss at mcs.anl.gov
https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20100705/bd267b0a/attachment.htm>


More information about the mpich-discuss mailing list