[MPICH] Problem with mpiexec after creating a ring in Windows/Cygwin

Jayesh Krishna jayesh at mcs.anl.gov
Fri Jul 6 09:41:12 CDT 2007


 Hi,
  Is the firewall (windows firewall) turned off on both the machines ?

Regards,
Jayesh

-----Original Message-----
From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Mauro Sgroi
Sent: Friday, July 06, 2007 2:12 AM
To: Rajeev Thakur; mpich-discuss at mcs.anl.gov
Subject: RE: [MPICH] Problem with mpiexec after creating a ring in
Windows/Cygwin

Dear Rajeev,
thanks a lot for the reply.
I suspect that my problem arise from the fact that the
2 PCs have different operating systems (Windows XP and 2000). 
Using 2 WinXP PC I don't have problems.
I will try to upgrade the Win 2000 PC with a SP.
Best regards,
Mauro.

--- Rajeev Thakur <thakur at mcs.anl.gov> ha scritto:

> The port you specified is for the MPD daemons to talk to each other. 
> The MPI application processes talk to each other on ports returned by 
> the operating system. These are ports in the ephemeral range.
> 
> Rajeev
> 
> > -----Original Message-----
> > From: owner-mpich-discuss at mcs.anl.gov 
> > [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf
> Of Mauro Sgroi
> > Sent: Thursday, July 05, 2007 8:08 AM
> > To: mpich-discuss-digest at mcs.anl.gov
> > Subject: [MPICH] Problem with mpiexec after
> creating a ring
> > in Windows/Cygwin
> > 
> > Dear all,
> > I'm new to MPICH2. I compiled the program Abinit
> using
> > the MPICH2 library. The compilation of MPICH2 was fine. I created a 
> > ring between two Windows PC (XP
> and
> > Win 2000) having the Cygwin tools installed. I not installed ssh, so 
> > I created the ring doing:
> > 
> > mpd --listenport=2500 & on the first PC
> > 
> > and
> > 
> > mpd --host=C0PC0167 --port=2500 --listenport=2501
> > 
> > Cheching the ring I get:
> > 
> > mpdtrace -l
> > 
> > C0PC0167_2500 (151.91.212.63)
> > C0PC0165_2501 (151.91.212.129)
> > 
> > Now I'm trying to launch my parallel code on the 2 monoprocessor PC:
> > 
> > mpiexec  -n 2 /usr/local/abinit/5.3/bin/abinip.exe
> <
> > tparal_1.files
> > 
> > I get the error:
> > [unset]: Unable to get AF_INET socket
> > [unset]: Unable to connect to 151.91.212.129 on
> 1385
> > [unset]: aborting job:
> > Fatal error in MPI_Init: Other MPI error, error
> stack:
> > MPIR_Init_thread(247): Initialization failed
> > MPID_Init(71)........: channel initialization
> failed
> > MPID_Init(274).......: PMI_Init returned -1
> > 
> > mpiexec is accessing to the second PC on the wrong
> > port: 1385 instead of 2501.
> > I tried also to use a machinefile but with no good results. How can 
> > I force mpiexec to look in the
> good
> > port?
> > 
> > Thanks a lot and best regards,
> > Mauro Sgroi.
> > Italy.
> > 
> > 
> > 
> > 
> >       ___________________________________
> > L'email della prossima generazione? Puoi averla
> con la nuova
> > Yahoo! Mail:
> http://it.docs.yahoo.com/nowyoucan.html
> > 
> > 
> 
> 





	
		
___________________________________
L'email della prossima generazione? Puoi averla con la nuova Yahoo! Mail: 
http://it.docs.yahoo.com/nowyoucan.html





More information about the mpich-discuss mailing list