[mpich-discuss] Unable to connect issue

Jayesh Krishna jayesh at mcs.anl.gov
Mon Sep 22 14:10:08 CDT 2008


Hi,
 Good to know you have a setup that works for you.
 Unless we debug further, we won't be able to pinpoint the source of the
error on the XP machines (a problem with the setup or a bug in MPICH2).
 Let us know if you need any further assistance.

Regards,
Jayesh

  _____

From: bob ilgner [mailto:bobilgner at gmail.com]
Sent: Monday, September 22, 2008 1:19 PM
To: Jayesh Krishna
Subject: Re: [mpich-discuss] Unable to connect issue


Hi Jayesh,

I did that when I first installed last week, with no luck; i.e., I ensured
that the paths were set to the bin directories and also to the directory
containing the examples on all machines. If there is no setup for this
variation, could it be a bug? No slight intended.

I have now installed MPICH2 on Vista, as Vista is on the C: drive. MPICH2
works well on Vista and can also distribute to other PCs running XP. That
is, it runs the example cpi.exe across Vista and XP just fine as long as
both installs are on the C: partition.

I am quite happy with the way this is working out, Jayesh. I am not too
concerned that it is not running under XP on the one machine, as I will
just use the Vista machine.

Regards, bob

2008/9/22 Jayesh Krishna <jayesh at mcs.anl.gov>


Hi,
 You will have to specify the path to the executable using the "-path"
option of mpiexec.
 Let us know if it works.
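
A minimal sketch of what such a command line could look like, written in
Python only to make the argument order explicit. The folder
G:\mpich2\examples is a hypothetical placeholder (the thread never states
the actual location of cpi.exe), and placing "-path" before the executable
name is an assumption that should be checked against mpiexec's own help
output.

```python
def mpiexec_with_path(hosts, search_path, exe):
    """Build an mpiexec argument list that passes a search path via -path.

    The function only assembles the arguments; it does not run anything.
    """
    args = ["mpiexec", "-hosts", str(len(hosts)), *hosts]
    # "-path" is the option named in the message above.
    args += ["-path", search_path]
    args.append(exe)
    return args


# Hypothetical folder on the G: drive; substitute the real location of cpi.exe.
cmd = mpiexec_with_path(["images16", "images17"], r"G:\mpich2\examples", "cpi.exe")
print(" ".join(cmd))
```

Keeping the command as a list of arguments avoids quoting problems if the
real folder name contains spaces.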

Regards,
Jayesh

  _____


From: bob ilgner [mailto:bobilgner at gmail.com]
Sent: Monday, September 22, 2008 3:21 AM
To: 'Jayesh Krishna'
Subject: RE: [mpich-discuss] Unable to connect issue



Hi Jayesh,



I installed MPICH2 on another PC and it worked well the first time on
both machines. It also allowed me to isolate which machine was giving me
a problem in the first place.

Could the problem be related to paths, as the error now seems to lie with
CreateProcess on the problem PC? The XP system there is installed on the
G: drive and not on the C: drive (there is no C: drive). I set the PATH
environment variable to point at the folder where cpi.exe resides, but this
did not seem to make any difference.

Regards, bob

  _____


From: Jayesh Krishna [mailto:jayesh at mcs.anl.gov]
Sent: Friday, September 19, 2008 5:56 PM
To: 'bob ilgner'
Cc: mpich-discuss at mcs.anl.gov
Subject: RE: [mpich-discuss] Unable to connect issue



Hi,

 Do you have the same SMPD passphrase specified (during the installation
process) on both machines?

 Can you try uninstalling and reinstalling MPICH2 on both machines? Make
sure that you use the same MPICH2 installer on both machines and that you
specify the same passphrase during installation.

 Let us know if that does not work.



Regards,

Jayesh




  _____


From: bob ilgner [mailto:bobilgner at gmail.com]
Sent: Friday, September 19, 2008 10:26 AM
To: Jayesh Krishna
Subject: Re: [mpich-discuss] Unable to connect issue

Hi,



The smpd versions are the same on both machines: 1.0.7.

I have attached the logs for the master machine (images16) and the
slave (images17).

Regards, bob

2008/9/19 Jayesh Krishna <jayesh at mcs.anl.gov>

Hi,

 Do you have the same version of MPICH2 installed on both machines?
You can check the version of smpd using the "smpd -version" command.

 The error does not seem to be caused by a firewall issue. If the above
suggestion does not work, please send me the smpd debug output from
BOTH hosts and the mpiexec verbose output.



(PS: I am one of the contributors to the MPICH2 dev manual. Let me know if
you need any info to be added to the manual.)



Regards,

Jayesh




  _____


From: bob ilgner [mailto:bobilgner at gmail.com]
Sent: Friday, September 19, 2008 10:03 AM
To: Jayesh Krishna
Subject: Re: [mpich-discuss] Unable to connect issue



Hi Jayesh,



Your assumption is correct. The command was
mpiexec -hosts 2 images16 images17 cpi

mpiexec -n 2 cpi.exe works fine.

I get the same error message if I use IP addresses instead of names in
the mpiexec command.

I have attached the debug log for smpd and the verbose log for mpiexec,
as requested.

I have just put 2+2 together and noticed that you put together the
development manual for MPICH2.

Regards, bob

2008/9/19 Jayesh Krishna <jayesh at mcs.anl.gov>

Hi,



# Can you run the MPI program on a single host? (i.e., mpiexec -n 2
cpi.exe)

# Can you try specifying the IP addresses of the hosts instead of the
hostnames? (i.e., mpiexec -hosts 2 IPADDRESS_OF_IMAGES16
IPADDRESS_OF_IMAGES17 cpi.exe)

    If the above suggestions don't work, please send us the verbose
output of smpd and mpiexec. To get the verbose output of smpd and mpiexec:

# Stop any running instances of smpd using the "smpd -stop" command.

# Run smpd in debug mode using the "smpd -d" command.

# Run mpiexec in verbose mode using the "-verbose" option of mpiexec.



   Let us know the results.
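
The debugging steps above can be sketched as an ordered list of command
lines. This Python snippet only assembles the commands named in this
message and prints them; it does not run smpd or mpiexec. The host names
and cpi.exe come from this thread, and placing "-verbose" ahead of "-hosts"
is an assumption. Since smpd in debug mode is expected to stay in the
foreground printing its output, it would likely need its own console window
on each host.

```python
def debug_session_commands(hosts, exe):
    """Return the commands from the steps above, in order, as argument lists."""
    return [
        ["smpd", "-stop"],   # step 1: stop any running smpd (on each host)
        ["smpd", "-d"],      # step 2: restart smpd in debug mode (on each host)
        # step 3: run the job with verbose output from mpiexec
        ["mpiexec", "-verbose", "-hosts", str(len(hosts)), *hosts, exe],
    ]


for argv in debug_session_commands(["images16", "images17"], "cpi.exe"):
    print(" ".join(argv))
```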



(Note: In the command that you listed in your email both hostnames are the
same -- there is no images16 specified with "-hosts" arg of mpiexec. I am
assuming you meant "mpiexec -hosts 2 images16 images17 cpi.exe")



Regards,

Jayesh




  _____


From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of bob ilgner
Sent: Friday, September 19, 2008 8:30 AM
To: mpich-discuss at mcs.anl.gov
Subject: Re: [mpich-discuss] Unable to connect issue

Hi,



I am running XP Pro on the machines and am therefore using smpd. As far as
I know, mpd is used in the Unix/Linux environment. smpd is operational on
both PCs: the local status check on either machine shows that it is
running. When I do a remote check with smpd, I get the same message, i.e.



smpd -status <remotehostname>

abort: unable to connect to <remotehostname>
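
One way to separate a plain network problem from an smpd-level problem is
to test whether a TCP connection to the remote smpd port can be opened at
all. The sketch below uses only Python's standard socket module. The port
number 8676 is an assumption about the default smpd listener port and
should be verified against the local installation; the helper name
can_connect is hypothetical.

```python
import socket

# Assumption: default smpd listener port; verify this for your installation.
SMPD_PORT = 8676


def can_connect(host, port=SMPD_PORT, timeout=3.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts, and name-resolution failures.
        return False
```

Calling can_connect("images16") from images17, and the reverse, would show
whether each host can reach the other's smpd port. A False result points
toward name resolution, routing, or filtering; a True result suggests the
problem lies above the TCP layer.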







Regards, bob

On Fri, Sep 19, 2008 at 1:38 PM, The Source <thesourcehim at gmail.com>
wrote:

Do you have an mpd ring running and connected on both machines?

bob ilgner wrote:



I have installed mpich2 1.0.7 on 2 XP Pro hosts and am trying to run the
cpi process on both hosts with the command
mpiexec -hosts 2 images17 images17 cpi
where images16 and images17 are the hosts and cpi is the supplied example
application, and I get the error:
abort: unable to connect to images16
I have checked that the passwords and registration are the same on both
hosts. Both firewalls are down. The host names are defined in the hosts
file under system32. I have ensured that smpd is running on both machines.
Usernames and passwords are the same on both hosts and both have been
delegated. I have gone through the mpich2 user manual and find no way
forward with this. Is there anything else I could try?
Would it help if I listed the output from mpiexec with the verbose
switch on?
Regards, bob









