[MPICH] MPICH Connection Problem

Rajeev Thakur thakur at mcs.anl.gov
Thu Mar 15 13:15:04 CDT 2007


Also see Sec. 6.3 of the MPICH User's Guide:
http://www-unix.mcs.anl.gov/mpi/mpich1/docs/mpichman-chp4.pdf
 
Rajeev



  _____  

From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Natarajan, Senthil
Sent: Thursday, March 15, 2007 10:48 AM
To: mpich-discuss at mcs.anl.gov; ashton at mcs.anl.gov
Subject: [MPICH] MPICH Connection Problem



Hi,

I am using MPICH 1.2.4 on Linux. I installed it with the option -rsh=ssh.

After a successful installation, I am trying to run a simple MPI job on
two machines.

I have generated a key pair (ssh-keygen) and copied it to the other machine,
and I can ssh between the machines without a password.

 

When I run the job, it complains that the connection was refused, apparently
without even trying to connect to the other machine.

 

mpirun -v -np 2 -machinefile machines tspRunOneBranch randomOut10.txt

running /home/condor-nobody/teststuff/tspRunOneBranch on 2 LINUX ch_p4 processors
Created /home/condor-nobody/teststuff/PI24892
connect to address xxx.xx.xxx.95: Connection refused
Trying krb4 rsh...
connect to address xxx.xx.xxx.95: Connection refused
trying normal rsh (/usr/bin/rsh)
machine2: Connection refused
p0_24976:  p4_error: Timeout in making connection to remote process on machine2: 0

 

 

Could anyone please tell me, or point me to a link that explains, what
happens once the mpirun command is executed?

I thought it would connect to the other machine through ssh, but it does not
seem to communicate with the other machine at all. I don't know what it is
doing or how to debug it.

Could you please let me know how to run a simple mpirun job through ssh?
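
One way to narrow this down is to confirm, host by host, that a
non-interactive ssh login works for every entry in the machines file. The
following is only a minimal sketch and not part of MPICH: list_hosts and
check_hosts are hypothetical helper names, and it assumes the machines file
holds one "host" or "host:ncpus" entry per line, with "#" starting a comment.

```shell
#!/bin/sh
# Hypothetical helper: print the host name from each non-empty,
# non-comment line of a machines file.
# Assumption: entries look like "host" or "host:ncpus", one per line.
list_hosts() {
    sed -e 's/#.*//' "$1" |
        awk 'NF { sub(/:.*/, "", $1); print $1 }'
}

# Hypothetical helper: try a non-interactive ssh login to every host.
# BatchMode=yes makes ssh fail instead of prompting for a password,
# and ConnectTimeout bounds the wait for unreachable hosts.
check_hosts() {
    for host in $(list_hosts "$1"); do
        if ssh -o BatchMode=yes -o ConnectTimeout=5 "$host" true; then
            echo "ok:   $host"
        else
            echo "FAIL: $host"
        fi
    done
}

# Usage: check_hosts machines
```

Run against the file passed to -machinefile, check_hosts prints one ok/FAIL
line per host. The output above shows mpirun trying /usr/bin/rsh rather than
ssh, so if every host reports ok and the rsh messages still appear, it may be
worth checking which mpirun is first on the PATH (command -v mpirun), on the
assumption that a build without -rsh=ssh is being picked up.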

Thanks,

Senthil
