[MPICH] MPICH Connection Problem

Natarajan, Senthil senthil at pitt.edu
Thu Mar 15 10:47:50 CDT 2007


Hi,

I am using MPICH 1.2.4 on Linux. I installed it with the option -rsh=ssh.

After a successful install, I am trying to run a simple MPI job across
two machines.

I have generated a key pair (ssh-keygen) and copied it to the other
machine, and I can ssh between the machines without a password.
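
For anyone wanting to repeat the passwordless-ssh check on every host in the machines file, here is a minimal sketch. The function names list_hosts and check_hosts are made up for illustration, and the sketch assumes the machines file holds one host per line, optionally written as host:N, with '#' starting a comment.

```shell
#!/bin/sh
# Hypothetical helpers, not part of MPICH.

# Print one host name per line from a machines file, skipping blank
# lines and '#' comments and dropping any ':N' suffix.
list_hosts() {
    awk '{ sub(/#.*/, ""); if (NF > 0) { split($1, parts, ":"); print parts[1] } }' "$1"
}

# Attempt a non-interactive ssh login to each host. BatchMode=yes makes
# ssh fail instead of prompting, so a password prompt counts as a failure.
check_hosts() {
    for host in $(list_hosts "$1"); do
        if ssh -n -o BatchMode=yes -o ConnectTimeout=5 "$host" true 2>/dev/null; then
            echo "$host: ok"
        else
            echo "$host: FAILED"
        fi
    done
}
```

Running "check_hosts machines" from the directory that holds the machines file should then print one line per host.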

However, when I run the job, it complains about a refused connection
without appearing to try to connect to the other machine over ssh:

mpirun -v -np 2 -machinefile machines tspRunOneBranch randomOut10.txt
running /home/condor-nobody/teststuff/tspRunOneBranch on 2 LINUX ch_p4 processors
Created /home/condor-nobody/teststuff/PI24892
connect to address xxx.xx.xxx.95: Connection refused
Trying krb4 rsh...
connect to address xxx.xx.xxx.95: Connection refused
trying normal rsh (/usr/bin/rsh)
machine2: Connection refused
p0_24976:  p4_error: Timeout in making connection to remote process on machine2: 0

Could anyone please explain, or point me to a link that describes, what
happens when the mpirun command is executed?

I thought it would connect to the other machine through ssh, but it is
not communicating with the other machine at all. I don't know what it
is doing or how to debug it.

Could you please let me know how to run a simple mpirun job through ssh?
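
One thing that may be worth trying, offered only as a sketch: the output above shows the launcher falling back to /usr/bin/rsh, which suggests the ssh setting is not being picked up at run time. The lines below assume that the ch_p4 device in this build reads the P4_RSHCOMMAND environment variable to choose the remote shell; that is an assumption and has not been confirmed for MPICH 1.2.4.

```shell
#!/bin/sh
# Assumption: ch_p4 consults P4_RSHCOMMAND when it starts remote processes.
P4_RSHCOMMAND=ssh
export P4_RSHCOMMAND

# Show the value that child processes such as mpirun will inherit.
echo "remote shell for ch_p4: $P4_RSHCOMMAND"
```

The same mpirun command would then be run from this shell so that it inherits the variable. If the output still mentions /usr/bin/rsh, the variable is probably not honoured by this build.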

Thanks,

Senthil


