[MPICH] MPICH Connection Problem
Natarajan, Senthil
senthil at pitt.edu
Thu Mar 15 10:47:50 CDT 2007
Hi,
I am using MPICH1.2.4 on Linux. I installed with the option -rsh=ssh.
After successfully installed, I am trying to run a simple mpi job with
the two machines.
I have generated the key pair (ssh-keygen) and copied to other machine,
and I can ssh between the machines with out password.
I am trying to run a simple mpi job, but it with out trying to connect
other machine, complains about connection refused.
mpirun -v -np 2 -machinefile machines tspRunOneBranch randomOut10.txt
running /home/condor-nobody/teststuff/tspRunOneBranch on 2 LINUX ch_p4
processors
Created /home/condor-nobody/teststuff/PI24892
connect to address xxx.xx.xxx.95: Connection refused
Trying krb4 rsh...
connect to address xxx.xx.xxx.95: Connection refused
trying normal rsh (/usr/bin/rsh)
machine2: Connection refused
p0_24976: p4_error: Timeout in making connection to remote process on
machine2: 0
Could you please any one let me know or point out the link, what is
happening once we execute mpirun command.
I though it will connect to other machine through ssh, but it is not
making any communication at all to other machine. I don't know what it
is doing and how to debug.
Could you please let me know, how to run simple mpirun job through ssh.
Thanks,
Senthil
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20070315/98ecb01a/attachment.htm>
More information about the mpich-discuss
mailing list