[mpich-discuss] Can't form mpi ring
Reuti
reuti at Staff.Uni-Marburg.DE
Mon Sep 27 06:48:16 CDT 2010
Hi,
Am 27.09.2010 um 12:21 schrieb Akshey Jawa:
> I am facing problem in using mpich2 on my computer systems(having Ubuntu).
>
> Here is what I tried:
>
> 1) I ran 'mpd &' on one machine and then I ran 'mpdtrace -l' on the same machine. I got this as output: "my-lappy_53430 (127.0.1.1)"
> 2) On another machine I ran 'mpd -h -p 53430 &' and got this error:
you don't need a DNS for this to work, but some unique addresses for each of the machines, so that you can also login from one machine to the other by ssh for example.
> akshey-desktop_39993: conn error in connect_lhs: Connection timed out
> akshey-desktop_39993 (connect_lhs 924): failed to connect to lhs at 10.2.28.137 52430
Where is 10.2.28.137 coming from?
> akshey-desktop_39993 (enter_ring 879): lhs connect failed
> akshey-desktop_39993 (run 267): failed to enter ring
>
>
>
> Can you please help with this issue? I tried to ping and ssh the first machine(on which mpd is running) from the second machine and it worked.
>
> After this I executed 'mpdheck' on the first machine, I got this as output:
>
>
> *** first ipaddr for this host (via my-lappy) is: 127.0.1.1
>
> These are the contents of /etc/hosts of the first machine:
>
>
> 127.0.0.1 localhost
> 127.0.1.1 my-lappy
Make this a address out of the private ones and include two entries for both machines on both of them. Either:
127.0.0.1 localhost
10.0.1.1 my-lappy
10.0.1.2 my-desktop
or
127.0.0.1 localhost
192.168.0.1 my-lappy
192.168.0.2 my-desktop
and setup your network to use these addresses. Then try on both sides a `ping my-lappy` resp. `ping my-desktop`. When this works, you can try to start the mpds again.
-- Reuti
> # The following lines are desirable for IPv6 capable hosts
> ::1 localhost ip6-localhost ip6-loopback
> fe00::0 ip6-localnet
> ff00::0 ip6-mcastprefix
> ff02::1 ip6-allnodes
> ff02::2 ip6-allrouters
> ff02::3 ip6-allhosts
>
> Then I ran 'mpdcheck -l' and got this as output:
>
>
> **********
> Your unqualified hostname resolves to 127.0.0.1, which is
> the IP address reserved for localhost. This likely means that
> you have a line similar to this one in your /etc/hosts file:
> 127.0.0.1 $uqhn
> This should perhaps be changed to the following:
> 127.0.0.1 localhost.localdomain localhost
> **********
>
> Even after changing the first line of /etc/hosts to "127.0.0.1 localhost.localdomain localhost" I still got the same output from 'mpdcheck -l'
>
> Please note that I do not have access to the DNS server of the network and these machines do not have a DNS entry in the DNS server. (I think this should not be a problem because we can always use IP addresses instead of hostnames. Isn't it so?)
>
> Thanks,
>
> Akshey
>
>
>
>
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
More information about the mpich-discuss
mailing list