[mpich-discuss] unable to get mpich2 1.0.7 working
Kamaraju Kusumanchi
kamaraju at gmail.com
Wed Oct 22 11:19:44 CDT 2008
The system administrator fixed the problem of nodes pointing
themselves to 127.0.0.1 . However running the testsuite now gives
different errors. I will open a new thread regarding this.
raju
On Mon, Oct 20, 2008 at 3:51 AM, Kamaraju Kusumanchi <kamaraju at gmail.com> wrote:
> Ok. Reading the installation manual again offered some hints. I feel
> stupid for having missed this information before... Sorry for wasting
> all your time. I should have read the manual more carefully...
>
> It looks like some of the nodes on our cluster are not configured
> correctly. For example if I log in into node2 and run "mpdcheck -v -l"
> then I get
>
> raju at node2:~ 1 1737 03:45 AM
> $mpdcheck -v -l
> obtaining hostname via gethostname and getfqdn
> gethostname gives node2.ank.mae.cornell.edu
> getfqdn gives node2.ank.mae.cornell.edu
> checking out unqualified hostname; make sure is not "localhost", etc.
> checking out qualified hostname; make sure is not "localhost", etc.
> obtain IP addrs via qualified and unqualified hostnames; make sure
> other than 127.0.0.1
> gethostbyname_ex: ('node2.ank.mae.cornell.edu', [], ['172.18.0.2'])
> gethostbyname_ex: ('node2.ank.mae.cornell.edu', [], ['172.18.0.2'])
> checking that IP addrs resolve to same host
> now do some gethostbyaddr and gethostbyname_ex for machines in hosts file
>
>
> If I log in into node6 and run mpdcheck then I get
>
> $mpdcheck -v -l
> obtaining hostname via gethostname and getfqdn
> gethostname gives node6.ank.mae.cornell.edu
> getfqdn gives node6.ank.mae.cornell.edu
> checking out unqualified hostname; make sure is not "localhost", etc.
> checking out qualified hostname; make sure is not "localhost", etc.
> obtain IP addrs via qualified and unqualified hostnames; make sure
> other than 127.0.0.1
> gethostbyname_ex: ('node6.ank.mae.cornell.edu', ['node6',
> 'localhost.localdomain', 'localhost'], ['127.0.0.1'])
>
> **********
> Your unqualified hostname resolves to 127.0.0.1, which is
> the IP address reserved for localhost. This likely means that
> you have a line similar to this one in your /etc/hosts file:
> 127.0.0.1 $uqhn
> This should perhaps be changed to the following:
> 127.0.0.1 localhost.localdomain localhost
> **********
>
> gethostbyname_ex: ('node6.ank.mae.cornell.edu', ['node6',
> 'localhost.localdomain', 'localhost'], ['127.0.0.1'])
> checking that IP addrs resolve to same host
> now do some gethostbyaddr and gethostbyname_ex for machines in hosts file
>
>
> I have sent an email to our system administrator about this problem. I
> will update this thread if there is any progress.
>
> regards
> raju
>
>
>
More information about the mpich-discuss
mailing list