[mpich-discuss] unable to get mpich2 1.0.7 working

Kamaraju Kusumanchi kamaraju at gmail.com
Wed Oct 22 11:19:44 CDT 2008


The system administrator fixed the nodes whose hostnames were
resolving to 127.0.0.1. However, running the test suite now gives
different errors. I will open a new thread about those.
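
For anyone hitting the same symptom, the loopback check that mpdcheck
reports can be approximated in a few lines of Python. This is only a
rough sketch and not mpdcheck's own code: the function names
(loopback_addresses, hosts_lines_naming_loopback, resolve_local_host)
are made up for illustration, and the hosts-file scan assumes the usual
"address name [alias ...]" layout of /etc/hosts.

```python
import ipaddress
import socket


def loopback_addresses(addresses):
    """Return the entries of `addresses` that lie in the loopback range."""
    return [a for a in addresses if ipaddress.ip_address(a).is_loopback]


def hosts_lines_naming_loopback(hosts_text, hostnames):
    """Return hosts-file lines that bind any of `hostnames` to a loopback IP.

    `hosts_text` is the content of an /etc/hosts-style file (assumed layout:
    address followed by one or more names, '#' starts a comment).
    """
    wanted = set(hostnames)
    offending = []
    for raw in hosts_text.splitlines():
        fields = raw.split("#", 1)[0].split()  # drop comments, then tokenize
        if len(fields) < 2:
            continue  # blank line or address with no names
        address, names = fields[0], fields[1:]
        try:
            is_loopback = ipaddress.ip_address(address).is_loopback
        except ValueError:
            continue  # first field is not a plain IP address; skip the line
        if is_loopback and wanted.intersection(names):
            offending.append(raw)
    return offending


def resolve_local_host():
    """Resolve this machine's own name, much as 'mpdcheck -l' prints it.

    Returns (fqdn, aliases, addresses, loopback_hits). May raise
    socket.gaierror if the name cannot be resolved at all.
    """
    fqdn = socket.getfqdn()
    _, aliases, addresses = socket.gethostbyname_ex(fqdn)
    return fqdn, aliases, addresses, loopback_addresses(addresses)
```

On a correctly configured node, the last element returned by
resolve_local_host() should be an empty list. If it is not, passing the
contents of /etc/hosts together with the node's short and qualified names
to hosts_lines_naming_loopback() should point at the line to fix.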

raju

On Mon, Oct 20, 2008 at 3:51 AM, Kamaraju Kusumanchi <kamaraju at gmail.com> wrote:
> Ok. Reading the installation manual again offered some hints. I feel
> stupid for having missed this information before... Sorry for wasting
> all your time. I should have read the manual more carefully...
>
> It looks like some of the nodes on our cluster are not configured
> correctly. For example, if I log in to node2 and run "mpdcheck -v -l",
> the output looks fine:
>
> raju at node2:~ 1  1737 03:45 AM
> $mpdcheck -v -l
> obtaining hostname via gethostname and getfqdn
> gethostname gives  node2.ank.mae.cornell.edu
> getfqdn gives  node2.ank.mae.cornell.edu
> checking out unqualified hostname; make sure is not "localhost", etc.
> checking out qualified hostname; make sure is not "localhost", etc.
> obtain IP addrs via qualified and unqualified hostnames;  make sure
> other than 127.0.0.1
> gethostbyname_ex:  ('node2.ank.mae.cornell.edu', [], ['172.18.0.2'])
> gethostbyname_ex:  ('node2.ank.mae.cornell.edu', [], ['172.18.0.2'])
> checking that IP addrs resolve to same host
> now do some gethostbyaddr and gethostbyname_ex for machines in hosts file
>
>
> But if I log in to node6 and run mpdcheck, the hostname resolves to
> 127.0.0.1:
>
> $mpdcheck -v -l
> obtaining hostname via gethostname and getfqdn
> gethostname gives  node6.ank.mae.cornell.edu
> getfqdn gives  node6.ank.mae.cornell.edu
> checking out unqualified hostname; make sure is not "localhost", etc.
> checking out qualified hostname; make sure is not "localhost", etc.
> obtain IP addrs via qualified and unqualified hostnames;  make sure
> other than 127.0.0.1
> gethostbyname_ex:  ('node6.ank.mae.cornell.edu', ['node6',
> 'localhost.localdomain', 'localhost'], ['127.0.0.1'])
>
>    **********
>    Your unqualified hostname resolves to 127.0.0.1, which is
>    the IP address reserved for localhost. This likely means that
>    you have a line similar to this one in your /etc/hosts file:
>    127.0.0.1   $uqhn
>    This should perhaps be changed to the following:
>    127.0.0.1   localhost.localdomain localhost
>    **********
>
> gethostbyname_ex:  ('node6.ank.mae.cornell.edu', ['node6',
> 'localhost.localdomain', 'localhost'], ['127.0.0.1'])
> checking that IP addrs resolve to same host
> now do some gethostbyaddr and gethostbyname_ex for machines in hosts file
>
>
> I have sent an email to our system administrator about this problem. I
> will update this thread if there is any progress.
>
> regards
> raju
>
>
>



