[mpich-discuss] mpdcheck fails after logout
Richard Katz
richard.katz at earth.ox.ac.uk
Mon Jan 12 11:50:27 CST 2009
Hello,
I am trying to run a sequence of MPI jobs from a shell script and have
run into the following problem. After launching the script in the
background, if I log out of the machine, the script keeps executing
but I get:
mpiexec failed: gethostbyname_ex failed for darcy.earth.ox.ac.uk
for each MPI job in the script. As a test, I used the following script:
#!/bin/bash
mpdcheck -v > test_script.out
mpiexec -np 2 ./myProg >> test_script.out < /dev/null
mpdcheck -v >> test_script.out
mpiexec -np 2 ./myProg >> test_script.out < /dev/null
and ran it in the background. The first mpdcheck returned
successfully. The first run of myProg was also successful. At some
point during the first execution of myProg I logged out of the machine,
and the subsequent mpdcheck and mpiexec both failed. The second
mpdcheck -v returned:
obtaining hostname via gethostname and getfqdn
gethostname gives darcy.earth.ox.ac.uk
getfqdn gives darcy.earth.ox.ac.uk
checking out unqualified hostname; make sure is not "localhost", etc.
checking out qualified hostname; make sure is not "localhost", etc.
obtain IP addrs via qualified and unqualified hostnames; make sure other than 127.0.0.1
*** gethostbyname_ex failed for this host darcy.earth.ox.ac.uk
*** gethostbyname_ex failed for host darcy.earth.ox.ac.uk
checking that IP addrs resolve to same host
now do some gethostbyaddr and gethostbyname_ex for machines in hosts file
If I run the script but remain logged in to the machine, there is no
problem.
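In case it helps with diagnosis, below is a minimal sketch of a check that could be placed between the mpiexec calls to log how the hostname resolves at each point, independently of mpdcheck. It is only an illustration: the function names is_loopback and resolve_report are hypothetical, and it assumes getent is available on the machine (if it is not, the lookup is simply reported as failed).

```shell
#!/bin/bash
# Hypothetical diagnostic helpers; not part of MPICH or mpd.

# Return 0 if the given address is a loopback address (127.x.x.x or ::1).
is_loopback() {
    case "$1" in
        127.*|::1) return 0 ;;
        *) return 1 ;;
    esac
}

# Print one line describing how a hostname resolves.
# Assumes getent exists; a missing getent yields "lookup failed".
resolve_report() {
    local host="$1" addr
    # Take the first address returned by the system resolver, if any.
    addr=$(getent hosts "$host" 2>/dev/null | awk 'NR==1 {print $1}')
    if [ -z "$addr" ]; then
        echo "$host: lookup failed"
    elif is_loopback "$addr"; then
        echo "$host: resolves to loopback ($addr)"
    else
        echo "$host: resolves to $addr"
    fi
}

# Report on the local hostname, as mpdcheck does for this host.
resolve_report "$(hostname)"
```

Appending the output of resolve_report to test_script.out before and after logging out would show whether the lookup itself changes, or only the mpd tools' view of it.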
How can I fix this so that I can log out and not cause
gethostbyname_ex to fail?
Many thanks,
Rich