In the mpich2-1.2.1, mpdboot stopped working (upgraded from mpich2-1.1.1) for a fairly simple host file<div><br></div><div>(on compute06)</div><div>mpdboot --totalnum=6 --ncpus=0</div><div><br></div><div>host file:</div><div>
compute07</div><div>compute08</div><div>compute09</div><div>compute10</div><div>compute11</div><div><br></div><div>mpdboot will hang after trying to launch mpd on compute10</div><div><br></div><div><div>[kcoloma@compute06 ~]$ /rd_personalization08/kcoloma/mpich_install/bin/mpdboot --totalnum=6 --ncpus=0 --file=/home/kcoloma/mpiHosts.txt --mpd=/rd_personalization08/kcoloma/mpich_install/bin/mpd --verbose</div>
<div>running mpdallexit on compute06</div><div>LAUNCHED mpd on compute06 via </div><div>RUNNING: mpd on compute06</div><div>LAUNCHED mpd on compute07 via compute06</div><div>LAUNCHED mpd on compute08 via compute06</div>
<div>LAUNCHED mpd on compute09 via compute06</div><div>LAUNCHED mpd on compute10 via compute06</div><div>Traceback (most recent call last):</div><div> File "/rd_personalization08/kcoloma/mpich_install/bin/mpdboot", line 476, in ?</div>
<div> mpdboot()</div><div> File "/rd_personalization08/kcoloma/mpich_install/bin/mpdboot", line 347, in mpdboot</div><div> handle_mpd_output(fd,fd2idx,hostsAndInfo)</div><div> File "/rd_personalization08/kcoloma/mpich_install/bin/mpdboot", line 385, in handle_mpd_output</div>
<div> for line in fd.readlines(): # handle output from shells that echo stuff</div><div>KeyboardInterrupt</div><div><br></div><div>It will hang as long as --totalnum > 1.</div><div><br></div><div>mpdboot.py scripts are the same between the two versions of mpich, but the mpd.py scripts changed to address ticket #905. I've found that rolling back to the mpich2-1.1.1p1 mpd.py, fixes the mpdboot issue I'm having.</div>
<div><br></div></div>