[MPICH] unmanaged disconnection from mpd ring
Jinyou Liang
jliang at arb.ca.gov
Tue May 15 11:30:55 CDT 2007
Dear friends,
I encountered a problem with mpd, described below, and would
appreciate any insights you can offer on how to prevent similar
problems.
Thanks in advance,
Paul
The problem:
Yesterday, I linked 8 dual-processor machines into an mpd ring, and the
mpdtrace output was:
chara
cha02
cha03
...
cha08.
I submitted a job that was supposed to finish overnight.
However, the job was still running (waiting) this morning, because 6
of the 8 machines had dropped off the ring. The mpdtrace output was now:
chara
cha02.
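
One way to notice this kind of silent drop-out sooner is to compare the mpdtrace output against the list of hosts that are supposed to be in the ring. The following is only a minimal sketch, not part of MPICH: the helper names (parse_mpdtrace, missing_hosts, current_trace) are hypothetical, it assumes mpdtrace is on the PATH and prints one host name per line, and the expected host list has to be supplied by hand, since the full list is elided above.

```python
import subprocess


def parse_mpdtrace(output):
    """Return the set of host names found in mpdtrace output (one per line)."""
    return {line.strip() for line in output.splitlines() if line.strip()}


def missing_hosts(expected, output):
    """Return the expected hosts that are absent from the mpdtrace output.

    The order of `expected` is preserved in the result.
    """
    present = parse_mpdtrace(output)
    return [host for host in expected if host not in present]


def current_trace():
    """Run mpdtrace and return its standard output.

    Assumption: mpdtrace is installed and on the PATH of the calling user.
    Raises CalledProcessError if mpdtrace exits with a non-zero status.
    """
    result = subprocess.run(
        ["mpdtrace"], capture_output=True, text=True, check=True
    )
    return result.stdout


if __name__ == "__main__":
    # Only the hosts named explicitly in the message; fill in the rest by hand.
    expected = ["chara", "cha02", "cha03", "cha08"]
    gone = missing_hosts(expected, current_trace())
    if gone:
        print("hosts missing from the mpd ring:", " ".join(gone))
```

Run periodically (for example from cron) while a long job is active, a check like this would report which hosts have left the ring, although it does not explain why they left.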