[MPICH] unmanaged disconnection from mpd ring

Jinyou Liang jliang at arb.ca.gov
Tue May 15 11:30:55 CDT 2007


Dear friends,

I encountered the mpd problem described below and would appreciate 
any insight you can offer into how to prevent similar problems in 
the future.
Thanks in advance,
Paul

The problem:
 
Yesterday I linked 8 dual-processor machines into an mpd ring, and the 
mpdtrace output was:
chara
cha02
cha03
...
cha08.

I submitted a job that was supposed to finish during the night.

However, the job was still running (waiting) this morning, because 6 of 
the machines had dropped off the ring. The mpdtrace output is now:
chara
cha02.
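
One way to notice this kind of drop-off sooner would be to compare the 
mpdtrace listing against the hosts that are supposed to be in the ring. 
What follows is only a minimal Python sketch, not anything that ships 
with MPICH: the function names missing_hosts and check_ring and the host 
list in the usage note are hypothetical, and it assumes that mpdtrace is 
on the PATH and prints one host name per line, as in the listings above.

```python
import subprocess


def missing_hosts(expected, mpdtrace_output):
    """Return the expected hosts absent from an mpdtrace listing, in order."""
    # Assumption: one host name per non-blank line of mpdtrace output.
    present = {line.strip() for line in mpdtrace_output.splitlines() if line.strip()}
    return [host for host in expected if host not in present]


def check_ring(expected):
    """Run mpdtrace and return the expected hosts that are not in the ring."""
    # Assumption: mpdtrace is on the PATH and an mpd is running on this host;
    # check=True raises CalledProcessError if mpdtrace exits with an error.
    result = subprocess.run(
        ["mpdtrace"], capture_output=True, text=True, check=True
    )
    return missing_hosts(expected, result.stdout)
```

For example, calling check_ring(["chara", "cha02", "cha03", "cha08"]) 
from a periodic job and mailing the result whenever it is non-empty 
would flag lost hosts while the job is still running. This only detects 
the disconnection; it does not explain why the hosts left the ring.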
