<!DOCTYPE HTML PUBLIC "-//W3C//DTD HTML 4.0 Transitional//EN">
<HTML><HEAD>
<META content="text/html; charset=us-ascii" http-equiv=Content-Type>
<STYLE type=text/css>DIV {
        MARGIN: 0px
}
</STYLE>
<META name=GENERATOR content="MSHTML 8.00.6001.18783"></HEAD>
<BODY>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial>The first issue has been fixed. If you try one of the nightly
snapshots, it should go away. It will be included in 1.1.1 to be out next
week.</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial>Can you tell us more about the second issue. What are the
processes doing when they suddenly become idle? Have they already communicated
before? Are they all running on a single machine?</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial></FONT></SPAN> </DIV>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial>Rajeev</FONT></SPAN></DIV>
<DIV dir=ltr align=left><SPAN class=671442516-11072009><FONT color=#0000ff
size=2 face=Arial></FONT></SPAN> </DIV><FONT color=#0000ff size=2
face=Arial></FONT><BR>
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px">
<DIV dir=ltr lang=en-us class=OutlookMessageHeader align=left>
<HR tabIndex=-1>
<FONT size=2><FONT face=Tahoma><B>From:</B> mpich-discuss-bounces@mcs.anl.gov
[mailto:mpich-discuss-bounces@mcs.anl.gov] <B>On Behalf Of </B>chong
tan<BR><B>Sent:</B> Friday, July 10, 2009 6:20 PM<BR><B>To:</B>
mpich-discuss@mcs.anl.gov<BR><B>Subject:</B> [mpich-discuss] version 1.1
strange behavior : all processes becomeidle for extensive
period<BR></FONT><SPAN class=671442516-11072009><FONT color=#0000ff
face=Arial> </FONT></SPAN></FONT><BR></DIV>
<DIV></DIV>
<DIV
style="FONT-FAMILY: times new roman, new york, times, serif; FONT-SIZE: 12pt">
<DIV>I am seeing this funny situation which I did not see on 1.0.6 and
1.0.8. Some background:</DIV>
<DIV> </DIV>
<DIV>machine : INTEL 4Xcore 2</DIV>
<DIV> </DIV>
<DIV>running mpiexec -n 4</DIV>
<DIV> </DIV>
<DIV>machine has 32G of mem. </DIV>
<DIV> </DIV>
<DIV>when my application runs, almost all memory are used.
However, there is no swapping.</DIV>
<DIV>I have exclusive use of the machine, so contention is not an issue.</DIV>
<DIV> </DIV>
<DIV>issue #1 : processes take extra long to be initialized, compared to
1.0.6</DIV>
<DIV>issue #2 : during the run, at time all of them will become idle at the
same time, for almost a</DIV>
<DIV>
minute. We never observed this with 1.0.6</DIV>
<DIV> </DIV>
<DIV> </DIV>
<DIV>The codes are the same, only linked with different versions of
MPICH2.</DIV>
<DIV> </DIV>
<DIV>MPICH2 was built with --enable-threads=multiple for 1.1. without
for 1.0.6 or 1.0.8</DIV>
<DIV> </DIV>
<DIV>MPI calls are all in the main application thread. I used only 4 MPI
functions :</DIV>
<DIV>init(), Send(), Recv() and Barrier(). </DIV>
<DIV> </DIV>
<DIV> </DIV>
<DIV> </DIV>
<DIV>any suggestion ?</DIV>
<DIV> </DIV>
<DIV>thanks</DIV>
<DIV>tan</DIV>
<DIV><BR> </DIV>
<DIV
style="FONT-FAMILY: times new roman, new york, times, serif; FONT-SIZE: 12pt"><BR>
<DIV
style="FONT-FAMILY: times new roman, new york, times, serif; FONT-SIZE: 12pt">
<BLOCKQUOTE
style="BORDER-LEFT: #0000ff 2px solid; PADDING-LEFT: 5px; MARGIN-LEFT: 5px; MARGIN-RIGHT: 0px"> </BLOCKQUOTE></DIV></DIV></DIV><BR></BLOCKQUOTE></BODY></HTML>