<div>Dear Rajeev,</div>
<div> </div>
<div>Thanks for the reply. I tried all those options with mpdcheck (e.g., -v, -l, -pc) after rsh to a node. All quit silently. Maybe the thin kernel for compute nodes on my Scyld Beowulf (no change since shipped from vendor) is too thin. Even the error messages may have been suppressed or redirected somewhere. (But not all, it would know, e.g., if entering a wrong command). Have any users successfully ported MPICH2 to such system?</div>
<div> </div>
<div>Thanks a lot.</div>
<div> </div>
<div>Wuyin <br><br></div>
<div class="gmail_quote">On Wed, May 21, 2008 at 2:04 PM, Rajeev Thakur <<a href="mailto:thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>> wrote:<br>
<blockquote class="gmail_quote" style="PADDING-LEFT: 1ex; MARGIN: 0px 0px 0px 0.8ex; BORDER-LEFT: #ccc 1px solid">
<div>
<div dir="ltr" align="left"><span><font face="Arial" color="#0000ff" size="2">May be something with the networking configuration on the machines. To debug, you can use the mpdcheck utility and follow all the steps described in the installation guide.</font></span></div>
<div dir="ltr" align="left"><span><font face="Arial" color="#0000ff" size="2"></font></span> </div>
<div dir="ltr" align="left"><span><font face="Arial" color="#0000ff" size="2">Rajeev</font></span></div><br>
<blockquote dir="ltr" style="PADDING-LEFT: 5px; MARGIN-LEFT: 5px; BORDER-LEFT: #0000ff 2px solid; MARGIN-RIGHT: 0px">
<div lang="en-us" dir="ltr" align="left">
<hr>
<font face="Tahoma" size="2"><b>From:</b> <a href="mailto:owner-mpich-discuss@mcs.anl.gov" target="_blank">owner-mpich-discuss@mcs.anl.gov</a> [mailto:<a href="mailto:owner-mpich-discuss@mcs.anl.gov" target="_blank">owner-mpich-discuss@mcs.anl.gov</a>] <b>On Behalf Of </b>Wuyin Lin<br>
<b>Sent:</b> Monday, May 19, 2008 11:26 AM<br><b>To:</b> <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br><b>Subject:</b> [mpich-discuss] unable to start mpd on Scyld Beowulf Cluster compute nodes<br>
</font><br></div>
<div>
<div></div>
<div class="Wj3C7c">
<div></div>
<div>Hello,</div>
<div> </div>
<div>My system is an AMD Opteron cluster running Penguin Computing Scyld Linux release 30cz, with a full linux on master node but thin kernel on compute nodes. Communication mostly thru bproc. rshd is also enabled on compute nodes for a separate mpich1. </div>
<div> </div>
<div>After installation of MPICH2, no problem starting mpd on master node. But launching mpd always exit silently, using bpsh, or rsh to compute node then start it. PYTHONHOME has set properly to import the required libs. Files resident on master node are recognized by same path in compute nodes via NFS. Clueless what else needed for such a system to bring up mpd. </div>
<div> </div>
<div>Any advices are appreciated. Thank you in advance.</div>
<div> </div>
<div> </div>
<div>Wuyin Lin</div></div></div></blockquote></div></blockquote></div><br>