<html><head></head><body style="word-wrap: break-word; -webkit-nbsp-mode: space; -webkit-line-break: after-white-space; "><div style="font-size: 16px; ">Hi,</div><div style="font-size: 16px; "><br></div><div style="font-size: 16px; ">I encountered failure when specifying -iface and, perhaps, related to the number of tasks (but not sure with the last point)</div><div style="font-size: 16px; "><br></div><div style="font-size: 16px; ">More clearly : here is a small test between 2 machines, without any limit (*) and the following machines file :</div><div style="font-size: 16px; "><span class="Apple-style-span" style="font-size: 18px; "><font class="Apple-style-span" size="4"><span class="Apple-style-span" style="font-size: 16px; "><div style="font-size: 14px; "><i>>more /tmp/machines </i></div><div style="font-size: 14px; "><i>ccwpge0061:128</i></div><div style="font-size: 14px; "><i>ccwpge0062:128</i></div><div><br></div><div>1/ without specifying -iface, It's OK (more than 10 tries)</div><div><br></div></span></font></span></div><div><div><div style="font-size: 14px; "><div>mpiexec -f /tmp/machines -n 150 bin/advance_test</div><div>bchambon@ccwpge0062's password: </div><div><br></div><div>I am there </div><div>Running MPI version 2, subversion 2 </div><div>ref_message is ready </div><div>I am the master task 0 sur ccwpge0061, for 149 slaves tasks, we will exchange a buffer of 1 MB</div><div><br></div><div>slave number 1, iteration = 1</div><div>slave number 2, iteration = 1</div><div>slave number 3, iteration = 1</div><div>…</div><div><br></div></div></div></div><div style="font-size: 15px; ">>echo $status</div><div style="font-size: 15px; ">0</div><div><br></div><div><br></div><div>2/ When <span class="Apple-style-span" style="font-size: 16px; ">specifying -iface eth0 (or eth2 : 10Gb/s) </span><span class="Apple-style-span" style="font-size: 16px; ">I <u>always</u> get </span><span class="Apple-style-span" style="font-size: 16px; ">failure (</span><span class="Apple-style-span" style="font-size: 13px; ">assert (!closed) failed)</span></div><div><font class="Apple-style-span" size="4"><span class="Apple-style-span" style="font-size: 16px; "><br></span></font></div><div><div style="font-size: 13px; ">>mpiexec -iface eth0 -f /tmp/machines -n 150 bin/advance_test <span class="Apple-style-span" style="font-size: 16px; ">(as previous, more than 10 tries)</span></div><div style="font-size: 13px; ">bchambon@ccwpge0062's password: </div><div style="font-size: 13px; "><br></div><div style="font-size: 13px; ">Segmentation fault</div><div style="font-size: 13px; ">[mpiexec@ccwpge0061] control_cb (./pm/pmiserv/pmiserv_cb.c:215): assert (!closed) failed</div><div style="font-size: 13px; ">[mpiexec@ccwpge0061] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status</div><div style="font-size: 13px; ">[mpiexec@ccwpge0061] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:181): error waiting for event</div><div style="font-size: 13px; ">[mpiexec@ccwpge0061] main (./ui/mpich/mpiexec.c:405): process manager error waiting for completion</div></div><div><br></div><div>see attachment for -verbose option</div><div><br></div><div>Best regards.</div><div><br></div><div><br></div><div style="font-size: 14px; ">(*) </div><div><div style="font-size: 14px; ">>limit</div><div style="font-size: 14px; ">cputime unlimited</div><div style="font-size: 14px; ">filesize unlimited</div><div style="font-size: 14px; ">datasize unlimited</div><div style="font-size: 14px; ">stacksize unlimited</div><div style="font-size: 14px; ">coredumpsize unlimited</div><div style="font-size: 14px; ">memoryuse unlimited</div><div style="font-size: 14px; ">vmemoryuse unlimited</div><div style="font-size: 14px; ">descriptors 1000000 </div><div style="font-size: 14px; ">memorylocked unlimited</div><div style="font-size: 14px; ">maxproc 409600 </div></div><div><br></div><div><br></div><div><div style="font-size: 16px; ">>more /tmp/machines </div><div style="font-size: 16px; ">ccwpge0061:128</div><div style="font-size: 16px; ">ccwpge0062:128</div><div><br></div></div><div><br></div><div style="font-size: 14px; "><br></div><div style="font-size: 14px; "><div>>mpich2version </div><div>MPICH2 Version: <span class="Apple-tab-span" style="white-space:pre">        </span>1.4.1p1</div><div>MPICH2 Release date:<span class="Apple-tab-span" style="white-space:pre">        </span>Thu Sep 1 13:53:02 CDT 2011</div><div>MPICH2 Device: <span class="Apple-tab-span" style="white-space:pre">        </span>ch3:nemesis</div><div>MPICH2 configure: <span class="Apple-tab-span" style="white-space:pre">        </span>--prefix=/scratch/BC/mpich2-1.4 --enable-threads=multiple</div><div>MPICH2 CC: <span class="Apple-tab-span" style="white-space:pre">        </span>/usr/bin/gcc -m64 -O2</div><div>MPICH2 CXX: <span class="Apple-tab-span" style="white-space:pre">        </span>c++ -m64 -O2</div><div>MPICH2 F77: <span class="Apple-tab-span" style="white-space:pre">        </span>/usr/bin/f77 -O2</div><div>MPICH2 FC: <span class="Apple-tab-span" style="white-space:pre">        </span>f95 </div></div><div><br></div><div><br></div><div><br></div><div>>mpiexec -verbose -iface eth0 -f /tmp/machines -n 150 bin/advance_test </div><div></div></body></html>