I think you can use a resource manager and scheduler to do this, like torque + maui. You can suspend and resume jobs.<br><br>
<div class="gmail_quote">On Sat, May 5, 2012 at 8:46 AM, Pavan Balaji <span dir="ltr"><<a href="mailto:balaji@mcs.anl.gov" target="_blank">balaji@mcs.anl.gov</a>></span> wrote:<br>
<blockquote style="BORDER-LEFT:#ccc 1px solid;MARGIN:0px 0px 0px 0.8ex;PADDING-LEFT:1ex" class="gmail_quote">Hello,<br><br>We don't support this right now. I've created a ticket for it.<br><br><a href="https://trac.mcs.anl.gov/projects/mpich2/ticket/1627" target="_blank">https://trac.mcs.anl.gov/<u></u>projects/mpich2/ticket/1627</a><br>
<br>Please add yourself to the cc list of this ticket, if you'd like to be informed about updates on this issue.<br><br> -- Pavan
<div class="HOEnZb">
<div class="h5"><br><br>On 05/04/2012 12:54 PM, Shan-ho Tsai wrote:<br>
<blockquote style="BORDER-LEFT:#ccc 1px solid;MARGIN:0px 0px 0px 0.8ex;PADDING-LEFT:1ex" class="gmail_quote">Hello all,<br>We have mpich2 1.4.1p1 installed on a RHEL5 cluster<br>and sometimes have the need to suspend all jobs clusterwide.<br>
<br>Is there a way to suspend MPICH2 jobs that use Hydra, in<br>such a way that the master process and all slave process<br>(on multiple nodes) get properly suspended?<br><br>If there is a way to do this, what is the procedure? Is there<br>
a signal that we could send to mpiexec?<br><br>I tried sending a SIGSTOP to mpiexec, but only mpiexec<br>got suspended, the actual a.out processes continued to run.<br><br>I really appreciate any suggestions.<br>thank you,<br>
Shan-Ho<br><br>------------------------------<u></u>----------------------<br>Shan-Ho Tsai<br>University of Georgia, Athens GA<br><br><br><br>______________________________<u></u>_________________<br>mpich-discuss mailing list <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
To manage subscription options or unsubscribe:<br><a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/<u></u>mailman/listinfo/mpich-discuss</a><br></blockquote><br>
</div></div><span class="HOEnZb"><font color="#888888">-- <br>Pavan Balaji<br><a href="http://www.mcs.anl.gov/~balaji" target="_blank">http://www.mcs.anl.gov/~balaji</a></font></span>
<div class="HOEnZb">
<div class="h5"><br>______________________________<u></u>_________________<br>mpich-discuss mailing list <a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>To manage subscription options or unsubscribe:<br>
<a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/<u></u>mailman/listinfo/mpich-discuss</a><br></div></div></blockquote></div><br>