[MPICH] one shot jobs in mpich2?

Benjamin Rutt rutt at bmi.osu.edu
Fri Jun 17 13:38:12 CDT 2005


Reuti <reuti at staff.uni-marburg.de> writes:

> Hi Benjamin,
>
> you could compile MPICH2 for smpd instead of mpd and use it in a
> daemonless mode. So the behavior would be similar to MPICH 1.2.6 with
> the ch_p4 device.

Can you give me some more information about this daemonless mode?  I'm
able to run mpich2 jobs with smpd, I tested across up to three nodes,
but I cannot find any information on daemonless operation.

> Another possibility is to start a smpd per node for each job. So
> shutting down the daemons belonging to one job will not interfere with
> the other job.

I suppose this will work OK for me, given the ability to use the same
open port across all machines (unique for that job+user, not just
user), and given that I write a wrapper script to run 'smpd -s'
via ssh/rsh on all the nodes, capture their PIDs for later killing,
run the job using mpiexec, and later kill the captured smpd PIDs
on all nodes.  It's kind of sad/tragic to start a daemon for
only one job, but hey, that is what I asked for.  :-)
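
A minimal sketch of such a wrapper, in Python, under some loud
assumptions: passwordless ssh to every node, a remote shell where
'echo $!' reports the PID of the backgrounded smpd (this breaks if smpd
forks again after starting), and an smpd command line that is passed in
by the caller.  The function names (start_command, kill_command,
run_job) are made up for illustration, and any per-job port option would
have to be added to smpd_cmd and to the mpiexec arguments after checking
the smpd and mpiexec help output; none is assumed here.

```python
import subprocess


def start_command(host, smpd_cmd="smpd -s"):
    """Build the ssh command that starts smpd on host and prints its PID."""
    # Assumption: a POSIX remote shell, so '$!' is the PID of the
    # backgrounded command. Redirections keep ssh from hanging on open pipes.
    remote = f"nohup {smpd_cmd} </dev/null >/dev/null 2>&1 & echo $!"
    return ["ssh", host, remote]


def kill_command(host, pid):
    """Build the ssh command that kills a previously captured smpd PID."""
    return ["ssh", host, f"kill {int(pid)}"]


def run_job(hosts, mpiexec_argv, smpd_cmd="smpd -s"):
    """Start one smpd per host, run mpiexec, then kill the captured PIDs."""
    pids = {}
    try:
        for host in hosts:
            result = subprocess.run(
                start_command(host, smpd_cmd),
                capture_output=True, text=True, check=True,
            )
            # Take the last token in case the login shell prints a banner.
            pids[host] = int(result.stdout.split()[-1])
        # mpiexec_argv is the full argument list, e.g. ["mpiexec", "-n", "4", "./a.out"].
        return subprocess.run(mpiexec_argv).returncode
    finally:
        # Clean up even if a daemon failed to start or mpiexec failed.
        for host, pid in pids.items():
            subprocess.run(kill_command(host, pid))
```

Something like run_job(["node1", "node2"], ["mpiexec", "-n", "4", "./a.out"])
would then cover the whole start/run/kill cycle for one job; the host
names and the program name are placeholders.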

Actually, how would I tell a given mpiexec to use the set of already
running smpd's on (e.g.) port 8888 on localhost + others, rather than
the set of running smpd's on port 9999 on localhost + others?

> Although it is intended for use with SGE, you can have a look at the
> Howto for MPICH2 integration and maybe get some ideas for your usage
> (or use SGE to handle the jobs ;-) ).
>
> http://gridengine.sunsource.net/howto/mpich2-integration/mpich2-integration.html

Thanks, I will check it out.
-- 
Benjamin



