[mpich-discuss] mpich2 does not work with SGE

tilakraj dattaram tilakraj1985 at gmail.com
Tue Jul 12 06:03:29 CDT 2011


Hi

We have a rocks cluster with 10 nodes, with sun grid engine installed and
running. I then installed the most recent version of mpich2 (1.4) on the
master and compute nodes. However, we are unable to run parallel jobs
through SGE (we can submit serial jobs without a problem). I am a sge
newbie, and most of the installation that we have done is by reading
step-by-step tutorials on the web.

The mpich2 manual says that hydra is the default process manager for mpich2,
and I have checked that the mpiexec command points to mpiexec.hydra. Also,
which mpicc, which mpiexec point to the desired location of mpich2. I
understand that in this version of mpich2, hydra should be integrated with
SGE by default. But maybe I am missing something here.

We are able to run parallel jobs using command line by specifying a host
file (e.g, mpiexec -f hostfile -np 16 ./a.out), but would like the resource
manager to take care of allocating resources on the cluster.

Your help would be greatly appreciated.

Thank you

Regards
Tilak
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20110712/57966ef3/attachment.htm>


More information about the mpich-discuss mailing list