Ok, I have tried to recompile MPICH2 with following options. I cannot recompile the 'global version', so I have tried to install it in my home dir and would update the PATH accordingly. But compiling is failing at the 'configure' step with following error:<br>
<br>command: ./configure --prefix=/data1/visitor/cgaurik/mympi/ --with-pmi=slurm --with-pm=no<br>End part of the output:<br>RUNNING CONFIGURE FOR THE SLURM PMI<br>checking for make... make<br>checking whether clock skew breaks make... no<br>
checking whether make supports include... yes<br>checking whether make allows comments in actions... yes<br>checking for virtual path format... VPATH<br>checking whether make sets CFLAGS... yes<br>checking for gcc... gcc<br>
checking for C compiler default output file name... a.out<br>checking whether the C compiler works... yes<br>checking whether we are cross compiling... no<br>checking for suffix of executables...<br>checking for suffix of object files... o<br>
checking whether we are using the GNU C compiler... yes<br>checking whether gcc accepts -g... yes<br>checking for gcc option to accept ANSI C... none needed<br>checking how to run the C preprocessor... gcc -E<br>checking for slurm/pmi.h... no<br>
configure: error: could not find slurm/pmi.h. Configure aborted<br>configure: error: Configure of src/pmi/slurm failed!<br><br clear="all"><br>Gauri.<br>---------<br>
<br><br><div class="gmail_quote"><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;"><div><div class="h5"><br>
> > > Message: 4<br>
> > > Date: Mon, 23 Feb 2009 23:38:06 -0600<br>
> > > From: "Rajeev Thakur" <<a href="mailto:thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>><br>
> > > Subject: Re: [mpich-discuss] HP-XC 3000 cluster issues<br>
> > > To: <<a href="mailto:mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a>><br>
> > > Message-ID: <72376B2D10EC43F9A0A433C960F951B6@thakurlaptop><br>
> > > Content-Type: text/plain; charset="us-ascii"<br>
> > ><br>
> > > To run MPICH2 with SLURM, configure with the options<br>
> > > "--with-pmi=slurm<br>
> > > --with-pm=no" as described in the MPICH2 README file. Also see<br>
> the<br>
> > > instructions on how to run MPICH2 with SLURM at<br>
> > > <a href="https://computing.llnl.gov/linux/slurm/quickstart.html" target="_blank">https://computing.llnl.gov/linux/slurm/quickstart.html</a> .<br>
> > ><br>
> > > Rajeev<br>
> > ><br>
> > ><br>
> > ><br>
> > > _____<br>
> > ><br>
> > > From: <a href="mailto:mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a><br>
> > > [mailto:<a href="mailto:mpich-discuss-bounces@mcs.anl.gov">mpich-discuss-bounces@mcs.anl.gov</a>] On Behalf Of Gauri<br>
> > > Kulkarni<br>
> > > Sent: Monday, February 23, 2009 11:19 PM<br>
> > > To: <a href="mailto:mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>
> > > Subject: [mpich-discuss] HP-XC 3000 cluster issues<br>
> > ><br>
> > ><br>
> > > Hi,<br>
> > ><br>
> > > I am a newbie to the MPI in general. Currently in our institute,<br>
> we<br>
> > > have a<br>
> > > cluster of 16nodes-8processors. It is an HP-XC 3000 cluster which<br>
> > > basically<br>
> > > means, it's quite proprietary. It has its own MPI implementation<br>
> -<br>
> > > HP-MPI -<br>
> > > in which, the parallelization is managed by SLURM (Simple Linux<br>
> > > Utility for<br>
> > > Resource Management). There is also a batch job scheduler - LSF<br>
> (Load<br>
> > > Sharing Facility) which works in tandem with SLURM to parallelize<br>
> the<br>
> > > batch<br>
> > > jobs. We have installed both MPICH and MPICH2 and are testing it,<br>
> but<br>
> > > we are<br>
> > > running into compatibility issues. For a simple helloworld.c<br>
> program:<br>
> > > 1. For HPMPI: Compiled with mpicc of this implemetation and<br>
> executed<br>
> > > with<br>
> > > its mpirun: mpirun -np 4 helloworld works correctly. For batch<br>
> > > scheduling,<br>
> > > we need to isse "bsub -n4 [other options] mpirun -srun helloworld"<br>
> and<br>
> > > it<br>
> > > runs fine too. "srun" is SLURM utility that parallelizes the<br>
> jobs.<br>
> > > 2. For MPICH and mPICH2: Again, compiled with mpicc of these<br>
> > > respective<br>
> > > implemetations and executed with their own mpirun:<br>
> > > i) mpirun -np 4 helloword : Works.<br>
> > > ii) mpirun -np 15 helloworld: The parallelization is limited to<br>
> just<br>
> > > a<br>
> > > single node - that is 8 processes run first on 8 processors of a<br>
> > > single node<br>
> > > and then remaining ones.<br>
> > > iii) bsub -n4 [options] mpirun -srun hellowrold: Job terminated.<br>
> > > srun<br>
> > > option not recognized.<br>
> > > iv) bsub [options] mpirun -np 4 helloworld: Works<br>
> > > V) bsub [options] mpirun -np 15 helloworld: (Same as iii)<br>
> > ><br>
> > > Anybody aware of HP cluster issues with MPICH? Am I<br>
> misinterpreting?<br>
> > > Any<br>
> > > help is appreciated.<br>
> > ><br>
> > > Gauri.<br>
> > > ---------<br>
> ><br>
</div></div></blockquote></div><br>