[mpich-discuss] Hydra process manager on Condor

Pavan Balaji balaji at mcs.anl.gov
Fri Mar 16 16:32:16 CDT 2012


Are you able to use MPICH2 outside the condor environment?  It's better 
to make sure you are able to do that first.

  -- Pavan

On 01/24/2012 03:01 PM, Savita Shrivastava wrote:
>
> I am struggling to run an MPI job on our RedHat MRG which has condor and
> MPICH2 (1.4.1) with hydra process manager installed. If some one has
> used hydra on condor please let me know how to run a job successfully.
>
> Thanks
> Savita
>
>
>
> On Tue, Dec 27, 2011 at 9:30 PM, Pavan Balaji <balaji at mcs.anl.gov
> <mailto:balaji at mcs.anl.gov>> wrote:
>
>     Savita,
>
>     [please keep mpich-discuss cc'ed]
>
>     These files should be written to the same location that your mpiexec
>     was launched from.  If you want to write it to a different location,
>     you can provide the full path to the files.
>
>       -- Pavan
>
>     On 12/27/2011 03:28 PM, Shrivastava, Savita wrote:
>
>         Hi Pavan, Thanks for your response.
>
>         After adding the -outfile-pattern ho.out -errfile-pattern he.err
>         in the
>         mpiexec command, the output file and error files were not
>         written to the
>         directory. I also tried adding -wdir. Another point I want to
>         mention
>         here is that the output file created by my script is not written
>         to the
>         output directory I mentioned in my script.
>
>         I suspect there may be some parameters or settings may be needed
>         to run
>         mpiexec successfully on condor.
>
>         Please advise.
>
>         Thanks
>         Savita
>
>
>
>
>         -----Original Message-----
>         From: Pavan Balaji [mailto:balaji at mcs.anl.gov
>         <mailto:balaji at mcs.anl.gov>]
>         Sent: Tuesday, December 27, 2011 12:01 AM
>         To: mpich-discuss at mcs.anl.gov <mailto:mpich-discuss at mcs.anl.gov>
>         Cc: Shrivastava, Savita
>         Subject: Re: [mpich-discuss] Hydra process manager on Condor
>
>
>         Hydra does not understand Condor's parameters.  But you can
>         emulate the
>         behavior you want by setting these options for mpiexec:
>
>            -outfile-pattern ho.out -errfile-pattern he.err
>
>         You can also do more fancy things like:
>
>            -outfile-pattern ho.%r.out -errfile-pattern he.%r.err
>
>         which uses different files for each rank.
>
>         See mpiexec -outfile-pattern -help for more information on other
>         patterns.
>
>            -- Pavan
>
>         On 12/19/2011 12:27 PM, Shrivastava, Savita wrote:
>
>             Hi,
>
>             We have installed MPICH2 with Hydra process manager on our
>             Condor
>             cluster. When I submit an mpi job to condor, the job is
>             transferred to
>             the execute node and executed properly (as I looked in the
>             condor log
>
>         on
>
>             execute node where the job was executing) but it does not
>             write the
>             standard output from script to condor output file mentioned
>             in job
>             description file. Please guide me here how to run the mpi job
>             successfully on condor using hydra process manager.
>
>             My job description as below. The perl script has standard output
>
>         "Hello".
>
>
>             universe = parallel
>
>             executable = /usr/lib64/mpich2/bin/mpiexec
>
>             arguments = -n 2 -machinefile machinefile test2.pl
>             <http://test2.pl>
>
>             getenv=true
>
>             machine_count = 1
>
>             should_transfer_files = yes
>
>             when_to_transfer_output = on_exit
>
>             transfer_input_files = test2.pl <http://test2.pl>
>
>             output = ho.out
>
>             error = he.err
>
>             log = hl.log
>
>             Requirements = Memory>= 1024&&  Cpus>=2
>
>
>             request_cpus = 2
>
>             request_memory = 1024
>
>             queue
>
>             Thanks
>
>             Savita
>
>
>
>             _________________________________________________
>             mpich-discuss mailing list mpich-discuss at mcs.anl.gov
>             <mailto:mpich-discuss at mcs.anl.gov>
>             To manage subscription options or unsubscribe:
>             https://lists.mcs.anl.gov/__mailman/listinfo/mpich-discuss
>             <https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss>
>
>
>
>     --
>     Pavan Balaji
>     http://www.mcs.anl.gov/~balaji <http://www.mcs.anl.gov/%7Ebalaji>
>     _________________________________________________
>     mpich-discuss mailing list mpich-discuss at mcs.anl.gov
>     <mailto:mpich-discuss at mcs.anl.gov>
>     To manage subscription options or unsubscribe:
>     https://lists.mcs.anl.gov/__mailman/listinfo/mpich-discuss
>     <https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss>
>
>
>
>
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji


More information about the mpich-discuss mailing list