[Swift-devel] Re: [Swift-user] pbs ppn count and stuff
Michael Wilde
wilde at mcs.anl.gov
Wed Feb 2 14:08:51 CST 2011
Would 2PM tomorrow work? Justin, can you join this discussion? I'll set up a conf call once we confirm a time.
Im inserting below an email thread started by Matt at NCAR on this topic. He also refers back to a very old thread in which the same issue was raised.
- Mike
----- Forwarded Message -----
From: "Matthew Woitaszek" <matthew.woitaszek at gmail.com>
To: "Allan Espinosa" <aespinosa at cs.uchicago.edu>
Cc: swift-user at ci.uchicago.edu
Sent: Thursday, November 4, 2010 10:06:48 AM
Subject: Re: [Swift-user] Coasters and PBS resource requests: nodes and ppn
Hi Allan,
Yep, that's it. When the coasters resource request comes in with just "nodes=1", it gets interpreted by PBS as nodes=1:ppn=1, and thus PBS puts other jobs on the node, too, until all 8 CPUs are allocated (e.g., 8 1-cpu PBS jobs are running on it).
I'd like to find some way to make the request as:
nodes=1:ppn=8
along with
workersPerNode=8
so that PBS allocates one node and all 8 processors, and then one Coasters job would put 8 workers on it, matching the resource request with the use.
Matthew
On Wed, Nov 3, 2010 at 5:41 PM, Allan Espinosa < aespinosa at cs.uchicago.edu > wrote:
Hi Matthew,
Does this mean, coasters will now submit nodes=1;ppn=1 and do node packing?
If there is no node packing being initiated by PBS, you can just
specify workersPerNode=8 . But then what you request to PBS is now
different to what you actually use.
-Allan
2010/11/3 Matthew Woitaszek < matthew.woitaszek at gmail.com >:
> Good afternoon,
>
> Is there a way to update PBS resource requests when using coasters to supply
> modified PBS resource strings such as "nodes=1:ppn=8"? (Or other arbitrary
> resource requests, such as node properties?)
>
> Of course, I'm just trying to get coasters to allocate all of the processors
> on an 8-core node, using either the "gt2:gt2:pbs" or "local:pbs" provider.
> Both submit jobs just fine. I found no discernible difference with the
> "host_types" Globus namespace variable, presuming I'm setting it right.
>
> The particular cluster I'm using allows node packing for users that run lots
> of single-processor tasks, so without ppn, it will assume nodes=1,ncpus=1
> and thus pack 8 jobs on each node before moving on to the next node. (I know
> it won't be an issue at sites that make nodes exclusive. On this system, the
> queue default is "nodes=1:ppn=8", but because coasters explicitly specifies
> the number of nodes in its generated resource request, the ppn default seems
> to get lost!)
>
> I see that this has been discussed as far back as 2007, and I found Marcin
> and Mike's previous discussion of the topic at
>
> http://mail.ci.uchicago.edu/pipermail/swift-user/2010-March/001409.html
>
> but there didn't seem to be any definitive conclusion. Any suggestions would
> be appreciated!
>
> Matthew
>
--
Allan M. Espinosa < http://amespinosa.wordpress.com >
PhD student, Computer Science
University of Chicago < http://people.cs.uchicago.edu/~aespinosa >
_______________________________________________
Swift-user mailing list
Swift-user at ci.uchicago.edu
http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
--
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory
----- Original Message -----
> On Tue, 2011-02-01 at 15:34 -0600, Michael Wilde wrote:
>
> > Lets start with a voice call and then bring the issue back to the
> > devel list.
>
> Can we do this on Thursday after 12:30 Chicago time?
>
> Mihael
--
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory
More information about the Swift-devel
mailing list