[Swift-devel] feature request

Michael Wilde wilde at mcs.anl.gov
Wed Apr 15 14:33:07 CDT 2009



On 4/15/09 2:22 PM, Mihael Hategan wrote:
> On Wed, 2009-04-15 at 14:05 -0500, Glen Hocky wrote:
>> The problem with the first method was that the number of jobs, i.e. 
>> score increased too slowly. In that configuration, i believe that the 
>> behavior was 1 or 2 coasters were submitted to a single site and none to 
>> the others and then it just stayed that way for a long time.
> 
> The behavior you mention is contrary to what should be happening, in
> that all sites should have had 2 swift jobs submitted to.
> 
> It is possible that you've made the observation while coasters were not
> working properly on certain sites. The solution to that is not to
> fundamentally re-engineer the way swift submission works, but to make
> coasters run properly on those sites.

And also to find some way to explain to users what to expect, adhering 
to the principle of least-astonishment.

>> Another problem w/ the default configuration was on sites w/ Coasters 
>> per node > 1. My experience on ranger with the default parameters is 
>> that 2 coasters would start in the queue and only ~6 jobs would run on 
>> them, rather than 32.  Since our jobs take ~1 hour, this means that for 
>> 32 hours of CPU time, I was getting about 6 CPU hours of work. and even 
>> after jobs started finishing in that config, the ramp up was too slow
> 
> That is indeed a scenario which cannot be addressed by the scheme I
> mentioned. Telling swift that a site has a certain granularity, and 2
> jobs eat the same resources as 16 jobs, is not something we have. Though
> I think the scoring could easily be adapted for that.

Could it not glean that from coastersPerNode? Or is that what you mean 
by "scoring could be adapted"?

> 
> 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel



More information about the Swift-devel mailing list