[Swift-devel] execution.retries

Mihael Hategan hategan at mcs.anl.gov
Tue Jun 10 12:50:49 CDT 2008


On Tue, 2008-06-10 at 17:42 +0000, Ben Clifford wrote:
> On Tue, 10 Jun 2008, Mihael Hategan wrote:
> 
> > That's the one. However, I think that tscores <1 should be translated
> > into timed rate limitations. So if tscore = 10 means I can submit at
> > most jobThrottle*10 jobs, tscore = 0.1 should mean that I can submit
> > jobs no faster than some_number/tscore seconds. Like an exponential
> > back-off.
> 
> heh, I was just writing almost exactly the same email to you.
> 
> tscore=1 should 1 job slot available
> 
> tscore < 1 should mean one job slot available some of the time.
> 
> I'm not sure what the formula for calculating the <1 availability should 
> be, though. It needs to cope with rapidly slowing down in the presence of 
> fast fail, slowing down to the scale of other running jobs (so eg on the 
> scale of minute to hours) without overly slowing down.

It's taking a guess at when the site will recover and when other sites
will be available. I don't think there's a way to know. Which is why it
probably should be as exponential as possible.

> 
> Some experimentation there will probably help.
> 
> Related to this, I've been playing with provider-wonky a bit to make it 
> able to exhibit other failure modes such as this fast fail behaviour; but 
> nothing to commit yet.
> 




More information about the Swift-devel mailing list