[Swift-devel] replication/recall of jobs from slow queues

Mihael Hategan hategan at mcs.anl.gov
Wed May 21 18:42:16 CDT 2008


On Wed, 2008-05-21 at 18:40 -0500, Mihael Hategan wrote:
> On Wed, 2008-05-21 at 23:34 +0000, Ben Clifford wrote:
> > On Sun, 4 May 2008, Ben Clifford wrote:
> > 
> > > One way I was thinking of testing on a real site is to set profile keys so 
> > > that jobs go into a condor pool with a requirement to not run for a 
> > > specified time after submission (I think that is expressible in the 
> > > classad language). That should give reproducible at-least-one-resubmission 
> > > behaviour.
> > 
> > I tried a couple of these on fletch:
> > 
> >  MY.QDate + 180 < TARGET.LastHeardFrom
> >  MY.QDate + 180 < MY.ServerTime
> > 
> > and whilst both match after three minutes, the jobs then don't execute. 
> > grr.
> 
> Won't happen if you have a single job.

Correction: won't happen if no jobs have ever changed state from queued
to active.

>  Perhaps there should be a "guess"
> average to start with. In the mean time you can run a dummy echo or
> something first.
> 
> > 




More information about the Swift-devel mailing list