[Swift-devel] [Bug 101] fast-failing sites will absorb large numbers of jobs causing runs to fail despite multiple attempts at retrying

bugzilla-daemon at mcs.anl.gov bugzilla-daemon at mcs.anl.gov
Mon Jun 16 09:52:47 CDT 2008


http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=101





------- Comment #4 from benc at hawaga.org.uk  2008-06-16 09:52 -------
Using provider-wonky it is possible to recreate this problem in a local
environment. The following site definitions will give one local executing site
that runs with a 5s delay with no failure and another site that will fast-fail
all jobs. Try it against eg tests/language-behaviour/130-fmri

  <pool handle="localhost">
    <gridftp  url="local://localhost" />
    <execution provider="wonky" url="good/activefail=0/activedelay=5" />
    <workdirectory >/var/tmp</workdirectory>
    <profile namespace="karajan" key="jobThrottle">0</profile>
  </pool>

  <pool handle="fastfail">
    <gridftp  url="local://localhost" />
    <execution provider="wonky" url="bad/activefail=1/activedelay=0" />
    <workdirectory >/var/tmp</workdirectory>
    <profile namespace="karajan" key="jobThrottle">0</profile>
  </pool>


-- 
Configure bugmail: http://bugzilla.mcs.anl.gov/swift/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
You reported the bug, or are watching the reporter.



More information about the Swift-devel mailing list