[Swift-devel] [Bug 101] failure in site initialisation appears to cause job to fail rather than be retried elsewhere.
bugzilla-daemon at mcs.anl.gov
Wed Apr 9 16:19:04 CDT 2008
http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=101
------- Comment #2 from benc at hawaga.org.uk 2008-04-09 16:19 -------
There's another example of this in ccf-perm-wf-20080409-1511-kz872673.log.
It looks like a permission error on one site makes jobs there fail quickly, so
that site becomes available for use again rapidly, whilst the other sites (3 of
them) are occupied running jobs successfully.
So a failed job is retried on the only free resource, the broken one, over and
over until the job itself fails.
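
To make the suspected behaviour concrete, here is a minimal sketch of the retry loop as I understand it. This is not Swift's scheduler code: the function name, the site dictionaries, and the retry limit of 3 are all made up for illustration, and the sketch assumes a retry simply goes to the first site that is free.

```python
# Hypothetical model of the retry behaviour described above; not Swift code.
def run_with_retries(sites, max_retries=3):
    """Try a job up to max_retries times, always picking the first free site.

    Returns (list of site names tried, final status string).
    """
    attempts = []
    for _ in range(max_retries):
        free = [s for s in sites if not s["busy"]]
        if not free:
            break  # nothing free; a real scheduler would wait here
        site = free[0]
        attempts.append(site["name"])
        if not site["broken"]:
            return attempts, "succeeded"
        # The broken site fails straight away, so it is free again
        # for the next attempt while the healthy sites stay busy.
    return attempts, "failed"


# Three healthy sites occupied with running jobs, one broken site idle.
sites = [
    {"name": "site-a", "busy": True, "broken": False},
    {"name": "site-b", "busy": True, "broken": False},
    {"name": "site-c", "busy": True, "broken": False},
    {"name": "site-d", "busy": False, "broken": True},
]

attempts, status = run_with_retries(sites)
print(attempts, status)
```

In this model every attempt lands on the broken site, so the job uses up all of its retries without ever being tried on a working one.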