[Swift-user] Errors in 13-site OSG run: lazy error question

Mihael Hategan hategan at mcs.anl.gov
Thu Aug 26 23:27:38 CDT 2010


On Thu, 2010-08-26 at 22:11 -0600, Michael Wilde wrote:
> Glen, I wonder if what's happening here is that Swift will retry and
> lazily run past *job* errors, but the error below (a mapping error) is
> maybe being treated as an error in Swift's interpretation of the
> script itself, and this causes an immediate halt to execution?
> 
> Can anyone confirm that this is what's happening, and whether it is
> the expected behavior?

Right. Some errors are retriable. Jobs that fail with such errors get
retried in the hope that the errors will go away. This means that those
errors don't get reported until the last round of retries (and currently
only the last error is reported).

Some errors, such as the ones considered to be internal inconsistencies,
will cause everything to fail immediately.
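
To make the distinction concrete, here is a minimal sketch in Python. It
is not Swift's actual implementation; the names RetriableError,
FatalError, run_with_retries and max_attempts are made up for
illustration. It only shows the two behaviours described above: a
retriable error is held back and only the last one is reported after the
final attempt, while anything else propagates on the first occurrence.

```python
class RetriableError(Exception):
    """Hypothetical: a job-level failure that may go away on a later attempt."""


class FatalError(Exception):
    """Hypothetical: an internal inconsistency, e.g. a mapping error."""


def run_with_retries(job, max_attempts=3):
    """Call job(attempt) up to max_attempts times (assumed >= 1).

    Retriable errors are remembered but not reported until every attempt
    has failed, and then only the last one is raised. Any other
    exception, including FatalError, is not caught and stops everything
    immediately.
    """
    last_error = None
    for attempt in range(1, max_attempts + 1):
        try:
            return job(attempt)
        except RetriableError as err:
            last_error = err  # held back; earlier errors are overwritten
    raise last_error


def flaky(attempt):
    """Example job: fails on the first two attempts, then succeeds."""
    if attempt < 3:
        raise RetriableError(f"site failure on attempt {attempt}")
    return "done"


def bad_mapping(attempt):
    """Example job: fails with an error that is never retried."""
    raise FatalError("mapping error")
```

With these definitions, run_with_retries(flaky) returns "done" without
reporting the two earlier failures, whereas run_with_retries(bad_mapping)
raises FatalError after a single call.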





