<!DOCTYPE html PUBLIC "-//W3C//DTD HTML 4.01 Transitional//EN">
<html>
<head>
<meta content="text/html;charset=ISO-8859-1" http-equiv="Content-Type">
</head>
<body bgcolor="#ffffff" text="#000000">
Thanks, Ben<br>
<br>
Ben Clifford wrote:
<blockquote
cite="mid:Pine.LNX.4.64.0803310129150.5372@dildano.hawaga.org.uk"
type="cite">
<pre wrap="">On Sun, 30 Mar 2008, Ioan Raicu wrote:
</pre>
<blockquote type="cite">
<blockquote type="cite">
<pre wrap="">runam6 failed
</pre>
</blockquote>
</blockquote>
<blockquote type="cite">
<blockquote type="cite">
<pre wrap="">Directory: amps2-20080330-1849-hnpls37c/jobs/y/runam6-yvkudjqi
stderr.txt: mkdir: cannot create directory `am.000000': File exists
</pre>
</blockquote>
</blockquote>
<pre wrap=""><!---->
I think when I've seen that error before, it's not been swift-level retries
that have been happening - when Swift retries a job, it gets a different
identifier ('yvkudjqi' in the above). If a job gets partly executed and
then retried by the underlying execution mechanism below swift (e.g. any
part of cog downwards) then the above will happen.
Does falkon ever try to retry a job that it's been given if it thinks
something went wrong? If so, that might cause a problem here - what needs
to happen is that the failure gets reported all the way back to swift for
swift to do a retry.
</pre>
</blockquote>
Nope, Falkon doesn't retry jobs for now.<br>
<blockquote
cite="mid:Pine.LNX.4.64.0803310129150.5372@dildano.hawaga.org.uk"
type="cite">
<pre wrap="">
Another cause might be duplicate job IDs generated within swift (the
'yvkudjqi' string again), but that would be very unusual (as in, I've never
seen that happen).
</pre>
<blockquote type="cite">
<pre wrap="">1) How do we disable the retry mechanism, to make sure that Swift won't retry
failed jobs?
</pre>
</blockquote>
<pre wrap=""><!---->
What Quan said - set execution.retries=0 in swift.properties
</pre>
<blockquote type="cite">
<pre wrap="">2) How do we configure Swift to continue sending all tasks it is able to (in
our case, it should be all tasks, as we only have 1 for loop, with no data
dependencies between iterations), although all tasks will eventually fail?
</pre>
</blockquote>
<pre wrap=""><!---->
throttle.score.job.factor=off
will do what you want, I think.
</pre>
</blockquote>
ok, I will try this.<br>
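<br>
For reference, a minimal sketch of the two swift.properties entries suggested above, assuming both go in the same file. The comments are my reading of this thread, not text from the Swift documentation:<br>
<pre wrap=""># Suggested above: do not retry failed jobs at the Swift level
execution.retries=0
# Suggested above: intended to let Swift keep sending tasks even when they fail
throttle.score.job.factor=off
</pre>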
<br>
<br>
best wishes<br>
zhangzhao<br>
</body>
</html>