[Swift-devel] execution.retries
lixi at uchicago.edu
lixi at uchicago.edu
Tue Jun 10 12:39:00 CDT 2008
>Say you have two sites.
>
>Site A always fails fast.
>Site B accepts jobs normally.
>
>You have three jobs to submit, job J, K, L, which take a
long time to
>run.
>
>at t=0
>We submit jobs randomly to available sites:
>Job J is submitted to site A.
>Job K is submitted to site B.
>Job L is submitted to site B.
>
>t=1
>Site B is busy executing job K, and L
>Job J fails on site A. We look for somewhere to retry it.
Site B has 0
>slots free. Site A has 2 slots free. We send the job to
site A.
>
>t=2
>same happens.
>
>t=3
>same happens.
>
>Now we have retried job J three times, and so the workflow
ultimately
>fails.
>
>t=1000
>job K and job L complete successfully.
Thanks for clear explaination. :)
Xi
More information about the Swift-devel
mailing list