[Swift-devel] stageout ordering vs restarts

Ian Foster foster at mcs.anl.gov
Sun Jul 6 13:32:09 CDT 2008


Ben:

That's an interesting observation.

Can we control the relative priorities of stage outs and stage ins?

Ian.

On Jul 6, 2008, at 11:41 AM, Ben Clifford wrote:

>
> At present, stageouts for jobs tend to execute quite late in a run,  
> in as
> much as when there are other jobs to run, the stageins for those  
> jobs will
> usually use available file transfer rate-limit load before stageouts
> happen.
>
> I've noticed this before as a user interface quirk - users see GRAM  
> jobs
> complete on remote sites, but do not see output files appear on the  
> submit
> side until much much later and sometimes misinterpret that as a  
> failure.
>
> However, I think there is an issue here with how restarts work too.  
> Jobs
> are not recorded as done for the purposes of restart (i.e. will not be
> re-executed) until stageout has finished.
>
> When stageout is happening late, that means in late-stageout  
> situations,
> lots of work will be done but to the extent that it can be ignored on
> restarts.
>
> So that makes early-stageout behaviour more appealing in some  
> situations -
> situations in which it is expected that a restart will be necessary,  
> or
> where it is preferable to have slower job execution in exchange for  
> more
> stuff marked as done in the restart logs.
>
> That is perhaps worth thinking about as part of the project that  
> Ragib is
> working on.
>
> -- 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel




More information about the Swift-devel mailing list