[Swift-devel] suggestion please on hanging/sleeping/slow wf

Mihael Hategan hategan at mcs.anl.gov
Wed May 2 11:16:52 CDT 2007


On Wed, 2007-05-02 at 16:03 +0000, Ben Clifford wrote:
> 
> On Tue, 1 May 2007, Tiberiu Stef-Praun wrote:
> 
> > I have a workflow that generates 5000 files.
> > The execution seems to have halted, for no obvious reason:
> 
> In the past few days, I've hit hangs a bunch of times in various places - 
> more than I've ever seen before, but I am doing more complicated things 
> recently compared to before (which was running a few relatively trivial 
> jobs in a bunch of relatively trivial workflows).
> 
> It's an awkward user experience. In some cases, the code should perhaps 
> detect such hangs; and in other cases, perhaps different logging info in 
> the -debug output would be useful...

Yep. The question is how.

> 
> > - there are no more jobs in the queue
> > - no errors are reported in the logfile
> > - NOTE: some of the input files have not been staged in yet, yet the
> > workflow is hanging
> > - NOTE: the remote application temp directory is GONE, only the
> > shared directory is still there
> > - apparently all the output files that are in /shared have been sent
> > back (staged out)
> > 
> > What to do, what to do?
> > 
> > The workflow is sid-wf.dtm in ~tiberius/scratch on teraport
> > It uses the config files in ~tiberius/local/swift-conf
> > 
> > 
> > 



