[Swift-devel] second wave of jobs do not start

Ketan Maheshwari ketan at mcs.anl.gov
Wed Mar 11 14:16:29 CDT 2015


Hi,

Please ignore this; it was resolved after discussion and debugging with Mike.

--Ketan

On Wed, Mar 11, 2015 at 10:33 AM, Ketan Maheshwari <ketan at mcs.anl.gov>
wrote:

> Hi
>
> With trunk and coasters on ALCF, I am seeing that after the first wave of
> jobs finishes, the second wave does not start.
>
> After the first wave of jobs completes, the Swift progress text shows
> jobs in the submitted state while the queue (qstat) still shows running
> status. After a while the queue walltime expires and no new jobs are
> submitted to the queue.
>
> Two worker log files are created for the run; possibly the worker shuts
> down and restarts for the second wave.
>
> Attached are the run log and worker logs.
>
> Thanks for any help debugging/fixing.
> --
> Ketan
>

