[Swift-devel] second wave of jobs do not start

Mihael Hategan hategan at mcs.anl.gov
Wed Mar 11 14:21:15 CDT 2015


And I'd like to know what the issue was!

Mihael

On Wed, 2015-03-11 at 14:16 -0500, Ketan Maheshwari wrote:
> Hi,
> 
> Please ignore, this was resolved after discussion and debugging with Mike.
> 
> --Ketan
> 
> On Wed, Mar 11, 2015 at 10:33 AM, Ketan Maheshwari <ketan at mcs.anl.gov>
> wrote:
> 
> > Hi
> >
> > With trunk, coasters on ALCF, I am seeing that after a first wave of jobs
> > finish, the second wave does not start.
> >
> > After the completion of first wave of jobs, the Swift progress text shows
> > jobs in submitted state while the queue (qstat) still shows running status.
> > After a while the queue walltime expires and there are no more new jobs
> > submitted to the queue.
> >
> > Two worker log files are created for the run, possibly the worker shuts
> > down and restarts for a second wave.
> >
> > Attached are the run log and worker logs.
> >
> > Thanks for any help debugging/fixing.
> > --
> > Ketan
> >
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel





More information about the Swift-devel mailing list