[Swift-devel] Walltime exceeded error

Jonathan Monette jonmon at mcs.anl.gov
Mon Feb 20 16:26:49 CST 2012


No.  This was a run Ketan did a while back.  I have been using this as a reference when trying to re-create the issue with a simple catsnsleep job.

This run was also done on Beagle using the pre-installed java package, which does not have jstack.

On Feb 20, 2012, at 4:24 PM, Mihael Hategan wrote:

> I'm not sure if I asked this, but did you happen to get a jstack of the
> hanging swift?
> 
> On Mon, 2012-02-20 at 16:19 -0600, Jonathan Monette wrote:
>> No.  The last run was run using Beagle.  That is the more interesting one.  That shows jobs failed but the "Failed but can retry" count was not printed very often.  You can see that in the swift.out file.  Eventually the workflow just hung and the hang checker kicked in.  You can also see that Swift got stuck in the initializing state with a count of 61.
>> 
>> On Feb 20, 2012, at 4:16 PM, Mihael Hategan wrote:
>> 
>>> On Mon, 2012-02-20 at 16:14 -0600, Jonathan Monette wrote:
>>>> /gpfs/pads/swift/jonmon/Swift/tests/catsnsleep                                 <----- on /gpfs/pads
>>>> /home/jonmon/public_html/Swift/bugs/SciColSim/run002             <----- on any CI machine
>>> 
>>> Ok. Sorry. I thought the last one was on beagle.
>>> 
>> 
> 
> 




More information about the Swift-devel mailing list