[Swift-devel] Re: Workflow waiting on condition hang
Jonathan Monette
jon.monette at gmail.com
Thu Mar 10 15:56:25 CST 2011
Ok. My mapping error has magically disappeared. I updated to
swift-r4175 and cog-r3057 from swift-r4171 and cog-r3056. I am not sure
what happened but have run my code 4-5 times with no mapping error.
I still have the condition hang. However the output that is reported
every 10s if no events happen is very helpful. I am going through my
scripts to track down on exactly what variable is hanging and if it
hangs at the same place.
On 3/6/11 7:22 PM, Jonathan Monette wrote:
> I will test this out. Currently I have another problem that
> appeared. It was reported in thread "[Swift-devel] Error in Swift
> mapping". I have not put together a simple script that re-creates
> this problem as I just went through my first wave of midterms. I will
> see if I can put together a script this week.
>
> On 3/6/11 3:46 PM, Mihael Hategan wrote:
>> Given that this does not seem to be a java deadlock, I added a hang
>> checker to swift. If nothing is being executed inside karajan and no
>> jobs are running in any ten second interval, it will dump future and
>> thread information to the log file.
>>
>> This is in swift trunk r4171.
>>
>> Can you give that a try and report back the details?
>>
>> Mihael
>>
>> On Sat, 2011-02-19 at 14:54 -0600, Jonathan Monette wrote:
>>> Yes. It always seems to hang at the same place.
>>>
>>> Attached is my montage script. It hangs in the mFitBatch function at
>>> the mConcatFit app call. All other files have been created up to that
>>> step but that app never runs.
>>>
>>> On 2/17/11 3:39 PM, Mihael Hategan wrote:
>>>> On Thu, 2011-02-17 at 15:13 -0600, Jonathan Monette wrote:
>>>>> Hello,
>>>>> My workflow seems to be hanging. This is trunk swift-r4107 and
>>>>> cog-r3051. Attached is a compressed log file and the jstack output for
>>>>> my workflow. The jstack file says it is waiting for a condition and my
>>>>> workflow hangs.
>>>> There's lots of stuff waiting because that's what they do when they
>>>> don't have anything else to do. So I don't see a problem there.
>>>>
>>>> There are no jobs going to the coaster service, so clearly things aren't
>>>> progressing.
>>>>
>>>> So now the question is: does this happen every time you run it or just
>>>> some times?
>>>>
>>>> Also, please send the swift script.
>>>>
>>>> Mihael
>>>>
>>>>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20110310/a23d74b7/attachment.html>
More information about the Swift-devel
mailing list