This seems to be fixed now with the latest version. The test script ran successfully around 2000 times before I killed it.<br><br>David<br><br><div class="gmail_quote">On Thu, Jun 23, 2011 at 1:28 PM, Mihael Hategan <span dir="ltr"><<a href="mailto:hategan@mcs.anl.gov">hategan@mcs.anl.gov</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">I committed a tentative fix to svn. swift trunk r4666.<br>
<div><div></div><div class="h5"><br>
On Sat, 2011-06-18 at 21:21 -0500, David Kelly wrote:<br>
> Here's one I got with the latest version tonight:<br>
><br>
> 2011-06-18 21:01:34<br>
> Full thread dump Java HotSpot(TM) Server VM (19.1-b02 mixed mode):<br>
><br>
> "Attach Listener" daemon prio=10 tid=0x087d6c00 nid=0x882 runnable<br>
> [0x00000000]<br>
> java.lang.Thread.State: RUNNABLE<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Progress ticker" daemon prio=10 tid=0x9e854400 nid=0x85e waiting on<br>
> condition [0x9dfad000]<br>
> java.lang.Thread.State: TIMED_WAITING (sleeping)<br>
> at java.lang.Thread.sleep(Native Method)<br>
> at org.griphyn.vdl.karajan.lib.RuntimeStats<br>
> $ProgressTicker.run(RuntimeStats.java:141)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Restart Log Sync" daemon prio=10 tid=0x9e82bc00 nid=0x85d in<br>
> Object.wait() [0x9dffe000]<br>
> java.lang.Thread.State: WAITING (on object monitor)<br>
> at java.lang.Object.wait(Native Method)<br>
> - waiting on <0xaedb4778> (a<br>
> org.globus.cog.karajan.workflow.nodes.restartLog.SyncThread)<br>
> at java.lang.Object.wait(Object.java:485)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.restartLog.SyncThread.run(SyncThread.java:47)<br>
> - locked <0xaedb4778> (a<br>
> org.globus.cog.karajan.workflow.nodes.restartLog.SyncThread)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Overloaded Host Monitor" daemon prio=10 tid=0x08b19400 nid=0x85c<br>
> waiting on condition [0x9e15c000]<br>
> java.lang.Thread.State: TIMED_WAITING (sleeping)<br>
> at java.lang.Thread.sleep(Native Method)<br>
> at<br>
> org.globus.cog.karajan.scheduler.OverloadedHostMonitor.run(OverloadedHostMonitor.java:47)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Timer-0" daemon prio=10 tid=0x08354400 nid=0x85b in Object.wait()<br>
> [0x9e1ad000]<br>
> java.lang.Thread.State: TIMED_WAITING (on object monitor)<br>
> at java.lang.Object.wait(Native Method)<br>
> - waiting on <0xaf5e2c38> (a java.util.TaskQueue)<br>
> at java.util.TimerThread.mainLoop(Timer.java:509)<br>
> - locked <0xaf5e2c38> (a java.util.TaskQueue)<br>
> at java.util.TimerThread.run(Timer.java:462)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "NBS0" daemon prio=10 tid=0x087de000 nid=0x85a waiting on condition<br>
> [0x9e1fe000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5e3628> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-8" prio=10 tid=0x08445c00 nid=0x859 waiting for monitor<br>
> entry [0x9e369000]<br>
> java.lang.Thread.State: BLOCKED (on object monitor)<br>
> at org.griphyn.vdl.karajan.WrapperMap.close(WrapperMap.java:25)<br>
> - waiting to lock <0xaf5ed6c8> (a<br>
> org.griphyn.vdl.karajan.WrapperMap)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.closeShallow(VDLFunction.java:516)<br>
> - locked <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.SetFieldValue.deepCopy(SetFieldValue.java:121)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.SetFieldValue.function(SetFieldValue.java:49)<br>
> - locked <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:67)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.Argument.post(Argument.java:48)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:71)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:28)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:28)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:29)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:20)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:139)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:197)<br>
> at<br>
> org.globus.cog.karajan.workflow.FlowElementWrapper.start(FlowElementWrapper.java:227)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventBus.start(EventBus.java:104)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventTargetPair.run(EventTargetPair.java:40)<br>
> at java.util.concurrent.Executors<br>
> $RunnableAdapter.call(Executors.java:441)<br>
> at java.util.concurrent.FutureTask<br>
> $Sync.innerRun(FutureTask.java:303)<br>
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.runTask(ThreadPoolExecutor.java:886)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:908)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - <0xaf5e3b10> (a java.util.concurrent.locks.ReentrantLock<br>
> $NonfairSync)<br>
><br>
> "pool-1-thread-7" prio=10 tid=0x08443000 nid=0x858 waiting on<br>
> condition [0x9e3ba000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-6" prio=10 tid=0x08441800 nid=0x857 waiting for monitor<br>
> entry [0x9e40b000]<br>
> java.lang.Thread.State: BLOCKED (on object monitor)<br>
> at<br>
> org.griphyn.vdl.mapping.AbstractDataNode.addListener(AbstractDataNode.java:583)<br>
> - waiting to lock <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.DSHandleFutureWrapper.<init>(DSHandleFutureWrapper.java:24)<br>
> at<br>
> org.griphyn.vdl.karajan.WrapperMap.addNodeListener(WrapperMap.java:61)<br>
> - locked <0xaf5ed6c8> (a org.griphyn.vdl.karajan.WrapperMap)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.addFutureListener(VDLFunction.java:523)<br>
> at org.griphyn.vdl.karajan.lib.Stagein.function(Stagein.java:88)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:67)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:29)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:20)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:139)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:197)<br>
> at<br>
> org.globus.cog.karajan.workflow.FlowElementWrapper.start(FlowElementWrapper.java:227)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventBus.start(EventBus.java:104)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventTargetPair.run(EventTargetPair.java:40)<br>
> at java.util.concurrent.Executors<br>
> $RunnableAdapter.call(Executors.java:441)<br>
> at java.util.concurrent.FutureTask<br>
> $Sync.innerRun(FutureTask.java:303)<br>
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.runTask(ThreadPoolExecutor.java:886)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:908)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - <0xaf5ec3a0> (a java.util.concurrent.locks.ReentrantLock<br>
> $NonfairSync)<br>
><br>
> "pool-1-thread-5" prio=10 tid=0x085d2000 nid=0x856 waiting on<br>
> condition [0x9e45c000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-4" prio=10 tid=0x085d2800 nid=0x855 waiting on<br>
> condition [0x9e4ad000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-3" prio=10 tid=0x08839400 nid=0x854 waiting on<br>
> condition [0x9e65c000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-2" prio=10 tid=0x08837800 nid=0x853 waiting on<br>
> condition [0x9e6ad000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "pool-1-thread-1" prio=10 tid=0x9e826c00 nid=0x852 waiting on<br>
> condition [0x9e6fe000]<br>
> java.lang.Thread.State: WAITING (parking)<br>
> at sun.misc.Unsafe.park(Native Method)<br>
> - parking to wait for <0xaf5ec2b0> (a<br>
> java.util.concurrent.locks.AbstractQueuedSynchronizer$ConditionObject)<br>
> at<br>
> java.util.concurrent.locks.LockSupport.park(LockSupport.java:158)<br>
> at java.util.concurrent.locks.AbstractQueuedSynchronizer<br>
> $ConditionObject.await(AbstractQueuedSynchronizer.java:1987)<br>
> at<br>
> java.util.concurrent.LinkedBlockingQueue.take(LinkedBlockingQueue.java:399)<br>
> at<br>
> java.util.concurrent.ThreadPoolExecutor.getTask(ThreadPoolExecutor.java:947)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:907)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Hang checker" prio=10 tid=0x9e81bc00 nid=0x851 waiting for monitor<br>
> entry [0x9e4fe000]<br>
> java.lang.Thread.State: BLOCKED (on object monitor)<br>
> at org.griphyn.vdl.karajan.Monitor.dumpVariables(Monitor.java:220)<br>
> - waiting to lock <0xaf5ed6c8> (a<br>
> org.griphyn.vdl.karajan.WrapperMap)<br>
> at org.griphyn.vdl.karajan.HangChecker.run(HangChecker.java:54)<br>
> at java.util.TimerThread.mainLoop(Timer.java:512)<br>
> at java.util.TimerThread.run(Timer.java:462)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Low Memory Detector" daemon prio=10 tid=0x08235800 nid=0x84f runnable<br>
> [0x00000000]<br>
> java.lang.Thread.State: RUNNABLE<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "CompilerThread1" daemon prio=10 tid=0x9f4a9800 nid=0x84e waiting on<br>
> condition [0x00000000]<br>
> java.lang.Thread.State: RUNNABLE<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "CompilerThread0" daemon prio=10 tid=0x9f4a7800 nid=0x84d waiting on<br>
> condition [0x00000000]<br>
> java.lang.Thread.State: RUNNABLE<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Signal Dispatcher" daemon prio=10 tid=0x9f4a5c00 nid=0x84c runnable<br>
> [0x00000000]<br>
> java.lang.Thread.State: RUNNABLE<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Finalizer" daemon prio=10 tid=0x9f497400 nid=0x84b in Object.wait()<br>
> [0x9f194000]<br>
> java.lang.Thread.State: WAITING (on object monitor)<br>
> at java.lang.Object.wait(Native Method)<br>
> - waiting on <0xa398ce08> (a java.lang.ref.ReferenceQueue$Lock)<br>
> at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:118)<br>
> - locked <0xa398ce08> (a java.lang.ref.ReferenceQueue$Lock)<br>
> at java.lang.ref.ReferenceQueue.remove(ReferenceQueue.java:134)<br>
> at java.lang.ref.Finalizer$FinalizerThread.run(Finalizer.java:159)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "Reference Handler" daemon prio=10 tid=0x9f496000 nid=0x84a in<br>
> Object.wait() [0x9f1e5000]<br>
> java.lang.Thread.State: WAITING (on object monitor)<br>
> at java.lang.Object.wait(Native Method)<br>
> - waiting on <0xa397b6e8> (a java.lang.ref.Reference$Lock)<br>
> at java.lang.Object.wait(Object.java:485)<br>
> at java.lang.ref.Reference<br>
> $ReferenceHandler.run(Reference.java:116)<br>
> - locked <0xa397b6e8> (a java.lang.ref.Reference$Lock)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "main" prio=10 tid=0x08224400 nid=0x844 in Object.wait() [0xb6a06000]<br>
> java.lang.Thread.State: WAITING (on object monitor)<br>
> at java.lang.Object.wait(Native Method)<br>
> - waiting on <0xaf50ac30> (a<br>
> org.griphyn.vdl.karajan.VDL2ExecutionContext)<br>
> at java.lang.Object.wait(Object.java:485)<br>
> at<br>
> org.globus.cog.karajan.workflow.ExecutionContext.waitFor(ExecutionContext.java:226)<br>
> - locked <0xaf50ac30> (a<br>
> org.griphyn.vdl.karajan.VDL2ExecutionContext)<br>
> at org.griphyn.vdl.karajan.Loader.main(Loader.java:201)<br>
><br>
> Locked ownable synchronizers:<br>
> - None<br>
><br>
> "VM Thread" prio=10 tid=0x9f492400 nid=0x849 runnable<br>
><br>
> "GC task thread#0 (ParallelGC)" prio=10 tid=0x0822b800 nid=0x845<br>
> runnable<br>
><br>
> "GC task thread#1 (ParallelGC)" prio=10 tid=0x0822cc00 nid=0x846<br>
> runnable<br>
><br>
> "GC task thread#2 (ParallelGC)" prio=10 tid=0x0822e400 nid=0x847<br>
> runnable<br>
><br>
> "GC task thread#3 (ParallelGC)" prio=10 tid=0x0822f800 nid=0x848<br>
> runnable<br>
><br>
> "VM Periodic Task Thread" prio=10 tid=0x9f4b4000 nid=0x850 waiting on<br>
> condition<br>
><br>
> JNI global references: 1392<br>
><br>
><br>
> Found one Java-level deadlock:<br>
> =============================<br>
> "pool-1-thread-8":<br>
> waiting to lock monitor 0x08b1859c (object 0xaf5ed6c8, a<br>
> org.griphyn.vdl.karajan.WrapperMap),<br>
> which is held by "pool-1-thread-6"<br>
> "pool-1-thread-6":<br>
> waiting to lock monitor 0x9e89178c (object 0xaed9c108, a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode),<br>
> which is held by "pool-1-thread-8"<br>
><br>
> Java stack information for the threads listed above:<br>
> ===================================================<br>
> "pool-1-thread-8":<br>
> at org.griphyn.vdl.karajan.WrapperMap.close(WrapperMap.java:25)<br>
> - waiting to lock <0xaf5ed6c8> (a<br>
> org.griphyn.vdl.karajan.WrapperMap)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.closeShallow(VDLFunction.java:516)<br>
> - locked <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.SetFieldValue.deepCopy(SetFieldValue.java:121)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.SetFieldValue.function(SetFieldValue.java:49)<br>
> - locked <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:67)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.Argument.post(Argument.java:48)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:71)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:28)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.completed(AbstractSequentialWithArguments.java:194)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:214)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:28)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:29)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:20)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:139)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:197)<br>
> at<br>
> org.globus.cog.karajan.workflow.FlowElementWrapper.start(FlowElementWrapper.java:227)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventBus.start(EventBus.java:104)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventTargetPair.run(EventTargetPair.java:40)<br>
> at java.util.concurrent.Executors<br>
> $RunnableAdapter.call(Executors.java:441)<br>
> at java.util.concurrent.FutureTask<br>
> $Sync.innerRun(FutureTask.java:303)<br>
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.runTask(ThreadPoolExecutor.java:886)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:908)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
> "pool-1-thread-6":<br>
> at<br>
> org.griphyn.vdl.mapping.AbstractDataNode.addListener(AbstractDataNode.java:583)<br>
> - waiting to lock <0xaed9c108> (a<br>
> org.griphyn.vdl.mapping.RootArrayDataNode)<br>
> at<br>
> org.griphyn.vdl.karajan.DSHandleFutureWrapper.<init>(DSHandleFutureWrapper.java:24)<br>
> at<br>
> org.griphyn.vdl.karajan.WrapperMap.addNodeListener(WrapperMap.java:61)<br>
> - locked <0xaf5ed6c8> (a org.griphyn.vdl.karajan.WrapperMap)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.addFutureListener(VDLFunction.java:523)<br>
> at org.griphyn.vdl.karajan.lib.Stagein.function(Stagein.java:88)<br>
> at<br>
> org.griphyn.vdl.karajan.lib.VDLFunction.post(VDLFunction.java:67)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:29)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:20)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:139)<br>
> at<br>
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:197)<br>
> at<br>
> org.globus.cog.karajan.workflow.FlowElementWrapper.start(FlowElementWrapper.java:227)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventBus.start(EventBus.java:104)<br>
> at<br>
> org.globus.cog.karajan.workflow.events.EventTargetPair.run(EventTargetPair.java:40)<br>
> at java.util.concurrent.Executors<br>
> $RunnableAdapter.call(Executors.java:441)<br>
> at java.util.concurrent.FutureTask<br>
> $Sync.innerRun(FutureTask.java:303)<br>
> at java.util.concurrent.FutureTask.run(FutureTask.java:138)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.runTask(ThreadPoolExecutor.java:886)<br>
> at java.util.concurrent.ThreadPoolExecutor<br>
> $Worker.run(ThreadPoolExecutor.java:908)<br>
> at java.lang.Thread.run(Thread.java:662)<br>
><br>
> Found 1 deadlock.<br>
><br>
><br>
> On Sat, Jun 18, 2011 at 2:52 PM, Mihael Hategan <<a href="mailto:hategan@mcs.anl.gov">hategan@mcs.anl.gov</a>><br>
> wrote:<br>
> Post the entire output of jstack please.<br>
><br>
><br>
> On Sat, 2011-06-18 at 00:08 -0500, Alberto Chavez wrote:<br>
> > I already did the svn updates for cog, and swift, rebuilt<br>
> swift with<br>
> > ant redist, and ant clean + ant dist,<br>
> > the test keeps hanging, but I'm missing probably something:<br>
> ><br>
> ><br>
> > $ svn update cog<br>
> > At revision 3167.<br>
> ><br>
> ><br>
> > $ cd cog/modules/<br>
> > $ svn update swift<br>
> > At revision 4632.<br>
> ><br>
> ><br>
> > I did ant redist, and it was successfully built, but the<br>
> test is still<br>
> > hanging, now it hung on the 11th iteration. However I'm not<br>
> quite sure<br>
> > if the svn was properly updated, since I did<br>
> ><br>
> ><br>
> > $ jstack -l 11471 | grep addListener<br>
> > at<br>
> ><br>
> org.griphyn.vdl.mapping.AbstractDataNode.addListener(AbstractDataNode.java:583)<br>
> > at<br>
> ><br>
> org.griphyn.vdl.mapping.AbstractDataNode.addListener(AbstractDataNode.java:583)<br>
> ><br>
> ><br>
> > but Mihael Mentioned that addListener is not<br>
> AbstractDataNode in the<br>
> > newer version.<br>
> > Any thoughts on that?<br>
> ><br>
> ><br>
> > Alberto.<br>
> > > Subject: RE: [Swift-devel] Swift unresponsive while using<br>
> local<br>
> > provider.<br>
> > > From: <a href="mailto:hategan@mcs.anl.gov">hategan@mcs.anl.gov</a><br>
> > > To: <a href="mailto:alberto_chavez@live.com">alberto_chavez@live.com</a><br>
> > > CC: <a href="mailto:ketancmaheshwari@gmail.com">ketancmaheshwari@gmail.com</a>;<br>
> <a href="mailto:swift-devel@ci.uchicago.edu">swift-devel@ci.uchicago.edu</a><br>
> > > Date: Fri, 17 Jun 2011 21:09:04 -0700<br>
> > ><br>
> > > On Fri, 2011-06-17 at 22:50 -0500, Alberto Chavez wrote:<br>
> > > > Oops, Too late.<br>
> > > > Do i need to do ant clean , and ant dist again?<br>
> > ><br>
> > > You're probably fine in most cases with just "ant dist".<br>
> But if you<br>
> > want<br>
> > > to be sure, do what Jonathan is saying: "ant redist"<br>
> > ><br>
> > ><br>
> > ><br>
> ><br>
><br>
><br>
><br>
><br>
> _______________________________________________<br>
> Swift-devel mailing list<br>
> <a href="mailto:Swift-devel@ci.uchicago.edu">Swift-devel@ci.uchicago.edu</a><br>
> <a href="http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel" target="_blank">http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel</a><br>
><br>
><br>
<br>
<br>
</div></div></blockquote></div><br>