[Swift-user] Block task failed: Connection to worker lost
Jonathan Ozik
xio247 at gmail.com
Wed Dec 3 13:16:13 CST 2014
Hi Yadu,
The tar.gz archive is here: https://www.dropbox.com/s/tt3ewapzaf0ygac/run001.tar.gz?dl=0
I’ve also attached the swift.properties file that I used.
Thank you,
Jonathan
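
As a quick sanity check on the quoted output below, here is a minimal Python sketch that computes how long the channel was idle before the TimeoutException and confirms that the Progress counters add up to the 2187 tasks in the run. The timestamp layout (yymmdd-HHMMSS.mmm) is an assumption inferred from the log text, not taken from Swift documentation, and the helper names idle_seconds and progress_counts are hypothetical.

```python
import re
from datetime import datetime

# Assumed layout of lastTime/now in the coaster exception: yymmdd-HHMMSS.mmm
COASTER_TS_FORMAT = "%y%m%d-%H%M%S.%f"

def idle_seconds(last_time: str, now: str) -> float:
    """Seconds between the channel's last activity and the timeout check."""
    start = datetime.strptime(last_time, COASTER_TS_FORMAT)
    end = datetime.strptime(now, COASTER_TS_FORMAT)
    return (end - start).total_seconds()

def progress_counts(line: str) -> dict:
    """Extract 'State:count' pairs from a Swift Progress line."""
    # A label is letters and spaces followed directly by ':<digits>',
    # so the 'Progress:' prefix and the clock time are not matched.
    pairs = re.findall(r"([A-Za-z][A-Za-z ]*):(\d+)", line)
    return {label: int(count) for label, count in pairs}

gap = idle_seconds("141203-145449.325", "141203-145649.844")
print(f"{gap:.3f}")  # 120.519

line = ("Progress: Wed, 03 Dec 2014 14:59:51+0000 Submitted:651 Failed:6 "
        "Finished successfully:768 Failed but can retry:762")
counts = progress_counts(line)
print(sum(counts.values()))  # 2187
```

The gap of roughly two minutes suggests the channel timed out after about 120 seconds without traffic, and the counters sum to 2187, so no tasks are unaccounted for in the Progress line.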
> On Dec 3, 2014, at 11:04 AM, Yadu Nand Babuji <yadunand at uchicago.edu> wrote:
>
> Hi Jonathan,
>
> The issue you are seeing sounds pretty close to what David reported a
> while back.
> Could you send us a tarball of your run directory from a failed run?
>
> Could you also check whether you've set lowOverAllocation and
> highOverAllocation in your sites definition?
>
> Thanks,
> Yadu
>
> On 12/03/2014 10:50 AM, Ozik, Jonathan wrote:
>> Hi all,
>>
>> I’m trying to run a large set of simulations on Midway using Swift 0.95-RC5.
>> 768 of the 2187 tasks completed successfully and then I got the exception:
>>
>> exception @ swift-int.k, line: 530
>> Caused by: Block task failed: Connection to worker lost
>> org.globus.cog.coaster.TimeoutException: Channel timed out. lastTime=141203-145449.325, now=141203-145649.844, channel=TCPChannel [type: server, contact: 1202-5410010-000072-000000]
>> at org.globus.cog.coaster.channels.AbstractCoasterChannel.checkTimeouts(AbstractCoasterChannel.java:133)
>> at org.globus.cog.coaster.channels.AbstractCoasterChannel$1.run(AbstractCoasterChannel.java:124)
>> at java.util.TimerThread.mainLoop(Timer.java:555)
>> at java.util.TimerThread.run(Timer.java:505)
>>
>> Progress: Wed, 03 Dec 2014 14:59:51+0000 Submitted:651 Failed:6 Finished successfully:768 Failed but can retry:762
>> Progress: Wed, 03 Dec 2014 14:59:52+0000 Submitted:651 Failed:44 Finished successfully:768 Failed but can retry:724
>>
>> And the process seems to have stopped.
>>
>> What log file would be helpful for diagnosing this?
>>
>> Jonathan
>>
>>
>> _______________________________________________
>> Swift-user mailing list
>> Swift-user at ci.uchicago.edu
>> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user
-------------- next part --------------
A non-text attachment was scrubbed...
Name: swift.properties
Type: application/octet-stream
Size: 3052 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20141203/ec93015a/attachment.obj>