[Swift-user] Channel Timeout on Beagle?

Matthew Shaxted Matthew.Shaxted at som.com
Wed May 27 09:50:14 CDT 2015


Hi Swift Users:

I am running some studies on Beagle using Swift, and experiencing a strange error. The Swift scripts run great on cloud and on the Beagle login node, but seems to be timing out for some reason.

Does anyone have insight into the cause of this? Thanks for any help.

Below is the error I am getting:

Host: cluster
Directory: epsweep-run004/jobs/a/RunEP-ai2mic9m  exception @ swift-int-staging.k, line: 181
Caused by: exception @ swift-int-staging.k, line: 177
Caused by: Block task failed: Connection to worker lost
org.globus.cog.coaster.TimeoutException: Channel timed out. lastTime=150526-142313.128,
50526-142514.107, channel=TCPChannel [type: server, contact: 0526-0802460-000014-000456
at org.globus.cog.coaster.channels.AbstractCoasterChannel.checkTimeouts(AbstractCoasterChannel.java:133)
        at org.globus.cog.coaster.channels.AbstractCoasterChannel$1.run(AbstractCoasterChannel.java:124)
        at java.util.TimerThread.mainLoop(Timer.java:566)
        at java.util.TimerThread.run(Timer.java:516)

Below is my sites.xml file:

<pool handle="cluster">
    <execution provider="coaster" jobmanager="local:pbs" />
    <profile namespace="globus" key="project">CI-SES000178</profile>
    <profile namespace="globus" key="jobsPerNode">24</profile>
    <profile namespace="globus" key="lowOverAllocation">100</profile>
    <profile namespace="globus" key="highOverAllocation">100</profile>
    <profile namespace="globus" key="providerAttributes">pbs.aprun;pbs.mpp;depth=24</profile>
    <profile namespace="globus" key="maxtime">10800</profile>
    <profile namespace="globus" key="maxWalltime">01:25:00</profile>
    <profile namespace="globus" key="userHomeOverride">/lustre/beagle2/mattshax/epsweep/swifthome</profile>
    <profile namespace="globus" key="slots">20</profile>
    <profile namespace="globus" key="maxnodes">600</profile>
    <profile namespace="globus" key="nodeGranularity">1</profile>
    <profile namespace="karajan" key="jobThrottle">180</profile>
    <profile namespace="karajan" key="initialScore">10000</profile>
    <!-- <profile namespace="karajan" key="workerLoggingLevel">trace</profile> -->
    <workdirectory>/dev/shm/mattshax/swiftapp</workdirectory>
  </pool>


MATTHEW SHAXTED
SKIDMORE, OWINGS & MERRILL LLP
224 SOUTH MICHIGAN AVENUE
CHICAGO, IL 60604
T  (312) 360-4368
MATTHEW.SHAXTED at SOM.COM<mailto:MATTHEW.SHAXTED at SOM.COM>

[cid:image004.png at 01D09862.876C67F0]<http://www.som.com/>
The information contained in this communication may be confidential, is intended only for the use of the recipient(s) named above, and may be legally privileged. If the reader of this message is not the intended recipient, you are hereby notified that any dissemination, distribution, or copying of this communication, or any of its contents, is strictly prohibited and may be unlawful. If you have received this communication in error, please return it to the sender immediately and delete the original message and any copy of it from your computer system. If you have any questions concerning this message, please contact the sender.

[cid:image003.gif at 01D09861.F8DCE460]

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20150527/d9369d42/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image003.gif
Type: image/gif
Size: 566 bytes
Desc: image003.gif
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20150527/d9369d42/attachment.gif>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: image004.png
Type: image/png
Size: 5311 bytes
Desc: image004.png
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20150527/d9369d42/attachment.png>


More information about the Swift-user mailing list