[Swift-user] coasters on eureka - block task ends prematurely
Michael Wilde
wilde at mcs.anl.gov
Tue Nov 17 11:07:42 CST 2009
Im getting the following on eureka for a 1-job cat sanity test of coasters:
eur$ swift -tc.file tc -sites.file sites.xml cats.swift
Swift svn swift-r3186 cog-r2577
RunID: 20091117-1031-71txxj43
Progress:
Worker task failed: 1117-311023-000000Block task ended prematurely
Progress: Active:1
Failed to transfer wrapper log from cats-20091117-1031-71txxj43/info/q
on coast
Progress: Submitted:1
Worker task failed: 1117-311023-000001Block task ended prematurely
Progress: Active:1
Failed to transfer wrapper log from cats-20091117-1031-71txxj43/info/s
on coast
Progress: Submitted:1
Worker task failed: 1117-311023-000002Block task ended prematurely
Progress: Active:1
Failed to transfer wrapper log from cats-20091117-1031-71txxj43/info/u
on coast
Execution failed:
Exception in cat:
Arguments: []
Host: coast
Directory: cats-20091117-1031-71txxj43/jobs/u/cat-u6s7zkjj
stderr.txt:
stdout.txt:
----
Caused by:
Task failed: 1117-311023-000002Block task ended prematurely
Cleaning up...
Shutting down service at https://10.40.9.151:58810
Got channel MetaChannel: 845296226 -> null
+ Done
eur$
--
tc is:
coast cat /bin/cat null null null
sites.xml is:
<config>
<pool handle="coast">
<filesystem provider="local" />
<execution provider="coaster" jobmanager="local:cobalt"/>
<profile namespace="globus" key="slots">1</profile>
<profile namespace="globus" key="nodeGranularity">1</profile>
<profile namespace="globus" key="workersPerNode">8</profile>
<profile namespace="globus" key="maxNodes">1</profile>
<profile namespace="globus" key="project">JGI-Pilot</profile>
<profile namespace="globus" key="kernelprofile">zeptoos</profile>
<profile namespace="globus" key="maxtime">1800</profile>
<profile namespace="globus" key="alcfbgpnat">true</profile>
<profile namespace="karajan" key="jobThrottle">0.63</profile>
<profile namespace="karajan" key="initialScore">100000</profile>
<workdirectory >/home/wilde/swiftwork</workdirectory>
<scratch>/scratch</scratch>
</pool>
</config>
--
Ive also tested with maxtime 3000 as in prior examples from Mihael.
Latest logs are on Eureka in:
eur$ pwd
/home/wilde/swift/lab
eur$ ls *log
23683.cobaltlog 23684.cobaltlog cats-20091117-1101-oapf33ye.0.rlog
cats-20091117-1101-oapf33ye.log swift.log
eur$
Moving logs to logs/ as I test further.
First sign of trouble (that I can see) in the log above (*ye.log) is:
2009-11-17 11:01:48,582-0600 INFO BlockQueueProcessor Plan time: 1
2009-11-17 11:01:50,785-0600 INFO BlockQueueProcessor Updated
allocsize: 8.66447649575794
2009-11-17 11:01:50,786-0600 INFO BlockQueueProcessor allocsize =
8.66447649575794, queuedsize = 1.0660596665516473, qsz = 1
2009-11-17 11:01:50,786-0600 INFO BlockQueueProcessor Plan time: 1
2009-11-17 11:01:51,940-0600 DEBUG TaskImpl Task(type=JOB_SUBMISSION,
identity=urn:cog-1258477276784) setting status to Completed
2009-11-17 11:01:51,941-0600 INFO Block Block task status changed:
Completed
2009-11-17 11:01:51,941-0600 WARN Block Worker task failed:
1117-011117-000000Block task ended prematurely
--
- Mike
More information about the Swift-user
mailing list