[Swift-devel] [Bug 179] New: coaster request throttling and (currentWorkers <0)
bugzilla-daemon at mcs.anl.gov
Fri Feb 27 20:06:32 CST 2009
http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=179
Summary: coaster request throttling and (currentWorkers <0)
Product: Swift
Version: unspecified
Platform: PC
OS/Version: Linux
Status: NEW
Severity: normal
Priority: P2
Component: Log processing and plotting
AssignedTo: hategan at mcs.anl.gov
ReportedBy: aespinosa at cs.uchicago.edu
The number of currentWorkers becomes < 0. This affects how coasters get
throttled.
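To illustrate why a negative count matters for throttling, here is a minimal sketch. It assumes a throttle that requests workers up to the headroom left under a cap; `workers_to_request` is a hypothetical helper, not the actual WorkerManager logic, and the cap of 256 is the approximate MAX_WORKERS value quoted in this report.

```python
MAX_WORKERS = 256  # approximate cap quoted in this report


def workers_to_request(current_workers: int, max_workers: int = MAX_WORKERS) -> int:
    """Hypothetical throttle: request only as many workers as fit under the cap."""
    return max(0, max_workers - current_workers)


# With a sane count, the headroom shrinks as workers come up.
print(workers_to_request(200))

# With the negative count from the log (-32), the computed headroom is larger
# than the cap itself, so the throttle no longer limits anything.
print(workers_to_request(-32))
```

Under this assumption, any accounting error that drives the count below zero directly inflates the number of workers that may be requested.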
In an example session, the LRM can be seen creating 18-20 "make coaster"
requests (4 at start, then 16-18 after 5 minutes). With coastersPerNode set to
16, 20 requests give a 320-processor allocation. This is more than
MAX_WORKERS (~256) and more than the maximum score possible from my sites.xml
(102 max):
<profile namespace="karajan" key="initialScore">1</profile>
<profile namespace="karajan" key="jobThrottle">1</profile>
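The allocation arithmetic from the session above can be checked with a short sketch. The constants are the values quoted in this report; `allocated_processors` is a hypothetical helper, and it assumes each "make coaster" request yields exactly coastersPerNode processors.

```python
COASTERS_PER_NODE = 16  # coastersPerNode used in the session
MAX_WORKERS = 256       # approximate cap quoted in this report


def allocated_processors(requests: int, per_node: int = COASTERS_PER_NODE) -> int:
    """Processors allocated if every request yields per_node workers (assumption)."""
    return requests * per_node


# Check both ends of the observed 18-20 request range against the cap.
for requests in (18, 20):
    procs = allocated_processors(requests)
    print(requests, procs, procs > MAX_WORKERS)
```

Both ends of the observed range exceed the cap under this assumption, so the overshoot does not depend on the exact request count.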
2009-02-25 20:31:15,590-0600 INFO Worker Worker stderr: null
2009-02-25 20:31:15,590-0600 WARN WorkerManager Worker terminated: Worker[-1909333457]
2009-02-25 20:31:15,590-0600 WARN Worker Worker 335457820 status change: Completed
2009-02-25 20:31:15,590-0600 INFO Worker Worker stdout: Job You has completed.
Writing job STDOUT and STDERR to cache files.
Returning job success.
2009-02-25 20:31:15,590-0600 INFO Worker Worker stderr: null
2009-02-25 20:31:15,590-0600 WARN WorkerManager Worker terminated: Worker[335457820]
******2009-02-25 20:31:15,742-0600 INFO WorkerManager Current workers: -32****
2009-02-25 20:31:15,745-0600 INFO WorkerManager Ready: {}
2009-02-25 20:31:15,745-0600 INFO WorkerManager Busy: [Worker[-1260987422], Worker[2142641145], Worker[2053757208
2009-02-25 20:31:15,751-0600 INFO WorkerManager Requested: {640597733=Worker[640597733], -692025578=Worker[-69202
2009-02-25 20:31:15,751-0600 INFO WorkerManager Starting: [Task(type=JOB_SUBMISSION, identity=urn:1235615211813-1
2009-02-25 20:31:15,752-0600 INFO WorkerManager Ids: {1078934147=Worker[1078934147], 264613139=Worker[264613139],
2009-02-25 20:31:15,753-0600 INFO WorkerManager AllocationR: [org.globus.cog.abstraction.coaster.service.job.mana
2009-02-25 20:31:15,873-0600 INFO AbstractKarajanChannel SC-null REQ: