[Swift-devel] [Bug 179] New: coaster request throttling and (currentWorkers <0)

bugzilla-daemon at mcs.anl.gov bugzilla-daemon at mcs.anl.gov
Fri Feb 27 20:06:32 CST 2009


http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=179

           Summary: coaster request throttling and (currentWorkers <0)
           Product: Swift
           Version: unspecified
          Platform: PC
        OS/Version: Linux
            Status: NEW
          Severity: normal
          Priority: P2
         Component: Log processing and plotting
        AssignedTo: hategan at mcs.anl.gov
        ReportedBy: aespinosa at cs.uchicago.edu


The number of currentWorkers becomes < 0.  this has impact on how coasters get
throttled.

In an example session, it can be observed in the LRM  creating 18-20 "make
coaster" requests (4 at start then 16-18 after 5 mins).  with a 16
coastersPerNode you get a 320 processor allocation.  this more than
MAX_WORKERS~256 and the max score possible from my sites.xml (102 max)

   <profile namespace="karajan" key="initialScore">1</profile>
   <profile namespace="karajan" key="jobThrottle">1</profile>


2009-02-25 20:31:15,590-0600 INFO  Worker Worker stderr: null
2009-02-25 20:31:15,590-0600 WARN  WorkerManager Worker terminated:
Worker[-1909333457]
2009-02-25 20:31:15,590-0600 WARN  Worker Worker 335457820 status
change: Completed
2009-02-25 20:31:15,590-0600 INFO  Worker Worker stdout: Job You has completed.
Writing job STDOUT and STDERR to cache files.
Returning job success.

2009-02-25 20:31:15,590-0600 INFO  Worker Worker stderr: null
2009-02-25 20:31:15,590-0600 WARN  WorkerManager Worker terminated:
Worker[335457820]
******2009-02-25 20:31:15,742-0600 INFO  WorkerManager Current workers: -32****
2009-02-25 20:31:15,745-0600 INFO  WorkerManager Ready: {}
2009-02-25 20:31:15,745-0600 INFO  WorkerManager Busy:
[Worker[-1260987422], Worker[2142641145], Worker[2053757208
2009-02-25 20:31:15,751-0600 INFO  WorkerManager Requested:
{640597733=Worker[640597733], -692025578=Worker[-69202
2009-02-25 20:31:15,751-0600 INFO  WorkerManager Starting:
[Task(type=JOB_SUBMISSION, identity=urn:1235615211813-1
2009-02-25 20:31:15,752-0600 INFO  WorkerManager Ids:
{1078934147=Worker[1078934147], 264613139=Worker[264613139],
2009-02-25 20:31:15,753-0600 INFO  WorkerManager AllocationR:
[org.globus.cog.abstraction.coaster.service.job.mana
2009-02-25 20:31:15,873-0600 INFO  AbstractKarajanChannel SC-null REQ:


-- 
Configure bugmail: http://bugzilla.mcs.anl.gov/swift/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are the assignee for the bug, or are watching the assignee.



More information about the Swift-devel mailing list