[Swift-devel] more active processes than requested cores

Ben Clifford benc at hawaga.org.uk
Tue Jun 16 17:33:54 CDT 2009


Can you compare with the post-processed logs (especially info/worker logs, 
not execution layer stats), not the runtime counter - the runtime counter 
is necessarily reliant on the realtime delivery of status changes; the 
post-processed wrapper logs are not.

So maybe this is too many jobs running at once; maybe this is delayed 
statistics updates (as has been discussed here)

You need to turn on the wrapper log always transfer option in the config 
file to get all the wrapper logs back if you don't already have that.

On Tue, 16 Jun 2009, Allan Espinosa wrote:

> By the throttling parameters below, i do expect to have a thousand
> jobs active at a time.  But shouldn't the coaster request larger
> blocks to accommodate the 277 active jobs?
> 
> sge snapshot:
> ACTIVE JOBS--------------------------
> JOBID     JOBNAME    USERNAME      STATE   CORE  REMAINING  STARTTIME
> ================================================================================
> 779616    data       tg802895      Running 16     00:36:01  Tue Jun 16 15:59:41
> 779723    data       tg802895      Running 16     01:44:01  Tue Jun 16 17:07:41
> 779724    data       tg802895      Running 16     01:44:01  Tue Jun 16 17:07:41
> 779727    data       tg802895      Running 16     01:45:58  Tue Jun 16 17:09:38
> 
> 
> swift session snipper
> Progress:  Selecting site:38  Submitted:707  Active:278  Finished
> successfully:1861
> Progress:  Selecting site:38  Submitted:707  Active:277  Checking
> status:1  Finished successfully:1861
> 
> 
> sites.xml
> <config>
>   <pool handle="RANGER" >
>     <gridftp  url="gsiftp://gridftp.ranger.tacc.teragrid.org" />
>     <execution  provider="coaster"
> url="gatekeeper.ranger.tacc.teragrid.org" jobManager="gt2:gt2:SGE"/>
>     <profile namespace="globus" key="project">TG-CCR080022N</profile>
>     <workdirectory >/work/01035/tg802895/blast-runs</workdirectory>
>     <profile namespace="globus" key="workersPerNode">16</profile>
>     <profile namespace="globus" key="queue">development</profile>
>     <profile namespace="globus" key="slots">4</profile>
>     <profile namespace="globus" key="maxwalltime">00:30:00</profile>
>     <profile namespace="globus" key="nodeGranularity">2</profile>
>     <profile namespace="karajan" key="initialScore">2</profile>
>     <profile namespace="karajan" key="jobThrottle">10</profile>
>   </pool>
> </config>
> 
> i'll send the swift and coaster logs once the run finishes.
> 
> -Allan
> 
> 
> 



More information about the Swift-devel mailing list