[Swift-devel] problem with coasters on pbs provider (on pads)

Glen Hocky hockyg at uchicago.edu
Thu Aug 5 16:14:25 CDT 2010


I'm having a problem running on PADS. It seems that when I submit jobs with
workerspernode=8, the queuing system doesn't pick up on the fact that each
job submitted by swift should have ppn=8 (specifically, that is missing from
the qsub command registered by pbs)

when I do a qsub -f on my running jobs I get

> submit_args = -A CI-CCR000013 -l nodes=1,walltime=02:00:00,

        size=1 /home/hockyg/.globus/scripts/PBS4482066898055181239.submit


so there's no ppn=8 and i think it should also say size=8.

the result is that I get 56 jobs running on one node

BTW, this is with the latest version of cog (as well as earlier ones) and a
version of swift that's working for me on bgp

> Swift svn swift-r3432 (swift modified locally) cog-r2829




my sites entry is

>   <pool handle="pbs-coaster-test">

    <execution provider="coaster" url="none" jobManager="local:pbs"/>

    <profile namespace="globus" key="maxwalltime">01:00:00</profile>

    <profile namespace="globus" key="workersPerNode">8</profile>

    <profile namespace="globus" key="internalHostname">172.5.86.5</profile>

    <profile namespace="globus" key="slots">16</profile>

    <profile namespace="globus" key="nodeGranularity">1</profile>

    <profile namespace="globus" key="maxNodes">1</profile>

    <profile namespace="karajan" key="jobThrottle">1.27</profile>

    <profile namespace="karajan" key="initialScore">10000</profile>

    <profile namespace="globus" key="project">CI-CCR000013</profile>

    <gridftp  url="local://localhost" />

    <scratch>/tmp</scratch>


>  <workdirectory>/home/hockyg/reichman/glassy_dynamics/code/swift/run/real</workdirectory>

  </pool>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20100805/a60b89d0/attachment.html>


More information about the Swift-devel mailing list