[Swift-devel] problem with coasters on pbs provider (on pads)
Glen Hocky
hockyg at uchicago.edu
Thu Aug 5 16:14:25 CDT 2010
I'm having a problem running on PADS. It seems that when I submit jobs with
workerspernode=8, the queuing system doesn't pick up on the fact that each
job submitted by swift should have ppn=8 (specifically, that is missing from
the qsub command registered by pbs)
when I do a qsub -f on my running jobs I get
> submit_args = -A CI-CCR000013 -l nodes=1,walltime=02:00:00,
size=1 /home/hockyg/.globus/scripts/PBS4482066898055181239.submit
so there's no ppn=8 and i think it should also say size=8.
the result is that I get 56 jobs running on one node
BTW, this is with the latest version of cog (as well as earlier ones) and a
version of swift that's working for me on bgp
> Swift svn swift-r3432 (swift modified locally) cog-r2829
my sites entry is
> <pool handle="pbs-coaster-test">
<execution provider="coaster" url="none" jobManager="local:pbs"/>
<profile namespace="globus" key="maxwalltime">01:00:00</profile>
<profile namespace="globus" key="workersPerNode">8</profile>
<profile namespace="globus" key="internalHostname">172.5.86.5</profile>
<profile namespace="globus" key="slots">16</profile>
<profile namespace="globus" key="nodeGranularity">1</profile>
<profile namespace="globus" key="maxNodes">1</profile>
<profile namespace="karajan" key="jobThrottle">1.27</profile>
<profile namespace="karajan" key="initialScore">10000</profile>
<profile namespace="globus" key="project">CI-CCR000013</profile>
<gridftp url="local://localhost" />
<scratch>/tmp</scratch>
> <workdirectory>/home/hockyg/reichman/glassy_dynamics/code/swift/run/real</workdirectory>
</pool>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20100805/a60b89d0/attachment.html>
More information about the Swift-devel
mailing list