[Swift-devel] Swift 0.92(.1) on Fusion
Justin M Wozniak
wozniak at mcs.anl.gov
Wed Apr 13 09:11:11 CDT 2011
Thanks for digging into this. Can you try this again from trunk? A Parvis
developer and I were able to run successfully there. (The queues are very
long.)
One thing I did when working on Fusion was cut out the generated submit
files and qsub them myself to verify that the #PBS settings actually
worked for me. You may want to try that too.
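
A minimal sketch of that manual check, under some assumptions: the
generated submit file is assumed to sit next to the .submit.stdout and
.submit.stderr files under ~/.globus/scripts, the helper name
fix_walltime is hypothetical, and the 00:10:00 walltime is only an
illustrative nonzero value. It copies a submit file, replaces the
walltime line, and leaves the original untouched.

```shell
#!/bin/bash
# Hypothetical helper (not part of Swift): copy a generated PBS submit file
# and replace its "#PBS -l walltime=..." line with a chosen value.
# Arguments: source file, destination file, walltime (default 00:10:00).
fix_walltime() {
    local src="$1" dst="$2" walltime="${3:-00:10:00}"
    # Rewrite only the walltime directive; every other line is copied as-is.
    sed "s/^#PBS -l walltime=.*/#PBS -l walltime=${walltime}/" "$src" > "$dst"
}

# Example use (the submit file path is a placeholder, not a real file name):
#   fix_walltime ~/.globus/scripts/PBS<id>.submit ./manual.submit 00:10:00
#   qsub ./manual.submit      # submit the edited copy by hand
#   qstat -u "$USER"          # check whether it leaves the Q state
```

If the hand-submitted copy runs while the generated one stays queued, that
would point at the generated #PBS settings (for example the zero walltime)
rather than at the queue itself.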
Justin
On Wed, 13 Apr 2011, David Kelly wrote:
> Hello,
>
> Recently when I try to run Swift on Fusion, my job never seems to
> execute. I have emailed Fusion support about this (ticket #70175) but
> thought it may also be useful to send to the list. I am trying to run
> the catsn.swift script for testing. I can see it in qstat. The
> sites.xml is based on the config listed in the Fusion cookbook, with a
> few small changes. I added an internalHostname entry and set it to the
> IP address attached to the Infiniband device. I also lowered the
> maxtime from 1000 to 10. The Fusion cookbook says "Set MAXTIME as in
> qsub walltime. This is on a per-allocation basis and should be at
> least 20% larger than your longest task". I am not sure how maxtime
> relates to walltime exactly, but the walltime value in the PBS file
> gets set to 00:00:00. I am not sure if this matters or not.
>
> I have also attached a compressed log file and the actual swift script
> I'm trying to run.
>
> Thanks,
> David
>
> $ swift -version
> Swift svn swift-r4076 cog-r3049
>
> qstat:
> 541724.fmgt2.l davidk shared Block-0412 -- 1 1 -- 00:00 Q --
>
> sites.xml:
> <config>
> <pool handle="fusion">
> <execution jobmanager="local:pbs" provider="coaster" url="none"/>
> <filesystem provider="local" url="none" />
> <profile namespace="globus" key="internalHostname">192.168.71.81</profile>
> <profile namespace="globus" key="maxtime">10</profile>
> <profile namespace="globus" key="workersPerNode">1</profile>
> <profile namespace="globus" key="slots">1</profile>
> <profile namespace="globus" key="nodeGranularity">1</profile>
> <profile namespace="globus" key="maxNodes">2</profile>
> <profile namespace="globus" key="queue">shared</profile>
> <profile namespace="karajan" key="jobThrottle">5.99</profile>
> <profile namespace="karajan" key="initialScore">10000</profile>
> <workdirectory>/home/davidk/swiftwork</workdirectory>
> </pool>
> </config>
>
> PBS submission file:
> #PBS -S /bin/bash
> #PBS -N Block-0412-211041-000000
> #PBS -m n
> #PBS -l nodes=1
> #PBS -l walltime=00:00:00
> #PBS -q shared
> #PBS -o /homes/davidk/.globus/scripts/PBS1298937999826083605.submit.stdout
> #PBS -e /homes/davidk/.globus/scripts/PBS1298937999826083605.submit.stderr
> WORKER_LOGGING_LEVEL=NONE
> #PBS -v WORKER_LOGGING_LEVEL
> cd / && /usr/bin/perl
> /homes/davidk/.globus/coasters/cscript1716491648595514240.pl
> http://192.168.71.81:46584 0412-211041-000000 NOLOGGING
> /bin/echo $? >/homes/davidk/.globus/scripts/PBS1298937999826083605.submit.exitcode
--
Justin M Wozniak