<br><br><div class="gmail_quote">On Tue, Oct 11, 2011 at 11:49 AM, David Kelly <span dir="ltr"><<a href="mailto:davidk@ci.uchicago.edu">davidk@ci.uchicago.edu</a>></span> wrote:<br><blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">
<br>
That could be it.. maybe a cleanup script is not getting the right parameters and failing. Do you happen to have a copy of the coaster log?</blockquote><div><br>just put it in /home/skenny/swift_logs<br><br> </div><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">
Maybe there will be some clues in there.<br>
<div class="im"><br>
----- Original Message -----<br>
> From: "Sarah Kenny" <<a href="mailto:skenny@uchicago.edu">skenny@uchicago.edu</a>><br>
</div><div class="im">> To: "David Kelly" <<a href="mailto:davidk@ci.uchicago.edu">davidk@ci.uchicago.edu</a>><br>
> Cc: "Swift Devel" <<a href="mailto:swift-devel@ci.uchicago.edu">swift-devel@ci.uchicago.edu</a>>, "Swift User" <<a href="mailto:swift-user@ci.uchicago.edu">swift-user@ci.uchicago.edu</a>>, "Justin M Wozniak"<br>
> <<a href="mailto:wozniak@mcs.anl.gov">wozniak@mcs.anl.gov</a>><br>
</div>> Sent: Tuesday, October 11, 2011 1:32:37 PM<br>
<div class="im">> Subject: Re: [Swift-user] gram on ranger<br>
</div><div class="im">> so, this workflow completes all the jobs but then just hangs<br>
> indefinitely at the end...maybe a stray cleanup job?<br>
><br>
> log is here:<br>
><br>
> /home/skenny/swift_logs/corr-20111010-2104-fl5yngd9.log<br>
><br>
> just tweaked the sites file a bit from what david sent me:<br>
><br>
> <config><br>
> <pool handle="RANGER"><br>
> <execution provider="coaster" jobManager="gt2:SGE" url="<a href="http://gatekeeper.ranger.tacc.teragrid.org" target="_blank">gatekeeper.ranger.tacc.teragrid.org</a>"/><br>
</div>> <filesystem provider="gsiftp" url="gsiftp://<a href="http://gridftp.ranger.tacc.teragrid.org" target="_blank">gridftp.ranger.tacc.teragrid.org</a>"/><br>
<div><div></div><div class="h5">> <profile namespace="globus" key="maxtime">28800</profile><br>
> <profile namespace="globus" key="maxWallTime">00:15:00</profile><br>
> <profile namespace="globus" key="jobsPerNode">1</profile><br>
> <profile namespace="globus" key="nodeGranularity">64</profile><br>
> <profile namespace="globus" key="maxNodes">256</profile><br>
> <profile namespace="globus" key="queue">normal</profile><br>
> <profile namespace="karajan" key="jobThrottle">1</profile><br>
> <profile namespace="globus" key="project">TG-DBS080004N</profile><br>
> <profile namespace="globus" key="pe">16way</profile><br>
> <profile namespace="karajan" key="initialScore">10000</profile><br>
> <workdirectory>/work/00043/tg457040/sidgrid_out/skenny</workdirectory><br>
> </pool><br>
> </config><br>
><br>
><br>
><br>
> On Mon, Oct 10, 2011 at 3:43 PM, Sarah Kenny < <a href="mailto:skenny@uchicago.edu">skenny@uchicago.edu</a> ><br>
> wrote:<br>
><br>
><br>
> ok, thanks, got in the queue now...also, realized my last run may have<br>
> been using the old swift. apparently i had SWIFT_HOME set in my env<br>
> and that overrides the newer swift i had set in my PATH.<br>
><br>
> ~sk<br>
><br>
><br>
><br>
> On Mon, Oct 10, 2011 at 12:28 PM, David Kelly < <a href="mailto:davidk@ci.uchicago.edu">davidk@ci.uchicago.edu</a><br>
> > wrote:<br>
><br>
><br>
><br>
><br>
><br>
> Sarah,<br>
><br>
> Can you give this another try with the latest 0.93? I made some<br>
> changes to the coaster and sge providers and was able to get it<br>
> working with a simple catns script. Here is the configuration file I<br>
> was using:<br>
><br>
> <config><br>
> <pool handle="ranger"><br>
> <execution provider="coaster" jobManager="gt2:SGE" url="<a href="http://gatekeeper.ranger.tacc.teragrid.org" target="_blank">gatekeeper.ranger.tacc.teragrid.org</a>"/><br>
><br>
</div></div>> <filesystem provider="gsiftp" url="gsiftp://<a href="http://gridftp.ranger.tacc.teragrid.org" target="_blank">gridftp.ranger.tacc.teragrid.org</a>"/><br>
<div><div></div><div class="h5">> <profile namespace="globus" key="maxtime">3600</profile><br>
> <profile namespace="globus" key="maxWallTime">00:00:03</profile><br>
> <profile namespace="globus" key="jobsPerNode">1</profile><br>
> <profile namespace="globus" key="nodeGranularity">16</profile><br>
> <profile namespace="globus" key="maxNodes">16</profile><br>
> <profile namespace="globus" key="queue">development</profile><br>
> <profile namespace="karajan" key="jobThrottle">0.9</profile><br>
><br>
> <profile namespace="globus" key="project">TG-DBS080004N</profile><br>
><br>
> <profile namespace="globus" key="pe">16way</profile><br>
> <workdirectory>/share/home/01503/davidkel/swiftwork</workdirectory><br>
> </pool><br>
> </config><br>
><br>
> Thanks,<br>
><br>
> David<br>
><br>
> ----- Original Message -----<br>
><br>
> > From: "Sarah Kenny" < <a href="mailto:skenny@uchicago.edu">skenny@uchicago.edu</a> ><br>
> > To: "Justin M Wozniak" < <a href="mailto:wozniak@mcs.anl.gov">wozniak@mcs.anl.gov</a> ><br>
> > Cc: "Swift Devel" < <a href="mailto:swift-devel@ci.uchicago.edu">swift-devel@ci.uchicago.edu</a> >, "Swift User" <<br>
> > <a href="mailto:swift-user@ci.uchicago.edu">swift-user@ci.uchicago.edu</a> ><br>
><br>
><br>
><br>
> > Sent: Friday, October 7, 2011 3:13:57 PM<br>
> > Subject: Re: [Swift-user] gram on ranger<br>
> > /home/skenny/swift_logs/dummy-20111005-0126-6575n7x5.log<br>
> ><br>
> > on ci<br>
> ><br>
> ><br>
> > On Fri, Oct 7, 2011 at 8:16 AM, Justin M Wozniak <<br>
> > <a href="mailto:wozniak@mcs.anl.gov">wozniak@mcs.anl.gov</a><br>
> > > wrote:<br>
> ><br>
> ><br>
> ><br>
> > Can I take a look at the log?<br>
> ><br>
> ><br>
> ><br>
> ><br>
> > On Thu, 6 Oct 2011, Sarah Kenny wrote:<br>
> ><br>
> ><br>
> ><br>
> > hey all, i'm trying to submit to gram on ranger using the latest swift<br>
> > (built from trunk). it fails like so:<br>
> ><br>
> > Cannot submit job<br>
> > Caused by: org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Cannot submit job<br>
> > Caused by: org.globus.gram.GramException: Parameter not supported<br>
> > Cannot submit job<br>
> ><br>
> > the gram log was saying first that 'jobsPerNode' is not supported so i<br>
> > changed it to workersPerNode and then it was saying 'maxnodes' is not<br>
> > supported. here's my sites file:<br>
> ><br>
> > <config><br>
> > <pool handle="RANGER"><br>
> > <profile namespace="karajan" key="initialScore">10000</profile><br>
> > <profile namespace="karajan" key="jobThrottle">1</profile><br>
> > <profile namespace="globus" key="maxWallTime">00:15:00</profile><br>
> > <profile namespace="globus" key="maxTime">86400</profile><br>
> > <profile namespace="globus" key="slots">1</profile><br>
> > <profile namespace="globus" key="maxNodes">256</profile><br>
> > <profile namespace="globus" key="pe">16way</profile><br>
> > <profile namespace="globus" key="workersPerNode">1</profile><br>
> > <profile namespace="globus" key="nodeGranularity">64</profile><br>
> > <profile namespace="globus" key="queue">normal</profile><br>
> > <profile namespace="globus" key="project">TG-DBS080004N</profile><br>
> > <filesystem provider="gsiftp" url="gsiftp://gridftp.ranger.tacc.teragrid.org"/><br>
> > <execution provider="coaster" jobManager="gt2:gt2:SGE" url="gatekeeper.ranger.tacc.teragrid.org"/><br>
> > <execution provider="gt2" jobManager="SGE" url="gatekeeper.ranger.tacc.teragrid.org"/><br>
> > <workdirectory>/work/00043/tg457040</workdirectory><br>
><br>
> > </pool><br>
> > </config><br>
> ><br>
> > thoughts? ideas?<br>
> ><br>
> > --<br>
> > Justin M Wozniak<br>
> ><br>
> ><br>
> ><br>
> > --<br>
> > Sarah Kenny<br>
> > Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III<br>
> > University of California Irvine, Dept. of Neurology ~ <a href="tel:773-818-8300" value="+17738188300">773-818-8300</a><br>
> ><br>
> ><br>
> > _______________________________________________<br>
> > Swift-user mailing list<br>
> > <a href="mailto:Swift-user@ci.uchicago.edu">Swift-user@ci.uchicago.edu</a><br>
> > <a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user" target="_blank">https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user</a><br>
</div></div></blockquote></div><br><br clear="all"><br>-- <br>Sarah Kenny<br>Programmer ~ Brain Circuits Laboratory ~ Rm 2224 Bio Sci III<br>University of California Irvine, Dept. of Neurology ~ 773-818-8300<br><br>
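For comparing the sites files quoted in this thread, it can help to list each pool's execution providers and profile keys side by side. The following is only a sketch under stated assumptions: the function name summarize_sites is hypothetical, the embedded sample is a trimmed copy of the RANGER pool quoted above, and it relies on Python's standard xml.etree.ElementTree rather than on anything shipped with Swift.

```python
import xml.etree.ElementTree as ET

# Trimmed copy of the RANGER pool from the thread, used only as sample input.
SITES_XML = """
<config>
  <pool handle="RANGER">
    <execution provider="coaster" jobManager="gt2:SGE" url="gatekeeper.ranger.tacc.teragrid.org"/>
    <filesystem provider="gsiftp" url="gsiftp://gridftp.ranger.tacc.teragrid.org"/>
    <profile namespace="globus" key="maxtime">28800</profile>
    <profile namespace="globus" key="jobsPerNode">1</profile>
    <profile namespace="karajan" key="jobThrottle">1</profile>
    <workdirectory>/work/00043/tg457040/sidgrid_out/skenny</workdirectory>
  </pool>
</config>
"""


def summarize_sites(xml_text):
    """Hypothetical helper: map each pool handle to its execution entries and profiles."""
    root = ET.fromstring(xml_text)
    summary = {}
    for pool in root.iter("pool"):
        # A pool may carry more than one <execution> element; keep them all.
        executions = [
            (e.get("provider"), e.get("jobManager"))
            for e in pool.findall("execution")
        ]
        # Key profiles by (namespace, key) so that case differences stay visible.
        profiles = {
            (p.get("namespace"), p.get("key")): (p.text or "").strip()
            for p in pool.findall("profile")
        }
        summary[pool.get("handle")] = {"execution": executions, "profiles": profiles}
    return summary


if __name__ == "__main__":
    for handle, info in summarize_sites(SITES_XML).items():
        print(handle)
        for provider, manager in info["execution"]:
            print("  execution:", provider, manager)
        for (namespace, key), value in sorted(info["profiles"].items()):
            print(f"  {namespace}.{key} = {value}")
```

Running the same function over two sites files and comparing the resulting dictionaries shows at a glance which profile keys or execution entries differ. The script does not judge which keys a given provider accepts.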