Hi,<br><br>There is one site running the application successfully with jobmanager-condor:<br><br>site: GLOW<br>gatekeeper: <a href="http://cmsgrid01.hep.wisc.edu" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
cmsgrid01.hep.wisc.edu</a><br>app_dir: /afs/hep.wisc.edu/osg/app
<br>data_dir: /afs/hep.wisc.edu/osg/data<br>condor_dir: /condor/bin<br>R_dir: /afs/hep.wisc.edu/osg/app/R-2.5.1/bin/R<br><br>Maybe it uses some special configuration or arguments.<br><br>Jing<br><br><div><span class="gmail_quote">
On 8/20/07,
<b class="gmail_sendername">Jing Tie</b> <<a href="mailto:tiejing@gmail.com" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">tiejing@gmail.com</a>> wrote:</span><blockquote class="gmail_quote" style="border-left: 1px solid rgb(204, 204, 204); margin: 0pt 0pt 0pt 0.8ex; padding-left: 1ex;">
Right, the problem is with Condor. After replacing jobmanager-condor<br>with jobmanager, the job finished successfully.<br><br>Thanks,<br>Jing<br><br>On 8/20/07, Mihael Hategan <<a href="mailto:hategan@mcs.anl.gov" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
hategan@mcs.anl.gov</a>> wrote:<br>> Right. The condor job manager has a bug. It does not properly quote<br>> arguments. So you'll see strange things like this if you use it.<br>><br>> Mihael<br>><br>
> On Mon, 2007-08-20 at 00:43 -0500, Jing Tie wrote:<br>> > Sure.<br>> ><br>> > On 8/20/07, Mihael Hategan <<a href="mailto:hategan@mcs.anl.gov" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
hategan@mcs.anl.gov</a>> wrote:<br>> > > It puzzles me. Can you attach that file?
<br>> > ><br>> > > On Sun, 2007-08-19 at 21:37 -0500, Jing Tie wrote:<br>> > > > in $SWIFT_HOME/etc/swift.properties<br>> > > ><br>> > > ><br>> > > > Jing
<br>
> > > ><br>> > > > On 8/19/07, Mihael Hategan <<a href="mailto:hategan@mcs.anl.gov" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">hategan@mcs.anl.gov</a>> wrote:<br>
> > > > > On Sat, 2007-08-18 at 18:24 -0500, Jing Tie wrote:
<br>> > > > > > Hi,<br>> > > > > ><br>> > > > > > I am working on the SID application now. Job cwtsmall is the script<br>> > > > > > wavelet.sh on the AGLT2 site. In wavelet.sh, R runs runWaveletsAvg.R<br>> > > > > > on the input data 101_FB-epochs.Rdata and should output 28 files,<br>> > > > > > 101-FBchannel1_cwt-avgResults.Rdata through<br>> > > > > > 101-FBchannel28_cwt-avgResults.Rdata.<br>> > > > > ><br>> > > > > > But when I ran the Swift client with kickstart.enabled = false,<br>> > > > >
<br>> > > > > Where did you set this?<br>> > > > ><br>> > > > > Mihael<br>> > > > ><br>> > > > > > it failed<br>> > > > > > with exit code 1024, and stderr.txt said: Kickstart<br>> > > > > > executable (101-FBchannel18_cwt-avgResults.Rdata) not found. Details<br>> > > > > > below:<br>> > > > > ><br>> > > > > > site: AGLT2
<br>> > > > > > gatekeeper: <a href="http://gate01.aglt2.org" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">gate01.aglt2.org</a><br>> > > > > > app_dir: /atlas/data08/OSG/APP/SIDGrid
<br>> > > > > > data_dir: /atlas/data08/OSG/DATA
<br>> > > > > > condor_dir: /opt/condor/bin<br>> > > > > > R_dir: /atlas/data08/OSG/APP/R-2.5.1/bin/R<br>> > > > > ><br>> > > > > > output:<br>> > > > > > Application exception: Job cwtsmall failed with an exit code of 1024
<br>> > > > > > sys:throw @ vdl-int.k, line: 109<br>> > > > > > vdl:checkexitcode @ vdl-int.k, line: 370
<br>> > > > > > vdl:execute2 @ execute-default.k, line: 22<br>> > > > > > vdl:execute @ sid-wf1.kml, line: 20<br>> > > > > > wavelettransf @ sid-wf1.kml, line: 362<br>> > > > > > batchtrials @
sid-wf1.kml, line: 402<br>> > > > > > vdl:mains @ sid-wf1.kml, line: 399<br>> > > > > > cwtsmall failed<br>> > > > > > Provenance graph saved in sid-wf1-8cnxmo0qetg10.dot
<br>> > > > > > The following errors have occurred:<br>> > > > > > 1. Application "cwtsmall" failed (Job cwtsmall failed with an exit code of 1024)<br>> > > > > > Arguments: "scripts/runWaveletsAvg.R, 101, FB"
<br>> > > > > > Host: NWICG_NotreDame<br>> > > > > > Directory: sid-wf1-8cnxmo0qetg10/cwtsmall-zeb72rfi<br>> > > > > > STDERR: Kickstart executable
<br>> > > > > > (101-FBchannel18_cwt-avgResults.Rdata) not found<br>> > > > > > STDOUT:<br>> > > > > > Errors detected. Cleanup not done.<br>> > > > > > Execution completed with errors
<br>> > > > > > sys:throw @ vdl.k, line: 140<br>> > > > > > vdl:mains @ sid-wf1.kml, line: 399
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.fail(FlowNode.java:413)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.fail(FlowNode.java:417)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.GenerateErrorNode.post(GenerateErrorNode.java:28)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.childCompleted
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.Sequential.notificationEvent(Sequential.java:33)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:334)
<br>> > > > > > at org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:123)
<br>> > > > > > at org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:97)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.fireNotificationEvent(FlowNode.java:172)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:298)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.executeChildren(AbstractFunction.java:37)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:239)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:280)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.controlEvent(FlowNode.java:392)
<br>> > > > > > at org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:331)
<br>> > > > > > at org.globus.cog.karajan.workflow.FlowElementWrapper.event(FlowElementWrapper.java:227)
<br>> > > > > > at org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:123)
<br>> > > > > > at org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:97)
<br>> > > > > > at org.globus.cog.karajan.workflow.events.EventWorker.run(EventWorker.java:69)
<br>> > > > > ><br>> > > > > > I found that about 8 sites in OSG have this problem.
<br>> > > > > ><br>> > > > > > Many thanks,<br>> > > > > > Jing<br>> > > > > > _______________________________________________<br>> > > > > > Swift-user mailing list
<br>> > > > > > <a href="mailto:Swift-user@ci.uchicago.edu" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">Swift-user@ci.uchicago.edu</a><br>> > > > > > <a href="http://mail.ci.uchicago.edu/mailman/listinfo/swift-user" target="_blank" onclick="return top.js.OpenExtLink(window,event,this)">
http://mail.ci.uchicago.edu/mailman/listinfo/swift-user
</a><br>> > > > > ><br>> > > > ><br>> > > > ><br>> > > ><br>> > ><br>> > ><br>><br>><br></blockquote></div><br>