[Swift-devel] could not start coaster on Ranger

Zhao Zhang zhaozhang at uchicago.edu
Wed May 20 11:51:18 CDT 2009


Hi,

I am trying to run a sanity test with the coaster provider on Ranger.
Here is my sites.xml definition:
[zzhang at communicado sites]$ cat coaster_test/tgranger-sge-gram2.xml
<config>
  <pool handle="tgtacc" >
    <gridftp  url="gsiftp://gridftp.ranger.tacc.teragrid.org" />
    <execution  provider="coaster" url="gatekeeper.ranger.tacc.teragrid.org" jobManager="gt2:gt2:SGE"/>
    <!-- <profile namespace="globus" key="project">TG-DBS080005N</profile> -->
    <profile namespace="globus" key="project">TG-MCB080099N</profile>
    <workdirectory >/work/00946/zzhang/work</workdirectory>
    <profile namespace="env" key="SWIFT_JOBDIR_PATH">/tmp/zzhang/jobdir</profile>
    <profile namespace="globus" key="coastersPerNode">16</profile>
    <profile namespace="globus" key="queue">development</profile>
    <profile namespace="globus" key="coasterWorkerMaxwalltime">00:40:00</profile>
    <profile namespace="globus" key="maxwalltime">31</profile>
    <profile namespace="karajan" key="initialScore">50</profile>
    <profile namespace="karajan" key="jobThrottle">10</profile>
  </pool>
</config>
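
As a quick check that the file itself is well-formed and that the settings are what I intended, here is a minimal Python sketch that parses the pool definition and lists its profiles. The helper name `summarize_pool` is only an illustration and is not part of Swift or CoG; the XML is embedded as a string so that the snippet is self-contained. It only checks XML structure, not whether Swift or GRAM accepts the values.

```python
import xml.etree.ElementTree as ET

# Copy of the pool definition above, embedded so the sketch runs on its own.
SITES_XML = """\
<config>
  <pool handle="tgtacc">
    <gridftp url="gsiftp://gridftp.ranger.tacc.teragrid.org" />
    <execution provider="coaster" url="gatekeeper.ranger.tacc.teragrid.org" jobManager="gt2:gt2:SGE"/>
    <!-- <profile namespace="globus" key="project">TG-DBS080005N</profile> -->
    <profile namespace="globus" key="project">TG-MCB080099N</profile>
    <workdirectory>/work/00946/zzhang/work</workdirectory>
    <profile namespace="env" key="SWIFT_JOBDIR_PATH">/tmp/zzhang/jobdir</profile>
    <profile namespace="globus" key="coastersPerNode">16</profile>
    <profile namespace="globus" key="queue">development</profile>
    <profile namespace="globus" key="coasterWorkerMaxwalltime">00:40:00</profile>
    <profile namespace="globus" key="maxwalltime">31</profile>
    <profile namespace="karajan" key="initialScore">50</profile>
    <profile namespace="karajan" key="jobThrottle">10</profile>
  </pool>
</config>
"""


def summarize_pool(xml_text):
    """Hypothetical helper: parse a sites.xml string and summarize its first pool."""
    root = ET.fromstring(xml_text)  # raises ParseError if the XML is malformed
    pool = root.find("pool")
    if pool is None:
        raise ValueError("no <pool> element found")
    execution = pool.find("execution")
    # Commented-out profiles are dropped by the parser, so only active ones appear.
    profiles = {
        (p.get("namespace"), p.get("key")): (p.text or "").strip()
        for p in pool.findall("profile")
    }
    return {
        "handle": pool.get("handle"),
        "provider": execution.get("provider") if execution is not None else None,
        "jobManager": execution.get("jobManager") if execution is not None else None,
        "profiles": profiles,
    }


summary = summarize_pool(SITES_XML)
print(summary["handle"], summary["provider"], summary["jobManager"])
for (namespace, key), value in sorted(summary["profiles"].items()):
    print(f"{namespace}:{key} = {value}")
```

To check the real file instead of the embedded copy, the same function can be given the contents of coaster_test/tgranger-sge-gram2.xml read from disk.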


The run then failed with the errors below. Is there a configuration
error in my sites.xml? If you need any other information, please let
me know.

zhao

[zzhang at communicado sites]$ ./run-site coaster_test/tgranger-sge-gram2.xml
testing site configuration: coaster_test/tgranger-sge-gram2.xml
Removing files from previous runs
Running test 061-cattwo at Wed May 20 11:47:23 CDT 2009
Swift 0.9rc2 swift-r2860 cog-r2388

RunID: 20090520-1147-yov15zb6
Progress:  uninitialized:1
Progress:  Stage in:1
Progress:  Submitting:1
Progress:  Submitted:1
Progress:  Active:1
Failed to transfer wrapper log from 061-cattwo-20090520-1147-yov15zb6/info/k on tgtacc
Progress:  Submitted:1
Progress:  Active:1
Failed to transfer wrapper log from 061-cattwo-20090520-1147-yov15zb6/info/m on tgtacc
Progress:  Stage in:1
Progress:  Active:1
Failed to transfer wrapper log from 061-cattwo-20090520-1147-yov15zb6/info/o on tgtacc
Progress:  Failed:1
Execution failed:
        Exception in cat:
Arguments: [061-cattwo.1.in, 061-cattwo.2.in]
Host: tgtacc
Directory: 061-cattwo-20090520-1147-yov15zb6/jobs/o/cat-oa9l83bj
stderr.txt:

stdout.txt:

----

Caused by:
        Failed to start worker:
null
null
org.globus.gram.GramException: The job manager detected an invalid script response
        at org.globus.cog.abstraction.impl.execution.gt2.JobSubmissionTaskHandler.statusChanged(JobSubmissionTaskHandler.java:530)
        at org.globus.gram.GramJob.setStatus(GramJob.java:184)
        at org.globus.gram.GramCallbackHandler.run(CallbackHandler.java:176)
        at java.lang.Thread.run(Thread.java:619)

Cleaning up...
Shutting down service at https://129.114.50.163:60748
Got channel MetaChannel: 4609608 -> GSSSChannel-null(1)
- Done
SWIFT RETURN CODE NON-ZERO - test 061-cattwo



