[Swift-devel] Re: coaster issue
Zhao Zhang
zhaozhang at uchicago.edu
Fri May 1 15:18:55 CDT 2009
Mihael Hategan wrote:
> On Fri, 2009-05-01 at 14:57 -0500, Zhao Zhang wrote:
>
>> Mihael Hategan wrote:
>>
>>> On Fri, 2009-05-01 at 14:21 -0500, Zhao Zhang wrote:
>>>
>>>
>>>> Good, Thanks, Mihael. I write it down in my test book, with
>>>> "error-reason-solution" tuple.
>>>>
>>>> Here comes another two:
>>>> Cannot create directory /nfs/osg-data/osgedu/benc/swift/061-cattwo-20090501-1035-4fpfuxk2
>>>> 500-System error in mkdir: Permission denied
>>>>
>>>>
>>> I think that's fairly clear. You'll need to find a workdir that you can
>>> write to.
>>>
>>>
>>>
>> I found "$TMP Location /nfs/osg-data" on
>> http://vors.grid.iu.edu/cgi-bin/index.cgi?region=0&VO=24&grid=1&dtype=0&res=387#SITELISTING,
>>
>> but it still doesn't work. Do you know any dir that I could try out?
>>
>
> What exactly did you try (i.e. paste relevant lines from sites.xml).
>
I used this one /nfs/osg-data/osgedu/benc/swift
>
>>>
>>>
>>>> Caused by:
>>>> Cannot submit job: Could not submit job (qsub reported an exit code of 188). no error output
>>>>
>>>>
>>> I'm sure that means something, but can you try with globus instead of
>>> the PBS provider?
>>>
>>>
>> Changing <execution provider="coaster" url="osg-ce.grid.uj.ac.za"
>> jobManager="gt2:pbs" />
>> to <execution provider="coaster" url="osg-ce.grid.uj.ac.za"
>> jobManager="gt2:gt2" /> ?
>>
>
> gt2:gt2:pbs.
>
I tried it out:
[zzhang at tp-grid1 sites]$ ./run-site coaster/uj-pbs-gram2.xml
testing site configuration: coaster/uj-pbs-gram2.xml
Removing files from previous runs
Running test 061-cattwo at Fri May 1 15:15:25 CDT 2009
Swift 0.9rc2 swift-r2860 cog-r2388
RunID: 20090501-1515-bnucy0g4
Progress:
Progress: Stage in:1
Progress: Submitting:1
Progress: Submitting:1
Progress: Submitted:1
Progress: Active:1
Failed to transfer wrapper log from
061-cattwo-20090501-1515-bnucy0g4/info/d on teraport
Progress: Stage in:1
Progress: Active:1
Progress: Failed but can retry:1
Failed to transfer wrapper log from
061-cattwo-20090501-1515-bnucy0g4/info/f on teraport
Progress: Stage in:1
Progress: Active:1
Progress: Failed but can retry:1
Failed to transfer wrapper log from
061-cattwo-20090501-1515-bnucy0g4/info/h on teraport
Progress: Failed:1
Execution failed:
Exception in cat:
Arguments: [061-cattwo.1.in, 061-cattwo.2.in]
Host: teraport
Directory: 061-cattwo-20090501-1515-bnucy0g4/jobs/h/cat-hzcz77aj
stderr.txt:
stdout.txt:
----
Caused by:
Failed to start worker:
null
null
org.globus.gram.GramException: The job failed when the job manager
attempted to run it
at
org.globus.cog.abstraction.impl.execution.gt2.JobSubmissionTaskHandler.statusChanged(JobSubmissionTaskHandler.java:530)
at org.globus.gram.GramJob.setStatus(GramJob.java:184)
at org.globus.gram.GramCallbackHandler.run(CallbackHandler.java:176)
at java.lang.Thread.run(Thread.java:595)
Cleaning up...
Shutting down service at https://152.106.18.251:49647
Got channel MetaChannel: 1969344019 -> GSSSChannel-null(1)
- Done
SWIFT RETURN CODE NON-ZERO - test 061-cattwo
The sites.xml file I am using is
[zzhang at tp-grid1 sites]$ cat coaster/uj-pbs-gram2.xml
<config>
<pool handle="teraport" >
<gridftp url="gsiftp://osg-ce.grid.uj.ac.za" />
<execution provider="coaster" url="osg-ce.grid.uj.ac.za"
jobManager="gt2:gt2:pbs" />
<workdirectory >/nfs/home/benc/swifttest</workdirectory>
<profile namespace="karajan" key="jobThrottle">4</profile>
</pool>
</config>
>
>
>
More information about the Swift-devel
mailing list