[Swift-devel] Re: coaster issue

Zhao Zhang zhaozhang at uchicago.edu
Fri May 1 15:18:55 CDT 2009



Mihael Hategan wrote:
> On Fri, 2009-05-01 at 14:57 -0500, Zhao Zhang wrote:
>   
>> Mihael Hategan wrote:
>>     
>>> On Fri, 2009-05-01 at 14:21 -0500, Zhao Zhang wrote:
>>>   
>>>       
>>>> Good, Thanks, Mihael. I write it down in my test book, with 
>>>> "error-reason-solution" tuple.
>>>>
>>>> Here comes another two:
>>>> Cannot create directory /nfs/osg-data/osgedu/benc/swift/061-cattwo-20090501-1035-4fpfuxk2
>>>> 500-System error in mkdir: Permission denied
>>>>     
>>>>         
>>> I think that's fairly clear. You'll need to find a workdir that you can
>>> write to.
>>>
>>>   
>>>       
>> I found "$TMP Location /nfs/osg-data" on 
>> http://vors.grid.iu.edu/cgi-bin/index.cgi?region=0&VO=24&grid=1&dtype=0&res=387#SITELISTING, 
>>
>> but it still doesn't work. Do you know any dir that I could try out?
>>     
>
> What exactly did you try (i.e. paste relevant lines from sites.xml).
>   
I used this one /nfs/osg-data/osgedu/benc/swift
>   
>>>   
>>>       
>>>> Caused by:
>>>> 	Cannot submit job: Could not submit job (qsub reported an exit code of 188). no error output
>>>>     
>>>>         
>>> I'm sure that means something, but can you try with globus instead of
>>> the PBS provider?
>>>   
>>>       
>> Changing <execution provider="coaster" url="osg-ce.grid.uj.ac.za" 
>> jobManager="gt2:pbs" />
>> to <execution provider="coaster" url="osg-ce.grid.uj.ac.za" 
>> jobManager="gt2:gt2" /> ?
>>     
>
> gt2:gt2:pbs.
>   
I tried it out:
[zzhang at tp-grid1 sites]$ ./run-site coaster/uj-pbs-gram2.xml
testing site configuration: coaster/uj-pbs-gram2.xml
Removing files from previous runs
Running test 061-cattwo at Fri May 1 15:15:25 CDT 2009
Swift 0.9rc2 swift-r2860 cog-r2388

RunID: 20090501-1515-bnucy0g4
Progress:
Progress:  Stage in:1
Progress:  Submitting:1
Progress:  Submitting:1
Progress:  Submitted:1
Progress:  Active:1
Failed to transfer wrapper log from 
061-cattwo-20090501-1515-bnucy0g4/info/d on teraport
Progress:  Stage in:1
Progress:  Active:1
Progress: Failed but can retry:1
Failed to transfer wrapper log from 
061-cattwo-20090501-1515-bnucy0g4/info/f on teraport
Progress:  Stage in:1
Progress:  Active:1
Progress: Failed but can retry:1
Failed to transfer wrapper log from 
061-cattwo-20090501-1515-bnucy0g4/info/h on teraport
Progress:  Failed:1
Execution failed:
        Exception in cat:
Arguments: [061-cattwo.1.in, 061-cattwo.2.in]
Host: teraport
Directory: 061-cattwo-20090501-1515-bnucy0g4/jobs/h/cat-hzcz77aj
stderr.txt:

stdout.txt:

----

Caused by:
        Failed to start worker:
null
null
org.globus.gram.GramException: The job failed when the job manager 
attempted to run it
        at 
org.globus.cog.abstraction.impl.execution.gt2.JobSubmissionTaskHandler.statusChanged(JobSubmissionTaskHandler.java:530)
        at org.globus.gram.GramJob.setStatus(GramJob.java:184)
        at org.globus.gram.GramCallbackHandler.run(CallbackHandler.java:176)
        at java.lang.Thread.run(Thread.java:595)

Cleaning up...
Shutting down service at https://152.106.18.251:49647
Got channel MetaChannel: 1969344019 -> GSSSChannel-null(1)
- Done
SWIFT RETURN CODE NON-ZERO - test 061-cattwo

The sites.xml file I am using is
[zzhang at tp-grid1 sites]$ cat coaster/uj-pbs-gram2.xml
<config>

  <pool handle="teraport" >
    <gridftp  url="gsiftp://osg-ce.grid.uj.ac.za" />
    <execution provider="coaster" url="osg-ce.grid.uj.ac.za" 
jobManager="gt2:gt2:pbs" />

    <workdirectory >/nfs/home/benc/swifttest</workdirectory>
    <profile namespace="karajan" key="jobThrottle">4</profile>
  </pool>

</config>

>
>
>   



More information about the Swift-devel mailing list