[Swift-devel] How does swift know if a task is successful

Zhao Zhang zhaozhang at uchicago.edu
Tue Mar 17 23:58:10 CDT 2009


Hi, Mihael

I modified the vdl-int.k in cog/module/swift/libexec, and rebuilt swift, 
and I used my customized wrapper.sh.
I ran the first.swift as a test,  the job returned successful,  and the 
output file was still staged out. Any ideas?
Thanks.

zhao

Mihael Hategan wrote:
> On Tue, 2009-03-17 at 13:36 -0500, Zhao Zhang wrote:
>   
>> Hi, Mihael
>>
>> I commented the following lines
>>   /*dir:make(ldir)
>>      restartOnError(".*", 2
>>      task:transfer(srchost=host, srcfile=bname,
>>      srcdir=rdir, destdir=ldir, desthost=dhost, destprovider=provider)
>>   )*/
>>
>>     
>
> Did you modify this file in dist/?/libexec? If not, did you re-compile
> swift after the modification?
>
> Put an echo or a log message in place, to see if your change is picked
> up by swift next time.
>
>   
>> Then I modified wrapper.sh to not to copy output file back, but I still 
>> got an error.
>> The log file is at 
>> http://www.ci.uchicago.edu/~zzhang/first-20090317-1327-oqgttus8.log
>> Thanks
>>
>> zhao
>>
>> zzhang at login6.surveyor:~/new_dock6> ./run_swift_ssh.sh 1117 64 first.swift
>> waiting for at least 64 nodes to register before submitting workload...
>> waiting to find at least 1 services in file 
>> /home/falkon/users/zzhang/1117/config/Client-service-URIs.config...
>> all done, file has found at least 1 services
>> found at least 64 registered, submitting workload...
>> Swift svn swift-r2676 (swift modified locally) cog-r2305
>>
>> RunID: 20090317-1327-oqgttus8
>> Progress:
>> Progress:  Selecting site:1 Stage in:1
>> Progress:  Submitting:1 Submitted:1
>> Progress:  Submitted:1 Failed but can retry:1
>> Failed to transfer wrapper log from 
>> first-20090317-1327-oqgttus8/info/b/n/bgp000
>> Progress:  Submitted:1 Active:1
>> Failed to transfer wrapper log from 
>> first-20090317-1327-oqgttus8/info/e/n/bgp000
>> Progress:  Submitted:1 Active:1
>> Failed to transfer wrapper log from 
>> first-20090317-1327-oqgttus8/info/g/n/bgp000
>> Execution failed:
>>         Exception in echo:
>> Arguments: [Hello, world!]
>> Host: bgp000
>> Directory: first-20090317-1327-oqgttus8/jobs/g/n/echo-gnlq238j
>> stderr.txt:
>>
>> stdout.txt:
>>
>> ----
>>
>> Caused by:
>>         Cannot transfer 
>> "/tmp/first-20090317-1327-oqgttus8/shared/hello.txt" to 
>> "/gpfs/home/zzhang/new_dock6/./hello.txt"
>> Caused by:
>>         No such file
>>
>>
>> Mihael Hategan wrote:
>>     
>>> On Tue, 2009-03-17 at 12:23 -0500, Zhao Zhang wrote:
>>>   
>>>       
>>>> Hi, Mihael
>>>>
>>>> yes, can I do that?
>>>>     
>>>>         
>>> You should know this by now:
>>> in vdl-int.k, in doStageout, comment out the task:transfer invocation
>>> (and dir:make).
>>>
>>>   
>>>       
>>>> zhao
>>>>
>>>> Mihael Hategan wrote:
>>>>     
>>>>         
>>>>> On Tue, 2009-03-17 at 12:14 -0500, Zhao Zhang wrote:
>>>>>   
>>>>>       
>>>>>           
>>>>>> Here comes another question, is there any place that I could set to 
>>>>>> disable swift's waiting for data feature?
>>>>>>     
>>>>>>         
>>>>>>             
>>>>> Do you mean disable the stage-outs?
>>>>>
>>>>>   
>>>>>       
>>>>>           
>>>>>> Or is there any way for me to cheat swift that the data is already 
>>>>>> there? thanks.
>>>>>>
>>>>>> zhao
>>>>>>
>>>>>> Mihael Hategan wrote:
>>>>>>     
>>>>>>         
>>>>>>             
>>>>>>> On Fri, 2009-03-13 at 17:07 -0500, Zhao Zhang wrote:
>>>>>>>   
>>>>>>>       
>>>>>>>           
>>>>>>>               
>>>>>>>> Hi, All
>>>>>>>>
>>>>>>>> I have a question on how swift knows if a task is successful.
>>>>>>>> In my case, I am using a status notification instead of a status file.
>>>>>>>>
>>>>>>>> So my question is is this status notification the only thing swift is
>>>>>>>> waiting for, or is swift also waiting for the output data to appear to
>>>>>>>> say that a job is successful?
>>>>>>>>     
>>>>>>>>         
>>>>>>>>             
>>>>>>>>                 
>>>>>>> Once the job is done, swift will attempt to stage out all the files that
>>>>>>> it expects the job to have produced.
>>>>>>>
>>>>>>> Should one of those files not be there, there will be failures.
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>>   
>>>>>>>       
>>>>>>>           
>>>>>>>               
>>>>>   
>>>>>       
>>>>>           
>>>   
>>>       
>
>
>   



More information about the Swift-devel mailing list