[Swift-devel] Clustering and Temp Dirs with Swift

Ian Foster foster at mcs.anl.gov
Mon Oct 29 09:17:50 CDT 2007


unique + user-controlled (two parts) sounds a good idea, I understand 
the motivation

Michael Wilde wrote:
> I was suggesting that workflow IDs get their global uniqueness via a 
> composite name, not a single globally unique GUID.
>
> As we collect data in a central place, I envision a hierarchy of 
> $SWIFT_LOGS/project/user/submithost/workflow-run/intermediate-dirs/objects 
>
>
> (or something similar)
>
> This hierarchy doesnt have to be consistent or constant, as long as 
> there is as well-defined notion of a workflow's "run directory" and 
> the path to each run dir is unique. The the log processor will find 
> everything.
>
> As a user, having to constantly work in a space of "dense" unique 
> names is hard - its a source of cognitive dissonance.
>
> If the system would give me a choice of using simpler name, nicely 
> balance my files over directories for performance, and accept my log 
> data for analysis, that would be great. But most important is that it 
> work well and fast.
>
> Given a choice, I'd much rather work using the current "dissonant" 
> names than not work. So my comments on naming are a minor issue and we 
> can put them aside for now. (I will try harder to stop talking about 
> this ;)
>
> We're currently focusing on solving the performance problems and 
> continually enhancing the log processing for analysis (related). We 
> should keep doing that, and can review our file-naming issues in a few 
> months from now, unless naming changes are needed for directory 
> balancing.
>
> - Mike
>
>
> On 10/29/07 7:35 AM, Ian Foster wrote:
>> If they are not globally unique, don't we have problems when we 
>> combine logs from multiple sources?
>>
>> Sent via BlackBerry from T-Mobile
>>
>> -----Original Message-----
>> From: Ben Clifford <benc at hawaga.org.uk>
>>
>> Date: Mon, 29 Oct 2007 08:47:04 To:Michael Wilde <wilde at mcs.anl.gov>
>> Cc:swiftdevel <swift-devel at ci.uchicago.edu>
>> Subject: Re: [Swift-devel] Clustering and Temp Dirs with Swift
>>
>>
>>
>> On Sun, 28 Oct 2007, Michael Wilde wrote:
>>
>>> Workflow IDs dont need to be unique outside of a user or group.
>>
>> The way I've been thinking things would work with log file names 
>> (which to an extent overlaps with workflow IDs) is this:
>>
>>   * Swift generates a log file name by default that is very unique
>>     (i.e. its present format is workflow name + timestamp + random)
>>
>>   * The log file name can be overridden with the -log command line 
>> option
>>     (which was broken but I fixed it in r1357)
>>
>>   * To get domain-specific log file naming with your own
>>     uniqueness rules (eg. a sequence number), use -log
>>     to specify that.
>>
>> I think the present log naming is a good way to name things in the 
>> absence of any domain-specific naming strategy; and I think -log is a 
>> good way for a domain specific naming strategy to be plugged in.
>>
>

-- 

   Ian Foster, Director, Computation Institute
Argonne National Laboratory & University of Chicago
Argonne: MCS/221, 9700 S. Cass Ave, Argonne, IL 60439
Chicago: Rm 405, 5640 S. Ellis Ave, Chicago, IL 60637
Tel: +1 630 252 4619.  Web: www.ci.uchicago.edu.
      Globus Alliance: www.globus.org.




More information about the Swift-devel mailing list