[Swift-devel] several alternatives to design the data management system for Swift on SuperComputers
    Ioan Raicu 
    iraicu at cs.uchicago.edu
       
    Mon Dec  1 21:32:24 CST 2008
    
    
  
But it's not just about directories and GPFS locking... it's about 8 or
16 large servers with 10Gb/s network connectivity (as is the case for
GPFS) compared to potentially 40K servers, each with 1Gb/s connectivity
(as would be the case in our example).  The potential raw throughput of
the latter case, when we use all 40K nodes as servers to the file system,
is orders of magnitude larger than that of a static configuration with 8
or 16 servers.  It's not yet clear we can actually achieve anything close
to the upper bound of performance at full scale, but it should be obvious
that the performance characteristics will be quite different between
GPFS and CIO.
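
To put rough numbers on that, here is a back-of-the-envelope sketch in
Python.  It assumes "40K" means exactly 40,000 nodes and that every link
runs at its nominal line rate with no protocol or contention overhead; the
function and constant names are made up for illustration.  It only computes
the upper bound, not what either system would deliver in practice.

```python
# Back-of-the-envelope aggregate network bandwidth, in Gb/s.
# Assumptions (not measurements): nominal link rates, no overhead,
# and "40K" taken as exactly 40,000 nodes.

def aggregate_gbps(servers, link_gbps):
    """Upper bound on raw throughput: every server saturates its link."""
    return servers * link_gbps

# Static GPFS-style configuration: 8 or 16 servers at 10 Gb/s each.
GPFS_SMALL = aggregate_gbps(8, 10)
GPFS_LARGE = aggregate_gbps(16, 10)

# Every compute node acting as a file server: 40K nodes at 1 Gb/s each.
CIO_FULL = aggregate_gbps(40_000, 1)

# How many times larger the distributed upper bound is.
RATIO_VS_SMALL = CIO_FULL / GPFS_SMALL
RATIO_VS_LARGE = CIO_FULL / GPFS_LARGE

if __name__ == "__main__":
    print("GPFS, 8 servers :", GPFS_SMALL, "Gb/s")
    print("GPFS, 16 servers:", GPFS_LARGE, "Gb/s")
    print("40K node servers:", CIO_FULL, "Gb/s")
    print("ratio vs 8 servers :", RATIO_VS_SMALL)
    print("ratio vs 16 servers:", RATIO_VS_LARGE)
```

Under those assumptions the aggregate is 40,000 Gb/s against 80 to 160
Gb/s, a factor of 250x to 500x, i.e. between two and three orders of
magnitude.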
Ioan
Mihael Hategan wrote:
> On Mon, 2008-12-01 at 17:10 -0600, Ioan Raicu wrote:
>   
>> Mihael Hategan wrote: 
>>     
>>> On Mon, 2008-12-01 at 16:52 -0600, Ioan Raicu wrote:
>>>
>>> ...
>>>
>>>> I don't think you realize how expensive GPFS access is when doing so
>>>> at 100K CPU scale.
>>>>     
>>>>         
>>> I don't think I understand what you mean by "access". As I said, things
>>> that generate contention are going to be slow.
>>>
>>> If the problem requires that contention to happen, then it doesn't
>>> matter what the solution is. If it does not, then I suspect that there
>>> is a way to avoid contention in GPFS, too (sticking things in different
>>> directories).
>>>   
>>>       
>> The basic idea is that many smaller shared file systems will scale
>> better than 1 large file system, as the contention is localized.
>>     
>
> Which is the same behaviour you get if you have a hierarchy of
> directories. This is what Ben implemented in Swift.
>
>   
>>  The problem is that having 1 global namespace is simple and
>> straightforward, but having N local namespaces is not, and requires extra
>> management.
>>     
>
> Right. That's why most filesystems I know of treat directories as
> independent files containing file metadata (aka. "local namespaces").
>
>
>   
-- 
===================================================
Ioan Raicu
Ph.D. Candidate
===================================================
Distributed Systems Laboratory
Computer Science Department
University of Chicago
1100 E. 58th Street, Ryerson Hall
Chicago, IL 60637
===================================================
Email: iraicu at cs.uchicago.edu
Web:   http://www.cs.uchicago.edu/~iraicu
http://dev.globus.org/wiki/Incubator/Falkon
http://dsl-wiki.cs.uchicago.edu/index.php/Main_Page
===================================================