[Swift-devel] GPFS issue of SWIFT on BGP

Zhao Zhang zhaozhang at uchicago.edu
Tue Feb 10 14:13:49 CST 2009


Hi,

Ben Clifford wrote:
> On Tue, 10 Feb 2009, Zhao Zhang wrote:
>
>   
>> Now we start Swift on the login node, and the working directory is created
>> on the I/O node, so all intermediate files and final result files are
>> copied back to the login node (GPFS) as soon as they are generated.
>> This brings back an old problem: all I/O nodes are trying to write files
>> into the same directory, which is what we have been trying to avoid all
>> along.
>> My solution would be to modify the ssh-provider source code and implement
>> asynchronous collector logic there.
>>     
>
> Can you describe what is going on here more explicitly?
>
> How do multiple IO nodes end up writing to the same GPFS directory?
>   
In a previous case, we had 512 I/O nodes each creating one file in the same
directory, and that took 30 minutes to finish.
Besides, sometimes only 510 of the files were created.
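
To illustrate the kind of layout that avoids having every writer in one
directory, here is a minimal sketch in Python. It is not Swift or
ssh-provider code: the function name bucketed_path, the bucket count of 64,
and the use of an MD5 hash of the file name are all assumptions made for the
example.

```python
import hashlib
from pathlib import Path


def bucketed_path(root, filename, buckets=64):
    """Map a file name to root/<bucket>/<filename> (hypothetical layout).

    The bucket is derived from a hash of the file name, so the mapping is
    deterministic and files are spread over `buckets` subdirectories
    instead of all landing in `root`.
    """
    digest = hashlib.md5(filename.encode("utf-8")).hexdigest()
    bucket = int(digest, 16) % buckets
    return Path(root) / f"{bucket:02d}" / filename
```

With 512 output files and 64 buckets, each subdirectory would hold about 8
files on average, so no single directory sees all 512 creates.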
> It is unclear to me from what you write how that comes about - as I 
> understand it:
>
>  . submit side data files are posix-accessed only by the swift submit-side 
> client
>   
yes
>  . files on the I/O nodes (the remote sites) use pset-local storage
>   
yes
>  . any communication between the I/O nodes and submit-side client happens 
> over ssh.
>   
yes
> Where does an I/O node access machine-wide GPFS?
>   
Data transfer from the I/O nodes to the submit-side client writes to GPFS
through the ssh-provider.
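
As a rough sketch of what I mean by asynchronous collector logic: producers
hand their results to a queue and return immediately, and a single writer
thread is the only one that creates files in the destination directory. This
is illustrative Python only, not the ssh-provider implementation; the class
name AsyncCollector and its submit/close methods are hypothetical.

```python
import queue
import threading
from pathlib import Path


class AsyncCollector:
    """Funnel results from many producers through one writer thread."""

    def __init__(self, dest_dir):
        self.dest_dir = Path(dest_dir)
        self.dest_dir.mkdir(parents=True, exist_ok=True)
        self._queue = queue.Queue()
        self._worker = threading.Thread(target=self._drain, daemon=True)
        self._worker.start()

    def submit(self, name, data):
        """Called by producers; enqueues the result without touching disk."""
        self._queue.put((name, data))

    def _drain(self):
        # Only this thread writes into dest_dir, so file creation in the
        # shared directory is serialized through a single writer.
        while True:
            item = self._queue.get()
            if item is None:  # sentinel placed by close()
                break
            name, data = item
            (self.dest_dir / name).write_bytes(data)

    def close(self):
        """Write everything queued so far, then stop the writer thread."""
        self._queue.put(None)
        self._worker.join()
```

Each incoming transfer would call submit(), and close() would be called once
at the end of the run so that all queued files are written before exit.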

zhao


