[Swift-devel] GPFS issue of SWIFT on BGP
Ben Clifford
benc at hawaga.org.uk
Tue Feb 10 14:43:05 CST 2009
On Tue, 10 Feb 2009, Zhao Zhang wrote:
> What if there are 640 ssh-providers sending result files at the same time? Do
> you know any successful test case
> with hundreds of ssh-providers working together with one submit host?
Control is the other way round.
The Swift client will pull files down from the I/O nodes when jobs are
finished. (that is done by the dostageout call in execute2 in
libexec/vdl-int.k)
Swift has rate limiting on the number of file transfers and file
operations that can be in progress at any one time. By default, the limit
is 4 (for file transfers) and 8 (for file operations). This is controlled
by the throttle.transfers and throttle.file.operations settings in
swift.properties.
I think (but I am not sure) that this is a limit for the whole of Swift,
rather than per site (but I am not sure).
If jobs are finishing faster than Swift can stage out the data (which is
likely to happen) then a queue of transfer requests will build up inside
Swift.
I think it is quite likely (though I have no numerical evidence) that you
will find provider-ssh copies files too slowly for your liking; in which
case you would need to come up with a faster way of moving files between
the IO nodes and the submitting node. But you should see what happens with
provider-ssh first. You should easily be able to compute throughput rates
when you have log files for this.
--
More information about the Swift-devel
mailing list