[Swift-devel] Swift and BGP

Mihael Hategan hategan at mcs.anl.gov
Wed Nov 18 16:39:22 CST 2009


On Wed, 2009-11-18 at 16:36 -0600, skenny at uchicago.edu wrote:
> >On Mon, 2009-10-26 at 11:56 -0500, Mihael Hategan wrote:
> >> So here's how one would go with this on intrepid:
> >> - determine the maximum number of workers (avg-exec-time * 100)
> >> - set the nodeGranularity to 512 nodes, 4 workers per node.
> Also set
> >> maxWorkers to 512 so that only 512 node blocks are
> requested. For some
> >> reason 512 node partitions start almost instantly (even if
> you have 6 of
> >> them) while 1024 node partitions you have to wait for.
> >> - set the total number of blocks ("slots" parameter) to
> >> no-of-workers/2048.
> >> - set the jobThrottle to 2*no-of-workers/100
> >> - make sure you also have foreach,max.threads set to
> 2*no-of-workers
> >> (though that depends on the structure of the program).
> >> - run on login6. There is no point in using the normal
> login machines
> >> since they have a limit of 1024 file descriptors per process.
> >> 
> 
> so, am i correct in understanding that currently swift can
> only run on login6 when running on intrepid? i ask because i'm
> currently not able to get on login6, but decided to try a
> 512-job workflow on login3 and got this:

Right. The error below is precisely the reason why login6 is needed.

> 
> 
> Progress:  Submitted:56  Active:456
> Server died: Too many open files
> java.net.SocketException: Too many open files
>         at





More information about the Swift-devel mailing list