[Swift-devel] How to increase Swift-coaster task rate

Jonathan Monette jonmon at mcs.anl.gov
Thu Apr 26 22:32:24 CDT 2012


Yes. I will get a log file representing the run for you, Mihael.

I believe the Java was actually IBM's, but I could be wrong. I do not think we asked the Cray engineer to load Sun Java. It would be nice if Swift recorded the JVM vendor and version in the log file.
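
For illustration only, here is a minimal sketch of the kind of startup line that would settle the question. It is not taken from the Swift source: the class name JvmInfo and the output format are hypothetical. It reads the standard Java system properties; "sun.arch.data.model" is a non-standard property that may be missing on some JVMs, so it falls back to "unknown".

```java
public class JvmInfo {
    /**
     * Builds a one-line description of the running JVM.
     * Hypothetical helper, not part of Swift.
     */
    public static String describe() {
        // Standard properties, defined for every Java runtime.
        String vendor = System.getProperty("java.vendor");
        String version = System.getProperty("java.version");
        String vmName = System.getProperty("java.vm.name");
        String arch = System.getProperty("os.arch");
        // Non-standard property; fall back if the JVM does not set it.
        String bits = System.getProperty("sun.arch.data.model", "unknown");
        return "JVM: vendor=" + vendor
                + ", version=" + version
                + ", vm=" + vmName
                + ", os.arch=" + arch
                + ", dataModel=" + bits;
    }

    public static void main(String[] args) {
        // In a real run this string would go to the logger, not stdout.
        System.out.println(describe());
    }
}
```

Written once at startup, a line like this would tell us afterwards whether a run used Sun or IBM Java and whether the JVM was 64-bit.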

On Apr 26, 2012, at 21:44, Michael Wilde <wilde at mcs.anl.gov> wrote:

> We need to re-run the tests on a machine we can get to. The current tests were run for us by a Cray engineer, and we haven't tarred the logs back yet.
> 
> Submit host machine was a 6-core Cray login host, probably AMD at 2.5GHz or so, probably 32GB RAM.
> 
> Coaster service was local in the Swift JVM. Can try a separate service.
> 
> Java was (I think) Sun 1.6.0_21; IBM Java is also installed there. Not sure if the Java was 64-bit.
> 
> Jon will re-run clean tests on the Raven Cray where we have full access.
> 
> Advice on dialing the logging way down would be valuable. Outside of that we're trying to avoid all ancillary touches of shared filesystems.
> 
> Thanks,
> 
> - Mike
> 
> 
> ----- Original Message -----
>> From: "Mihael Hategan" <hategan at mcs.anl.gov>
>> To: "Michael Wilde" <wilde at mcs.anl.gov>
>> Cc: "Swift Devel" <swift-devel at ci.uchicago.edu>
>> Sent: Thursday, April 26, 2012 8:47:38 PM
>> Subject: Re: How to increase Swift-coaster task rate
>> Can I see the log?
>> 
>> I don't think that there is a set of things that I can easily point out,
>> but I can try to see what the problems might be.
>> 
>> What's the submit host (cpu/cores/mem, etc)?
>> 
>> Separate service or auto/local?
>> 
>> Mihael
>> 
>> On Thu, 2012-04-26 at 20:37 -0500, Michael Wilde wrote:
>>> Mihael,
>>> 
>>> David, Jon, and I are working on Cray benchmarks for a paper for the
>>> Cray Users Group.
>>> 
>>> In tests so far, we are being limited by job submission rates of
>>> about 80 tasks/sec.
>>> 
>>> We'd like very much to drive that up closer to 200/sec if at all
>>> possible for the benchmarks we're trying to run.
>>> 
>>> The current tests are doing sleep 0 jobs with no file transfer to
>>> about 2400 cores on a Cray benchmark system. The workdir is set to
>>> /dev/shm. The throttles are almost all set way up (Jon can post the
>>> specific config and values).
>>> 
>>> One thing we have not yet done is try to get the log traffic way
>>> down; that's next up to try.
>>> 
>>> We'll revert to testing against 480 cores on raven for now. That
>>> should still be enough to push the upper limit of Swift, Karajan and
>>> coasters.
>>> 
>>> Can you give us a set of things to check (set, turn off, etc) to try
>>> to get closer to 200 tasks/sec? Do we need to set <scratch> to
>>> /dev/shm in addition to work dir?
>>> 
>>> This latest run was I think with provider staging and pin coaster
>>> files.
>>> 
>>> Thanks,
>>> 
>>> - Mike
>>> 
>>> 
> 
> -- 
> Michael Wilde
> Computation Institute, University of Chicago
> Mathematics and Computer Science Division
> Argonne National Laboratory
> 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
