[Swift-devel] Experiments on Beagle

Lorenzo Pesce lpesce at uchicago.edu
Thu Oct 25 11:52:17 CDT 2012


Failures are starting to appear and dominate the results. I am letting it run anyway so you get a chance to look at it live.

On Oct 25, 2012, at 10:22 AM, Michael Wilde wrote:

> Lorenzo, 10K cores sounds great.
> 
> Regarding not using all the nodes: I have seen that on Cray test runs, but only at >16K cores.  Its also possible that one or more throttle settings are holding back your runs.
> 
> Can you point us to the run directory where we can watch your log file and see your config files and your script?
> 
> - Mike
> 
> 
> ----- Original Message -----
>> From: "Lorenzo Pesce" <lpesce at uchicago.edu>
>> To: "swift-devel Devel" <swift-devel at ci.uchicago.edu>
>> Sent: Thursday, October 25, 2012 10:05:52 AM
>> Subject: [Swift-devel] Experiments on Beagle
>> Hi --
>> I am running on more than 10,000 cores because there are a good number
>> of users having problems running their jobs, which left the machine
>> for me =)
>> 
>> I am doing work for a user, so don't worry too much.
>> 
>> I am just writing in case you want to take a look at how the
>> simulations are proceeding, how the memory is used (login5, user
>> lpesce) and how the number of tasks goes up and down as jobs are
>> completed.
>> (For example, I asked for 500 nodes and I am getting only a little
>> over 400, but the machine has available the nodes I asked for, swift
>> is just not taking them as far as I can tell, the number first spiked
>> up to the requested number of nodes --I think-- then winded down)
>> 
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
> 
> -- 
> Michael Wilde
> Computation Institute, University of Chicago
> Mathematics and Computer Science Division
> Argonne National Laboratory
> 




More information about the Swift-devel mailing list