[Swift-devel] Experiments on Beagle
Lorenzo Pesce
lpesce at uchicago.edu
Thu Oct 25 10:50:04 CDT 2012
Run started to falter and I killed it. I resent it out capturing the screen to *.screenlog
(lots of failed wrappers, it might be the app itself or the system, I don't know).
I hope now I am capturing all the info.
On Oct 25, 2012, at 10:22 AM, Michael Wilde wrote:
> Lorenzo, 10K cores sounds great.
>
> Regarding not using all the nodes: I have seen that on Cray test runs, but only at >16K cores. Its also possible that one or more throttle settings are holding back your runs.
>
> Can you point us to the run directory where we can watch your log file and see your config files and your script?
>
> - Mike
>
>
> ----- Original Message -----
>> From: "Lorenzo Pesce" <lpesce at uchicago.edu>
>> To: "swift-devel Devel" <swift-devel at ci.uchicago.edu>
>> Sent: Thursday, October 25, 2012 10:05:52 AM
>> Subject: [Swift-devel] Experiments on Beagle
>> Hi --
>> I am running on more than 10,000 cores because there are a good number
>> of users having problems running their jobs, which left the machine
>> for me =)
>>
>> I am doing work for a user, so don't worry too much.
>>
>> I am just writing in case you want to take a look at how the
>> simulations are proceeding, how the memory is used (login5, user
>> lpesce) and how the number of tasks goes up and down as jobs are
>> completed.
>> (For example, I asked for 500 nodes and I am getting only a little
>> over 400, but the machine has available the nodes I asked for, swift
>> is just not taking them as far as I can tell, the number first spiked
>> up to the requested number of nodes --I think-- then winded down)
>>
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
>
> --
> Michael Wilde
> Computation Institute, University of Chicago
> Mathematics and Computer Science Division
> Argonne National Laboratory
>
More information about the Swift-devel
mailing list