[Fwd: Re: [Swift-devel] Re: swift-falkon problem... plots to explain plateaus...]

Michael Wilde wilde at mcs.anl.gov
Tue Mar 25 08:16:07 CDT 2008


On 3/25/08 3:31 AM, Mihael Hategan wrote:
> On Tue, 2008-03-25 at 00:28 -0500, Michael Wilde wrote:
>> I eyeballed the wrapperlogs to get a rough idea of what was happening.
>>
>> I ran with wrapperlog saving and no other changes for wf's of 10, 100 
>> and 500 jobs, to see how the exec time grew.  At 500 jobs it grew to 
>> about 30+ seconds for a core app exec time of about 1 sec. (Im just 
>> recollecting the times as at this point I didnt write much down).
>>
> 
> I would personally like to see those logs.

I listed all the runs in the previous mail (below), Mihael. They are on 
CI NFS at ~benc/swift-logs/wilde/run{345-350}. Let us know what you find.

Thanks,

- Mike

On 3/25/08 12:28 AM, Michael Wilde wrote:
 > I eyeballed the wrapperlogs to get a rough idea of what was happening.
 >
 > I ran with wrapperlog saving and no other changes for wf's of 10, 100
 > and 500 jobs, to see how the exec time grew.  At 500 jobs it grew to
 > about 30+ seconds for a core app exec time of about 1 sec. (Im just
 > recollecting the times as at this point I didnt write much down).
 >
 > First results showed more time spent in the app wrapper than in
 > wrapper.sh.  I remedied this by using /tmp as the app-wrapper's working
 > dir, and caching the app binary on /tmp.  This brought a 20+ sec app
 > exec time down to about 3 seconds.
 >
 > With this fixed, the total time in wrapper.sh including the app is now
 > about 15 seconds, with 3 being in the app-wrapper itself. The time seems
 > about evenly spread over the several wrapper.sh operations, which is not
 > surprising when 500 wrappers hit NFS all at once.
 >
 > I then tried 3 more tests:
 > - a run to see if the app-executable caching on /tmp had an effect
 >   (it didnt)
 > - a run to see if turning of wrapperlog retrieval had an effect
 > - a run with data operation throttles (both) set to 100 from 10
 >
 > None of these last three things had a significant effect.
 >
 > Tomorrow I will try some mods to the wrapper script. Turning off wrapper
 > logging in a previous trial yesterday *seemed* to shave 20-30% off the
 > run time.  I need to verify this.
 >
 > I'm also going to try to use /tmp for the jobdir and reduce wrapper.sh
 > overhead; also will leave the (tiny) job output on /tmp for later
 > aggregation (will have some swift questions on that).
 >
 > Ben, if you want to look at any of these logs, the runs are in
 > swift-logs/wilde in the order described above (w/comment files):
 >
 > 346: 10 job workflow
 > 347: 100 job wf
 > 348: 500 job wf
 > 349: 500 jobs w/ improved app-wrapper
 > 350: 500 jobs w/ improved app-wrapper & executable on /tmp
 > 351: 500 jobs, wrapperlog saving off
 > 352: 500 jobs, wrapperlog saving off, data throttles at 100 (from 20)
 >
 > All but the first of these should have falkon logs saved as well.
 >
 > I have several ideas on how to proceed, but welcome advice and any
 > discoveries from log analysis.
 >
 > Thanks,
 >
 > Mike
 >
 >
 > On 3/24/08 10:15 PM, Michael Wilde wrote:
 >> Ben, do you have a script to sum the time spent per step of
 >> wrapper.sh, over a set in -info files?
 >>
 >> On 3/24/08 6:36 PM, Ben Clifford wrote:
 >>> On Mon, 24 Mar 2008, Mihael Hategan wrote:
 >>>
 >>>> As far as I can remember, Ben added fairly comprehensive logging 
to the
 >>>> wrapper. That may shed some light on the issue.
 >>>
 >>> Indeed I did; and that logging information can be sent back to the
 >>> submit host by enabling wrapperlog.always.transfer=true
 >>>
 >>
 >




More information about the Swift-devel mailing list