[Swift-devel] Support request: Swift jobs flooding uc-teragrid?
Mike Kubal
mikekubal at yahoo.com
Tue Jan 29 20:00:09 CST 2008
The attachment contains the swift script, tc file,
sites file and swift.properties file.
I didn't provide any additional command line
arguments.
MikeK
--- Michael Wilde <wilde at mcs.anl.gov> wrote:
> [ was Re: Swift jobs on UC/ANL TG ]
>
> Hi. I'm at O'Hare and will be flying soon.
> Ben or Mihael, if you are online, can you investigate?
>
> Yes, there are significant throttles turned on by default, and the
> system opens those very gradually.
>
> MikeK, can you post to the swift-devel list your swift.properties file,
> command line options, and your swift source code?
>
> Thanks,
>
> MikeW
>
>
> On 1/29/08 8:11 AM, Ti Leggett wrote:
> > The default walltime is 15 minutes. Are you doing fork jobs or pbs jobs?
> > You shouldn't be doing fork jobs at all. Mike W, I thought there were
> > throttles in place in Swift to prevent this type of overrun? Mike K,
> > I'll need you to either stop these types of jobs until Mike W can verify
> > throttling or only submit a few 10s of jobs at a time.
> >
> > On Jan 28, 2008, at 01/28/08 07:13 PM, Mike Kubal wrote:
> >
> >> Yes, I'm submitting molecular dynamics simulations
> >> using Swift.
> >>
> >> Is there a default wall-time limit for jobs on tg-uc?
> >>
> >>
> >>
> >> --- joseph insley <insley at mcs.anl.gov> wrote:
> >>
> >>> Actually, these numbers are now escalating...
> >>>
> >>> top - 17:18:54 up 2:29, 1 user, load average: 149.02, 123.63, 91.94
> >>> Tasks: 469 total, 4 running, 465 sleeping, 0 stopped, 0 zombie
> >>>
> >>> insley at tg-grid1:~> ps -ef | grep kubal | wc -l
> >>> 479
> >>>
> >>> insley at tg-viz-login1:~> time globusrun -a -r tg-grid.uc.teragrid.org
> >>> GRAM Authentication test successful
> >>> real 0m26.134s
> >>> user 0m0.090s
> >>> sys 0m0.010s
> >>>
> >>>
> >>> On Jan 28, 2008, at 5:15 PM, joseph insley wrote:
> >>>
> >>>> Earlier today tg-grid.uc.teragrid.org (the UC/ANL TG GRAM host)
> >>>> became unresponsive and had to be rebooted. I am now seeing slow
> >>>> response times from the Gatekeeper there again. Authenticating to
> >>>> the gatekeeper should only take a second or two, but it is
> >>>> periodically taking up to 16 seconds:
> >>>>
> >>>> insley at tg-viz-login1:~> time globusrun -a -r tg-grid.uc.teragrid.org
> >>>> GRAM Authentication test successful
> >>>> real 0m16.096s
> >>>> user 0m0.060s
> >>>> sys 0m0.020s
> >>>>
> >>>> looking at the load on tg-grid, it is rather high:
> >>>>
> >>>> top - 16:55:26 up 2:06, 1 user, load average: 89.59, 78.69, 62.92
> >>>> Tasks: 398 total, 20 running, 378 sleeping, 0 stopped, 0 zombie
> >>>>
> >>>> And there appear to be a large number of processes owned by kubal:
> >>>> insley at tg-grid1:~> ps -ef | grep kubal | wc -l
> >>>> 380
> >>>>
> >>>> I assume that Mike is using swift to do the job submission. Is
> >>>> there some throttling of the rate at which jobs are submitted to
> >>>> the gatekeeper that could be done that would lighten this load
> >>>> some? (Or has that already been done since earlier today?) The
> >>>> current response times are not unacceptable, but I'm hoping to
> >>>> avoid having the machine grind to a halt as it did earlier today.
> >>>>
> >>>> Thanks,
> >>>> joe.
> >>>>
> >>>>
> >>>>
> >>>> ===================================================
> >>>> joseph a. insley                          insley at mcs.anl.gov
> >>>> mathematics & computer science division   (630) 252-5649
> >>>> argonne national laboratory               (630) 252-5986 (fax)
> >>>>
> >>>>
> >>>
> >>>
> >>> ===================================================
> >>> joseph a. insley                          insley at mcs.anl.gov
> >>> mathematics & computer science division   (630) 252-5649
> >>> argonne national laboratory               (630) 252-5986 (fax)
> >>>
> >>>
> >>>
> >>
> >>
> >>
> >>
> >>
>
> >>
> >
> >
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>
>