[Swift-devel] misassignment of jobs
Mihael Hategan
hategan at mcs.anl.gov
Sun Nov 21 20:37:48 CST 2010
Ok. I will remove the idle timeouts from the worker. I do not expect any
negative consequences there given the reasoning I outlined before.
Mihael
On Sun, 2010-11-21 at 19:37 -0600, Michael Wilde wrote:
> OK, re bug 2: I didnt connect the symptoms of this issue with your earlier comments on timeouts, and just verified that you are correct: with the same extended timeouts I was using to try to keep a persistent coaster service up for an extended time, the failing case for bug 2 works.
>
> I'll try to reproduce bug 1 now, then 3.
>
> - Mike
>
>
> ----- Original Message -----
> > On Sun, 2010-11-21 at 17:10 -0600, Michael Wilde wrote:
> > > Mihael,
> > >
> > > If you're in fixin' mode,
> >
> > I've been in fixin' mode for the past two months :)
> >
> > > I'll spend some time now trying to reproduce the 3 coaster problems
> > > that are high on my "needed for users" list:
> > >
> > > 1. Swift hangs/fails talking to persistent server if it sites idle
> > > for
> > > a few minutes, even with large timeout values (which were possibly
> > > not
> > > set correctly or fully).
> > >
> > > 2. With normal coaster mode, if workers start toiming out for lack
> > > of work, the Swift run dies.
> >
> > That one is addressed by removing the worker timeout. As I mentioned
> > in
> > a previous email, that timeout is a artifact of an older worker
> > management scheme.
> >
> > >
> > > 3. Errors in provider staging at high volume.
> > >
> > > If you already have test cases for these issues, let me know, and
> > > I'll
> > > focus on the missing ones. But Im assuming for now you need all
> > > three.
> >
> > I have test cases for 1 and 3. I couldn't reproduce the problems so
> > far.
> >
> > Mihael
>
More information about the Swift-devel
mailing list