[Swift-devel] [Bug 72] Campaign for scaling wf up to 244 molecules
Mihael Hategan
hategan at mcs.anl.gov
Thu Jun 28 16:41:45 CDT 2007
On Thu, 2007-06-28 at 16:36 -0500, Ioan Raicu wrote:
> There is an option to have a pool of threads work on these data
> structures, but the pool size is set to 1.
Right, but the submit() method was called from different threads. Can we
stop arguing about the obvious?
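
To illustrate, here is a minimal sketch. The class and method names are hypothetical and this is not the actual Falkon provider code. If submit() can be entered from several caller threads, the shared map needs to be thread-safe (or guarded by a lock), whatever the size of the internal worker pool. The sketch assumes a ConcurrentHashMap is an acceptable replacement for a plain HashMap:

```java
import java.util.ArrayList;
import java.util.List;
import java.util.Map;
import java.util.concurrent.ConcurrentHashMap;

// Hypothetical stand-in for a provider; NOT the real Falkon provider.
class SubmitSketch {
    // Thread-safe map: concurrent put() calls cannot corrupt it.
    // A plain HashMap here could lose entries or break its internal state.
    private final Map<String, String> tasks = new ConcurrentHashMap<>();

    // May be called from any thread; the worker pool size is irrelevant here.
    void submit(String taskId) {
        tasks.put(taskId, "SUBMITTED");
    }

    int size() {
        return tasks.size();
    }

    // Starts `callers` threads, each submitting `perCaller` distinct tasks,
    // waits for all of them, and returns how many tasks were recorded.
    static int runConcurrentSubmits(int callers, int perCaller) {
        SubmitSketch provider = new SubmitSketch();
        List<Thread> threads = new ArrayList<>();
        for (int c = 0; c < callers; c++) {
            final int id = c;
            Thread t = new Thread(() -> {
                for (int i = 0; i < perCaller; i++) {
                    provider.submit(id + "-" + i);
                }
            });
            threads.add(t);
            t.start();
        }
        for (Thread t : threads) {
            try {
                t.join();
            } catch (InterruptedException e) {
                Thread.currentThread().interrupt();
                throw new IllegalStateException(e);
            }
        }
        return provider.size();
    }
}
```

With the concurrent map, the recorded count always equals callers * perCaller. With an unsynchronized HashMap the same test may pass many times and then fail, so a clean stress test does not prove the code is correct.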
> Point is well taken, we have fixed this, but I am not convinced this
> is where the problem was. We'll see after we do another run with all
> the extra logging.
Can you commit the updates to svn?
>
> Ioan
>
> Mihael Hategan wrote:
> > > > - did Mihael discover an error in Falkon mutex code?
> > > >
> > > >
> > > >
> > > We are not sure, but we are adding extra synchronization in several
> > > parts of the Falkon provider. The reason we are saying that we are not
> > > sure is that we stress tested (pushing 50~100 tasks/sec) both the Falkon
> > > provider and Falkon itself over and over again, and we never encountered
> > > this. Now that we have a workflow that averages 1 task/sec, I find
> > > it hard to believe that a synchronization issue that never surfaced
> > > before under stress testing is surfacing now under such a light load.
> > >
> >
> > ?!?
> > You are mutating maps and lists from concurrent threads without
> > synchronization. That is a problem regardless of any other
> > considerations.
> >
> > Mihael
> >
> >
> >
> >
> >
>
> --
> ============================================
> Ioan Raicu
> Ph.D. Student
> ============================================
> Distributed Systems Laboratory
> Computer Science Department
> University of Chicago
> 1100 E. 58th Street, Ryerson Hall
> Chicago, IL 60637
> ============================================
> Email: iraicu at cs.uchicago.edu
> Web: http://www.cs.uchicago.edu/~iraicu
> http://dsl.cs.uchicago.edu/
> ============================================
> ============================================