[Swift-devel] Re: 244 MolDyn run was successful!

Ben Clifford benc at hawaga.org.uk
Mon Aug 27 12:37:51 CDT 2007



On Mon, 27 Aug 2007, Ioan Raicu wrote:

> On a similar note, IMO, the heuristic in Karajan should be modified to take
> into account the task execution time of the failed or successful task, and not
> just the number of tasks.  This would ensure that Swift is not throttling task
> submission to Falkon when there are 1000s of successful tasks that take on the
> order of 100s of seconds to complete, yet there are also 1000s of failed tasks
> that are only 10 ms long.  This is exactly the case with MolDyn, when we get a
> bad node in a bunch of 100s of nodes, which ends up throttling the number of
> active and running tasks to about 100, regardless of the number of processors
> Falkon has. 

Is that different from when submitting to PBS or GRAM where there are 
1000s of successful tasks taking 100s of seconds to complete but with 
1000s of failed tasks that are only 10ms long?
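
For illustration only, here is a minimal sketch of the difference between scoring by task count and scoring by task execution time, using the numbers from the scenario above. This is not Karajan's actual heuristic: the `TaskResult` type, the two scoring functions, and the +1/-1 weights are all hypothetical and chosen only to show the idea. Nothing in the sketch depends on whether the provider is Falkon, PBS, or GRAM.

```python
from dataclasses import dataclass


@dataclass
class TaskResult:
    """Outcome of one task (hypothetical type, not part of Swift/Karajan)."""
    succeeded: bool
    duration_s: float  # wall-clock execution time in seconds


def count_based_score(results):
    """Assumed count-only scoring: +1 per success, -1 per failure."""
    return sum(1 if r.succeeded else -1 for r in results)


def time_weighted_score(results):
    """Assumed time-weighted scoring: each task counts by how long it ran."""
    return sum(r.duration_s if r.succeeded else -r.duration_s for r in results)


# Scenario from the thread: 1000 successes of ~100 s, 1000 failures of ~10 ms.
results = [TaskResult(True, 100.0)] * 1000 + [TaskResult(False, 0.01)] * 1000

count_score = count_based_score(results)
time_score = time_weighted_score(results)
print("count-based score:  ", count_score)
print("time-weighted score:", time_score)
```

Under these assumptions the count-based score comes out to zero, because the fast failures cancel the slow successes one for one, while the time-weighted score stays strongly positive (roughly 99990 task-seconds), since the failures account for only about 10 seconds of total execution time.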
