[Swift-devel] Imbalanced scheduling with coasters and multiple sites

Mihael Hategan hategan at mcs.anl.gov
Tue Apr 7 00:09:44 CDT 2009


On Mon, 2009-04-06 at 23:56 -0500, Michael Wilde wrote:
> The latest rev shows a similar failure on the surface, but I think 
> different patterns in the coaster logs.
> 
> The workflow is 40 simple "cat" jobs, data.txt to a default-mapped outfile.
> 
> This time 39 of 40 jobs ran on abe, and then the workflow lingered and 
> finally failed, with 39 ok, 1 failure.
> 
> All the logs for this run are in
>    /home/wilde/swift/lab/20090406-2330-72p9ale0
> 
> below that are dirs for the abe and qb logs coaster and gram logs.
> Abe had no gram log for this run.
> 
> I suspect this one is worth looking at.

Indeed. Can you paste your sites file?

There's some oddity there.





More information about the Swift-devel mailing list