[Swift-devel] Update on Teraport problems with wavlet workflow
Ben Clifford
benc at hawaga.org.uk
Wed Feb 28 12:24:13 CST 2007
do you have kickstart records for the nodes that *do* run?
On Wed, 28 Feb 2007, Tiberiu Stef-Praun wrote:
> Nothing gets generated in the individual job's temporary directories.
> There is no kickstart record.
> It would be really useful finding out the hostname of the node on
> which these jobs ran.
>
> Let me retry some more workflow runs.
>
> On 2/28/07, Ben Clifford <benc at hawaga.org.uk> wrote:
> >
> >
> > On Wed, 28 Feb 2007, Ben Clifford wrote:
> >
> > > do you have kickstart records for the jobs that are failing?
> >
> > if you do, then:
> >
> > > > Summary/Speculation: bad teraport node causes job to be declared as
> > > > done even though the execution failed
> >
> > this speculation can be investigated further by:
> >
> > finding a job that breaks. finding the node name from the kickstart
> > record. grepping all the kickstart records to find other kickstart records
> > for those jobs. looking to see if they all fail, or if some work and some
> > fail. then report back findings here.
> >
> > --
> >
>
>
>
More information about the Swift-devel
mailing list