[Swift-devel] Kickstart runs on localhost are failing

Mihael Hategan hategan at mcs.anl.gov
Sun Nov 4 21:15:39 CST 2007


On Sun, 2007-11-04 at 21:07 -0600, Mihael Hategan wrote:
> On Sun, 2007-11-04 at 19:37 -0600, Michael Wilde wrote:
> > I get job exceptions when I run with kickstart on localhost,
> > regardless of whether clustered or not.
> > 
> > The jobs seem to run (3x each) but fail each time. First time gets 
> > "Application exception: Missing argument jobdir", 2nd & 3rd get 
> > "Application exception: The cache already contains 
> > localhost:awf4-20071104-1843-ds8hn11a..."
> 
> That probably shouldn't happen unless you're trying to assign to the
> same variable twice. Does this work without kickstart?

Where "shouldn't" should be interpreted as "unless there's a bug", which
isn't necessarily unlikely.

> 
> > 
> > Clustered run is in run137, unclustered in run138
> > The latter log dir has a file swiftdata.find.out which lists all the 
> > files in my data dir (has a local/ branch at the top for localhost jobs).
> > 
> > Error in both cases is below.
> > 
> > Will try next doing kickstart in both ways via gram.
> > 
> > - Mike
> > 
> > 2007-11-04 18:47:40,946-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-cgqcmmji - Application exception: Missing argument jobdir 
> > for sys:element(rhost, wfdir, jobid, jobdir)
> > 2007-11-04 18:47:41,085-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-2-1194223436415) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-cgqcmmji-stderr.txt not found.
> > 2007-11-04 18:47:41,344-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-2-1194223436424) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-cgqcmmji-stdout.txt not found.
> > 2007-11-04 18:47:41,503-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-bgqcmmji - Application exception: Missing argument jobdir 
> > for sys:element(rhost, wfdir, jobid, jobdir)
> > 2007-11-04 18:47:41,553-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436458) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-bgqcmmji-stderr.txt not found.
> > 2007-11-04 18:47:41,638-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436467) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-bgqcmmji-stdout.txt not found.
> > 2007-11-04 18:47:41,882-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-agqcmmji - Application exception: Missing argument jobdir 
> > for sys:element(rhost, wfdir, jobid, jobdir)
> > 2007-11-04 18:47:41,987-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-3-1194223436500) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-agqcmmji-stderr.txt not found.
> > 2007-11-04 18:47:42,047-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-3-1194223436507) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-agqcmmji-stdout.txt not found.
> > 2007-11-04 18:51:18,439-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-dgqcmmji - Application exception: The cache already 
> > contains localhost:awf4-20071104-1843-ds8hn11a/shared/cf0000.angle.
> > 2007-11-04 18:51:18,628-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-2-1194223436543) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-dgqcmmji-stderr.txt not found.
> > 2007-11-04 18:51:18,762-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-2-1194223436550) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-dgqcmmji-stdout.txt not found.
> > 2007-11-04 18:51:25,976-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-egqcmmji - Application exception: The cache already 
> > contains localhost:awf4-20071104-1843-ds8hn11a/shared/of0002.angle.
> > 2007-11-04 18:51:26,401-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436585) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-egqcmmji-stderr.txt not found.
> > 2007-11-04 18:51:26,726-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436592) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-egqcmmji-stdout.txt not found.
> > 2007-11-04 18:51:28,040-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-fgqcmmji - Application exception: The cache already 
> > contains localhost:awf4-20071104-1843-ds8hn11a/shared/cf0001.angle.
> > 2007-11-04 18:51:28,492-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-3-1194223436627) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-fgqcmmji-stderr.txt not found.
> > 2007-11-04 18:51:28,816-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-3-1194223436634) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-fgqcmmji-stdout.txt not found.
> > 2007-11-04 18:54:44,088-0600 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
> > jobid=angle4-hgqcmmji - Application exception: The cache already 
> > contains localhost:awf4-20071104-1843-ds8hn11a/shared/of0002.angle.
> > 2007-11-04 18:54:44,440-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436670) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-hgqcmmji-stderr.txt not found.
> > 2007-11-04 18:54:44,652-0600 DEBUG TaskImpl Task(type=FILE_OPERATION, 
> > identity=urn:0-1-1194223436677) setting status to Failed 
> > org.globus.cog.abstraction.impl.file.FileNotFoundException: 
> > angle4-hgqcmmji-stdout.txt not found.
> > 2007-11-04 18:54:44,741-0600 DEBUG VDL2ExecutionContext Exception in angle4:
> > Exception in angle4:
> >          sys:exception @ vdl-int.k, line: 423
> >          at 
> > org.globus.cog.karajan.workflow.nodes.functions.KException.function(KException.java:29)
> > 2007-11-04 18:54:46,190-0600 INFO  ExecutionContext Detailed exception:
> > Exception in angle4:
> >          sys:exception @ vdl-int.k, line: 423
> >          at 
> > org.globus.cog.karajan.workflow.nodes.functions.KException.function(KException.java:29)
> > _______________________________________________
> > Swift-devel mailing list
> > Swift-devel at ci.uchicago.edu
> > http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
> > 
> 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
> 




More information about the Swift-devel mailing list