[Swift-devel] [Bug 186] New: File-not-found errors in swift log should be sent to stdout/err
bugzilla-daemon at mcs.anl.gov
Sun Mar 22 11:34:10 CDT 2009
https://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=186
Summary: File-not-found errors in swift log should be sent to stdout/err
Product: Swift
Version: unspecified
Platform: All
OS/Version: All
Status: NEW
Keywords: error-handling
Severity: normal
Priority: P2
Component: SwiftScript language
AssignedTo: benc at hawaga.org.uk
ReportedBy: wilde at mcs.anl.gov
When the files that a dataset is mapped to are not found, the details
of this error seem to be left buried in the .log file, and what shows up on
stdout/err is the much more cryptic error that the app did not produce an
expected output file. It's not yet clear to me whether swift even got to the
point of executing the app, though.
I get this on stdout/err:
--
Swift svn swift-r2724 (swift modified locally) cog-r2333
RunID: 20090322-1102-rveraq3f
Progress:
Failed to transfer wrapper log from t1-20090322-1102-rveraq3f/info/m on
localhost
Execution failed:
java.io.FileNotFoundException:
_concurrent/results-c6c862ba-4992-4726-b193-c92753858e0e-7 (No such file or
directory)
sur$
--
Yet the log is filled with errors, like these, which would have more
immediately told me what was wrong:
--
sur$ ./checklog
Errors found in swift log t1-20090322-1102-rveraq3f.log: ( -rw-rw-r-- 1 wilde
users 168177 Mar 22 11:02 t1-20090322-1102-rveraq3f.log ):
2009-03-22 11:02:29,988-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-15-1237737749283) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.rmsd
2009-03-22 11:02:29,997-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-6-1237737749285) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.rmsd
2009-03-22 11:02:30,000-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-7-1237737749288) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.log
2009-03-22 11:02:30,009-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-8-1237737749292) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.pdt
2009-03-22 11:02:30,012-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-15-1237737749297) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.rmsd
2009-03-22 11:02:30,015-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-9-1237737749303) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.rmsd
2009-03-22 11:02:30,018-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-5-1237737749300) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.pdt
2009-03-22 11:02:30,026-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1-1237737749305) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.log
2009-03-22 11:02:30,027-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-4-1237737749307) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.log
2009-03-22 11:02:30,030-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-6-1237737749309) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.rmsd
2009-03-22 11:02:30,031-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-12-1237737749290) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.rmsd
2009-03-22 11:02:30,041-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-14-1237737749315) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.pdt
2009-03-22 11:02:30,039-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-13-1237737749312) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.log
2009-03-22 11:02:30,041-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-10-1237737749321) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.log
2009-03-22 11:02:30,044-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-2-1237737749319) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.pdt
2009-03-22 11:02:30,047-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-15-1237737749326) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.rmsd
2009-03-22 11:02:30,049-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-3-1237737749323) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.rmsd
2009-03-22 11:02:30,053-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1-1237737749335) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.log
2009-03-22 11:02:30,053-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-5-1237737749333) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.pdt
2009-03-22 11:02:30,056-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-6-1237737749337) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.rmsd
2009-03-22 11:02:30,056-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-12-1237737749341) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.rmsd
2009-03-22 11:02:30,057-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-4-1237737749339) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.log
2009-03-22 11:02:30,060-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-8-1237737749350) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.pdt
2009-03-22 11:02:30,060-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-10-1237737749353) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.log
2009-03-22 11:02:30,060-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-7-1237737749330) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.log
2009-03-22 11:02:30,063-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-11-1237737749344) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.pdt
2009-03-22 11:02:30,064-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-2-1237737749355) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.pdt
2009-03-22 11:02:30,067-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-3-1237737749357) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.rmsd
2009-03-22 11:02:30,072-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-5-1237737749370) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.pdt
2009-03-22 11:02:30,073-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1-1237737749364) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.log
2009-03-22 11:02:30,073-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-12-1237737749368) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.rmsd
2009-03-22 11:02:30,076-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-9-1237737749379) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.rmsd
2009-03-22 11:02:30,076-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-13-1237737749366) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.log
2009-03-22 11:02:30,077-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-4-1237737749372) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/00/T1af7.0000.0000.log
2009-03-22 11:02:30,082-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-3-1237737749386) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.rmsd
2009-03-22 11:02:30,083-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-8-1237737749381) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.pdt
2009-03-22 11:02:30,084-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-7-1237737749391) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.log
2009-03-22 11:02:30,083-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1237737749398) setting status to Failed File not found:
/home/wilde/oops/swift/work/t1-20090322-1102-rveraq3f/jobs/m/analyze_round-m9us4b8j/stderr.txt
2009-03-22 11:02:30,085-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-9-1237737749400) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/01/T1af7.0000.0001.rmsd
2009-03-22 11:02:30,086-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-2-1237737749395) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/04/T1af7.0000.0004.pdt
2009-03-22 11:02:30,087-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-11-1237737749402) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.pdt
2009-03-22 11:02:30,090-0500 DEBUG TaskImpl Task(type=FILE_OPERATION,
identity=urn:0-7-0-1-1237737749414) setting status to Failed
org.globus.cog.abstraction.impl.file.FileNotFoundException:
analyze_round-m9us4b8j-stderr.txt not found.
2009-03-22 11:02:30,090-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-10-1237737749404) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.log
2009-03-22 11:02:30,091-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-13-1237737749406) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.log
2009-03-22 11:02:30,093-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-11-1237737749417) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/02/T1af7.0000.0002.pdt
2009-03-22 11:02:30,095-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-14-1237737749384) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.pdt
2009-03-22 11:02:30,095-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1237737749421) setting status to Failed File not found:
/home/wilde/oops/swift/work/t1-20090322-1102-rveraq3f/jobs/m/analyze_round-m9us4b8j/_concurrent/results-c6c862ba-4992-4726-b193-c92753858e0e-7
2009-03-22 11:02:30,097-0500 DEBUG TaskImpl Task(type=FILE_OPERATION,
identity=urn:0-7-0-1-1237737749426) setting status to Failed
org.globus.cog.abstraction.impl.file.FileNotFoundException:
analyze_round-m9us4b8j-_concurrent/results-c6c862ba-4992-4726-b193-c92753858e0e-7
not found.
2009-03-22 11:02:30,099-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-14-1237737749428) setting status to Failed File not found:
/gpfs/home/wilde/oops/swift/./out.n7b/T1af7/0000/00/03/T1af7.0000.0003.pdt
2009-03-22 11:02:30,105-0500 DEBUG TaskImpl Task(type=FILE_TRANSFER,
identity=urn:0-7-0-1-1237737749435) setting status to Failed File not found:
/home/wilde/oops/swift/work/t1-20090322-1102-rveraq3f/info/m/analyze_round-m9us4b8j-info
2009-03-22 11:02:30,107-0500 DEBUG TaskImpl Task(type=FILE_OPERATION,
identity=urn:0-7-0-1-1237737749438) setting status to Failed
org.globus.cog.abstraction.impl.file.FileNotFoundException:
analyze_round-m9us4b8j-info not found.
2009-03-22 11:02:30,108-0500 WARN vdl:transferwrapperlog Failed to transfer
wrapper log from t1-20090322-1102-rveraq3f/info/m on localhost
2009-03-22 11:02:30,108-0500 DEBUG vdl:transferwrapperlog Exception for wrapper
log failure from t1-20090322-1102-rveraq3f/info/m on localhost: File not found:
/home/wilde/oops/swift/work/t1-20090322-1102-rveraq3f/info/m/analyze_round-m9us4b8j-info
2009-03-22 11:02:30,113-0500 INFO vdl:execute END_FAILURE thread=0-7-0
tr=analyze_round
2009-03-22 11:02:30,117-0500 INFO SetFutureFault Failing
org.griphyn.vdl.mapping.RootDataNode identifier
tag:benc at ci.uchicago.edu,2008:swift:dataset:20090322-1102-3yoo96q3:720000000020
type file with no value at dataset=results (not closed) (mapping=false)
2009-03-22 11:02:30,121-0500 INFO SetFutureFault Failing
org.griphyn.vdl.mapping.RootDataNode identifier
tag:benc at ci.uchicago.edu,2008:swift:dataset:20090322-1102-3yoo96q3:720000000018
type SecSeq with no value at dataset=nsecseq (not closed) (mapping=false)
2009-03-22 11:02:30,140-0500 DEBUG Loader Swift finished with errors
sur$
--
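To illustrate the kind of summary that would have helped on stdout/err, here is a minimal Python sketch that collapses the repeated "File not found:" entries in a log like the one above into one line per missing path. It is not the checklog script used above (its source is not shown here) and not part of Swift; the names missing_files and report are hypothetical, and the only assumption about the log format is the "File not found: <path>" phrase visible in the excerpt.

```python
import re
import sys
from collections import Counter

# Matches the phrase seen in the log excerpt. \s* also spans a line break,
# so it works whether the path is on the same line or wrapped onto the next.
NOT_FOUND = re.compile(r"File not found:\s*(\S+)")


def missing_files(log_text):
    """Return a Counter mapping each missing path to its number of reports."""
    return Counter(NOT_FOUND.findall(log_text))


def report(log_text, out=sys.stderr):
    """Print one line per distinct missing path; return how many there were."""
    counts = missing_files(log_text)
    for path, n in sorted(counts.items()):
        print(f"File not found ({n}x): {path}", file=out)
    return len(counts)


if __name__ == "__main__" and len(sys.argv) > 1:
    # Hypothetical usage: python summarize_missing.py t1-...-rveraq3f.log
    with open(sys.argv[1]) as f:
        sys.exit(1 if report(f.read()) else 0)
```

Run against the log above, a summary of this shape would list each missing .rmsd, .pdt and .log input once, rather than leaving only the FileNotFoundException for the _concurrent/results file visible on the console.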
Possibly relevant, my properties were:
lazy.errors=true
wrapperlog.always.transfer=true
# remove all limits on job submit rates
throttle.submit=off
throttle.host.submit=off
throttle.score.job.factor=off
# set data transfer and data management rate limits very high
throttle.transfers=1000
throttle.file.operations=1000
# Keep the workflow work directories intact on the execution sites
sitedir.keep=true
# Don't retry any job failures (while we are debugging; for production, =2 is better)
execution.retries=0
sur$
--