[Swift-devel] Strange Problem with TG-UCANL
Veronika Nefedova
nefedova at mcs.anl.gov
Thu Oct 25 11:07:47 CDT 2007
If you are using kickstart - try to use this setting (on TG-UC):
gridlaunch="/home/nefedova/pegasus/src/tools/kickstart/kickstart" in
your site.xml. file ( replace the one you have with this one)
Nika
On Oct 25, 2007, at 10:50 AM, Andrew Robert Jamieson wrote:
> Any thoughts on why this would happen on a simple "hello world"
> (see below)
> Thanks,
> Andrew
>
>
> ********************
> andrewj at tg-viz-login1:~/CADGrid/Swifty/vdsk-0.3-dev/examples/vdsk>
> swift -debug -tc.file ~/CADGrid/Swifty/UCANL-tc.data -sites.file
> ~/.swift/sites.xml first.swift
> Recompilation suppressed.
> Using sites /home/andrewj/.swift/sites.xml
> Using tc.data: /home/andrewj/CADGrid/Swifty/UCANL-tc.data
> Swift v0.3-dev r1339
>
> Swift v0.3-dev r1339
>
> RunID: 20071025-1044-zo4kzfjg
> RunID: 20071025-1044-zo4kzfjg
> echo started
> START thread=0 tr=echo
> START host=UCANL - Initializing shared directory
> Task(type=FILE_OPERATION, identity=urn:0-1193327080146) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080146) setting
> status to Completed
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting
> status to Completed
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting
> status to Completed
> Task(type=FILE_OPERATION, identity=urn:0-1193327080156) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080156) setting
> status to Completed
> Task(type=FILE_OPERATION, identity=urn:0-1193327080158) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080158) setting
> status to Completed
> Task(type=FILE_OPERATION, identity=urn:0-1193327080160) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080160) setting
> status to Completed
> END host=UCANL - Done initializing shared directory
> THREAD_ASSOCIATION jobid=echo-0gj1k5ji thread=0 host=UCANL
> START jobid=echo-0gj1k5ji host=UCANL - Initializing directory
> structure
> START path= dir=first-20071025-1044-zo4kzfjg/shared - Creating
> directory structure
> Task(type=FILE_OPERATION, identity=urn:0-1193327080162) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080162) setting
> status to Completed
> END jobid=echo-0gj1k5ji - Done initializing directory structure
> START jobid=echo-0gj1k5ji - Staging in files
> END jobid=echo-0gj1k5ji - Staging in finished
> JOB_START jobid=echo-0gj1k5ji tr=echo arguments=[Hello, world!]
> tmpdir=first-20071025-1044-zo4kzfjg/echo-0gj1k5ji host=UCANL
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting
> status to Submitted
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting
> status to Active
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting
> status to Completed
> START jobid=echo-0gj1k5ji
> Task(type=FILE_OPERATION, identity=urn:0-1193327080166) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080166) setting
> status to Failed
> org.globus.cog.abstraction.impl.file.FileResourceException: Cannot
> delete /disks/scratchgpfs1/andrewj/first-20071025-1044-zo4kzfjg/
> status/echo-0gj1k5ji-success
> Task(type=FILE_OPERATION, identity=urn:0-1193327080168) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080168) setting
> status to Completed
> NO_STATUS_FILE jobid=echo-0gj1k5ji - Both status files are missing
> APPLICATION_EXCEPTION jobid=echo-0gj1k5ji - Application exception:
> No status file was found. Check the shared filesystem on UCANL
> sys:throw @ vdl-int.k, line: 96
> sys:else @ vdl-int.k, line: 94
> sys:if @ vdl-int.k, line: 82
> sys:try @ vdl-int.k, line: 70
> vdl:checkjobstatus @ vdl-int.k, line: 379
> sys:sequential @ vdl-int.k, line: 355
> sys:try @ vdl-int.k, line: 354
> task:allocatehost @ vdl-int.k, line: 336
> vdl:execute2 @ execute-default.k, line: 23
> sys:restartonerror @ execute-default.k, line: 21
> sys:sequential @ execute-default.k, line: 19
> sys:try @ execute-default.k, line: 18
> sys:if @ execute-default.k, line: 17
> sys:then @ execute-default.k, line: 16
> sys:if @ execute-default.k, line: 15
> vdl:execute @ first.kml, line: 16
> greeting @ first.kml, line: 43
> vdl:mainp @ first.kml, line: 42
> mainp @ vdl.k, line: 148
> vdl:mains @ first.kml, line: 41
> vdl:mains @ first.kml, line: 41
> rlog:restartlog @ first.kml, line: 39
> kernel:project @ first.kml, line: 2
> first-20071025-1044-zo4kzfjg
>
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting
> status to Failed Exception in getFile
> Task(type=FILE_OPERATION, identity=urn:0-1193327080173) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080173) setting
> status to Completed
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting
> status to Failed Exception in getFile
> Task(type=FILE_OPERATION, identity=urn:0-1193327080179) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080179) setting
> status to Completed
> THREAD_ASSOCIATION jobid=echo-1gj1k5ji thread=0 host=UCANL
> START jobid=echo-1gj1k5ji host=UCANL - Initializing directory
> structure
> END jobid=echo-1gj1k5ji - Done initializing directory structure
> START jobid=echo-1gj1k5ji - Staging in files
> END jobid=echo-1gj1k5ji - Staging in finished
> JOB_START jobid=echo-1gj1k5ji tr=echo arguments=[Hello, world!]
> tmpdir=first-20071025-1044-zo4kzfjg/echo-1gj1k5ji host=UCANL
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting
> status to Submitted
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting
> status to Active
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting
> status to Completed
> START jobid=echo-1gj1k5ji
> Task(type=FILE_OPERATION, identity=urn:0-1193327080185) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080185) setting
> status to Failed
> org.globus.cog.abstraction.impl.file.FileResourceException: Cannot
> delete /disks/scratchgpfs1/andrewj/first-20071025-1044-zo4kzfjg/
> status/echo-1gj1k5ji-success
> Task(type=FILE_OPERATION, identity=urn:0-1193327080187) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080187) setting
> status to Completed
> NO_STATUS_FILE jobid=echo-1gj1k5ji - Both status files are missing
> APPLICATION_EXCEPTION jobid=echo-1gj1k5ji - Application exception:
> No status file was found. Check the shared filesystem on UCANL
> sys:throw @ vdl-int.k, line: 96
> sys:else @ vdl-int.k, line: 94
> sys:if @ vdl-int.k, line: 82
> sys:try @ vdl-int.k, line: 70
> vdl:checkjobstatus @ vdl-int.k, line: 379
> sys:sequential @ vdl-int.k, line: 355
> sys:try @ vdl-int.k, line: 354
> task:allocatehost @ vdl-int.k, line: 336
> vdl:execute2 @ execute-default.k, line: 23
> sys:restartonerror @ execute-default.k, line: 21
> sys:sequential @ execute-default.k, line: 19
> sys:try @ execute-default.k, line: 18
> sys:if @ execute-default.k, line: 17
> sys:then @ execute-default.k, line: 16
> sys:if @ execute-default.k, line: 15
> vdl:execute @ first.kml, line: 16
> greeting @ first.kml, line: 43
> vdl:mainp @ first.kml, line: 42
> mainp @ vdl.k, line: 148
> vdl:mains @ first.kml, line: 41
> vdl:mains @ first.kml, line: 41
> rlog:restartlog @ first.kml, line: 39
> kernel:project @ first.kml, line: 2
> first-20071025-1044-zo4kzfjg
>
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting
> status to Failed Exception in getFile
> Task(type=FILE_OPERATION, identity=urn:0-1193327080192) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080192) setting
> status to Completed
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting
> status to Failed Exception in getFile
> Task(type=FILE_OPERATION, identity=urn:0-1193327080198) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080198) setting
> status to Completed
> THREAD_ASSOCIATION jobid=echo-2gj1k5ji thread=0 host=UCANL
> START jobid=echo-2gj1k5ji host=UCANL - Initializing directory
> structure
> END jobid=echo-2gj1k5ji - Done initializing directory structure
> START jobid=echo-2gj1k5ji - Staging in files
> END jobid=echo-2gj1k5ji - Staging in finished
> JOB_START jobid=echo-2gj1k5ji tr=echo arguments=[Hello, world!]
> tmpdir=first-20071025-1044-zo4kzfjg/echo-2gj1k5ji host=UCANL
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting
> status to Submitted
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting
> status to Active
> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting
> status to Completed
> START jobid=echo-2gj1k5ji
> Task(type=FILE_OPERATION, identity=urn:0-1193327080204) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1193327080204) setting
> status to Completed
> SUCCESS jobid=echo-2gj1k5ji - Success file found
> JOB_END jobid=echo-2gj1k5ji
> START jobid=echo-2gj1k5ji - Staging out files
> FILE_STAGE_OUT_START srcname=hello.txt srcdir=first-20071025-1044-
> zo4kzfjg/shared/ srchost=UCANL destdir= desthost=localhost
> provider=file
> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080206) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080206) setting
> status to Completed
> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting
> status to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting
> status to Active
> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting
> status to Completed
> FILE_STAGE_OUT_END srcname=hello.txt srcdir=first-20071025-1044-
> zo4kzfjg/shared/ srchost=UCANL destdir= desthost=localhost
> provider=file
> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080213) setting
> status to Active
> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080213) setting
> status to Completed
> END jobid=echo-2gj1k5ji - Staging out finished
> echo completed
> END_SUCCESS thread=0 tr=echo
> START cleanups=[[first-20071025-1044-zo4kzfjg, UCANL]]
> START dir=first-20071025-1044-zo4kzfjg host=UCANL
> Task(type=JOB_SUBMISSION, identity=urn:0-1-1193327080216) setting
> status to Submitted
> Task(type=JOB_SUBMISSION, identity=urn:0-1-1193327080216) setting
> status to Completed
> END dir=first-20071025-1044-zo4kzfjg host=UCANL
> Swift finished - workflow had no errors
>
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>
More information about the Swift-devel
mailing list