[Swift-devel] Strange Problem with TG-UCANL
Andrew Robert Jamieson
andrewj at uchicago.edu
Thu Oct 25 12:46:40 CDT 2007
Thanks for the suggestion, unfortunately I am not using kickstart.
On Thu, 25 Oct 2007, Veronika Nefedova wrote:
> If you are using kickstart - try to use this setting (on TG-UC):
> gridlaunch="/home/nefedova/pegasus/src/tools/kickstart/kickstart" in your
> site.xml. file ( replace the one you have with this one)
>
> Nika
>
> On Oct 25, 2007, at 10:50 AM, Andrew Robert Jamieson wrote:
>
>> Any thoughts on why this would happen on a simple "hello world"
>> (see below)
>> Thanks,
>> Andrew
>>
>>
>> ********************
>> andrewj at tg-viz-login1:~/CADGrid/Swifty/vdsk-0.3-dev/examples/vdsk> swift
>> -debug -tc.file ~/CADGrid/Swifty/UCANL-tc.data -sites.file
>> ~/.swift/sites.xml first.swift
>> Recompilation suppressed.
>> Using sites /home/andrewj/.swift/sites.xml
>> Using tc.data: /home/andrewj/CADGrid/Swifty/UCANL-tc.data
>> Swift v0.3-dev r1339
>>
>> Swift v0.3-dev r1339
>>
>> RunID: 20071025-1044-zo4kzfjg
>> RunID: 20071025-1044-zo4kzfjg
>> echo started
>> START thread=0 tr=echo
>> START host=UCANL - Initializing shared directory
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080146) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080146) setting status to
>> Completed
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080149) setting status to
>> Completed
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080153) setting status to
>> Completed
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080156) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080156) setting status to
>> Completed
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080158) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080158) setting status to
>> Completed
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080160) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080160) setting status to
>> Completed
>> END host=UCANL - Done initializing shared directory
>> THREAD_ASSOCIATION jobid=echo-0gj1k5ji thread=0 host=UCANL
>> START jobid=echo-0gj1k5ji host=UCANL - Initializing directory structure
>> START path= dir=first-20071025-1044-zo4kzfjg/shared - Creating directory
>> structure
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080162) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080162) setting status to
>> Completed
>> END jobid=echo-0gj1k5ji - Done initializing directory structure
>> START jobid=echo-0gj1k5ji - Staging in files
>> END jobid=echo-0gj1k5ji - Staging in finished
>> JOB_START jobid=echo-0gj1k5ji tr=echo arguments=[Hello, world!]
>> tmpdir=first-20071025-1044-zo4kzfjg/echo-0gj1k5ji host=UCANL
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting status to
>> Submitted
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting status to
>> Active
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080164) setting status to
>> Completed
>> START jobid=echo-0gj1k5ji
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080166) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080166) setting status to
>> Failed org.globus.cog.abstraction.impl.file.FileResourceException: Cannot
>> delete /disks/scratchgpfs1/andrewj/first-20071025-1044-zo4kzfjg/
>> status/echo-0gj1k5ji-success
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080168) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080168) setting status to
>> Completed
>> NO_STATUS_FILE jobid=echo-0gj1k5ji - Both status files are missing
>> APPLICATION_EXCEPTION jobid=echo-0gj1k5ji - Application exception: No
>> status file was found. Check the shared filesystem on UCANL
>> sys:throw @ vdl-int.k, line: 96
>> sys:else @ vdl-int.k, line: 94
>> sys:if @ vdl-int.k, line: 82
>> sys:try @ vdl-int.k, line: 70
>> vdl:checkjobstatus @ vdl-int.k, line: 379
>> sys:sequential @ vdl-int.k, line: 355
>> sys:try @ vdl-int.k, line: 354
>> task:allocatehost @ vdl-int.k, line: 336
>> vdl:execute2 @ execute-default.k, line: 23
>> sys:restartonerror @ execute-default.k, line: 21
>> sys:sequential @ execute-default.k, line: 19
>> sys:try @ execute-default.k, line: 18
>> sys:if @ execute-default.k, line: 17
>> sys:then @ execute-default.k, line: 16
>> sys:if @ execute-default.k, line: 15
>> vdl:execute @ first.kml, line: 16
>> greeting @ first.kml, line: 43
>> vdl:mainp @ first.kml, line: 42
>> mainp @ vdl.k, line: 148
>> vdl:mains @ first.kml, line: 41
>> vdl:mains @ first.kml, line: 41
>> rlog:restartlog @ first.kml, line: 39
>> kernel:project @ first.kml, line: 2
>> first-20071025-1044-zo4kzfjg
>>
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080170) setting status to
>> Failed Exception in getFile
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080173) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080173) setting status to
>> Completed
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080176) setting status to
>> Failed Exception in getFile
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080179) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080179) setting status to
>> Completed
>> THREAD_ASSOCIATION jobid=echo-1gj1k5ji thread=0 host=UCANL
>> START jobid=echo-1gj1k5ji host=UCANL - Initializing directory structure
>> END jobid=echo-1gj1k5ji - Done initializing directory structure
>> START jobid=echo-1gj1k5ji - Staging in files
>> END jobid=echo-1gj1k5ji - Staging in finished
>> JOB_START jobid=echo-1gj1k5ji tr=echo arguments=[Hello, world!]
>> tmpdir=first-20071025-1044-zo4kzfjg/echo-1gj1k5ji host=UCANL
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting status to
>> Submitted
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting status to
>> Active
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080183) setting status to
>> Completed
>> START jobid=echo-1gj1k5ji
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080185) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080185) setting status to
>> Failed org.globus.cog.abstraction.impl.file.FileResourceException: Cannot
>> delete /disks/scratchgpfs1/andrewj/first-20071025-1044-zo4kzfjg/
>> status/echo-1gj1k5ji-success
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080187) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080187) setting status to
>> Completed
>> NO_STATUS_FILE jobid=echo-1gj1k5ji - Both status files are missing
>> APPLICATION_EXCEPTION jobid=echo-1gj1k5ji - Application exception: No
>> status file was found. Check the shared filesystem on UCANL
>> sys:throw @ vdl-int.k, line: 96
>> sys:else @ vdl-int.k, line: 94
>> sys:if @ vdl-int.k, line: 82
>> sys:try @ vdl-int.k, line: 70
>> vdl:checkjobstatus @ vdl-int.k, line: 379
>> sys:sequential @ vdl-int.k, line: 355
>> sys:try @ vdl-int.k, line: 354
>> task:allocatehost @ vdl-int.k, line: 336
>> vdl:execute2 @ execute-default.k, line: 23
>> sys:restartonerror @ execute-default.k, line: 21
>> sys:sequential @ execute-default.k, line: 19
>> sys:try @ execute-default.k, line: 18
>> sys:if @ execute-default.k, line: 17
>> sys:then @ execute-default.k, line: 16
>> sys:if @ execute-default.k, line: 15
>> vdl:execute @ first.kml, line: 16
>> greeting @ first.kml, line: 43
>> vdl:mainp @ first.kml, line: 42
>> mainp @ vdl.k, line: 148
>> vdl:mains @ first.kml, line: 41
>> vdl:mains @ first.kml, line: 41
>> rlog:restartlog @ first.kml, line: 39
>> kernel:project @ first.kml, line: 2
>> first-20071025-1044-zo4kzfjg
>>
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080189) setting status to
>> Failed Exception in getFile
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080192) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080192) setting status to
>> Completed
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1193327080195) setting status to
>> Failed Exception in getFile
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080198) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080198) setting status to
>> Completed
>> THREAD_ASSOCIATION jobid=echo-2gj1k5ji thread=0 host=UCANL
>> START jobid=echo-2gj1k5ji host=UCANL - Initializing directory structure
>> END jobid=echo-2gj1k5ji - Done initializing directory structure
>> START jobid=echo-2gj1k5ji - Staging in files
>> END jobid=echo-2gj1k5ji - Staging in finished
>> JOB_START jobid=echo-2gj1k5ji tr=echo arguments=[Hello, world!]
>> tmpdir=first-20071025-1044-zo4kzfjg/echo-2gj1k5ji host=UCANL
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting status to
>> Submitted
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting status to
>> Active
>> Task(type=JOB_SUBMISSION, identity=urn:0-1193327080202) setting status to
>> Completed
>> START jobid=echo-2gj1k5ji
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080204) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1193327080204) setting status to
>> Completed
>> SUCCESS jobid=echo-2gj1k5ji - Success file found
>> JOB_END jobid=echo-2gj1k5ji
>> START jobid=echo-2gj1k5ji - Staging out files
>> FILE_STAGE_OUT_START srcname=hello.txt srcdir=first-20071025-1044-
>> zo4kzfjg/shared/ srchost=UCANL destdir= desthost=localhost provider=file
>> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080206) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080206) setting status to
>> Completed
>> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting status to
>> Submitted
>> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting status to
>> Active
>> Task(type=FILE_TRANSFER, identity=urn:0-1-1193327080209) setting status to
>> Completed
>> FILE_STAGE_OUT_END srcname=hello.txt srcdir=first-20071025-1044-
>> zo4kzfjg/shared/ srchost=UCANL destdir= desthost=localhost provider=file
>> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080213) setting status to
>> Active
>> Task(type=FILE_OPERATION, identity=urn:0-1-1193327080213) setting status to
>> Completed
>> END jobid=echo-2gj1k5ji - Staging out finished
>> echo completed
>> END_SUCCESS thread=0 tr=echo
>> START cleanups=[[first-20071025-1044-zo4kzfjg, UCANL]]
>> START dir=first-20071025-1044-zo4kzfjg host=UCANL
>> Task(type=JOB_SUBMISSION, identity=urn:0-1-1193327080216) setting status to
>> Submitted
>> Task(type=JOB_SUBMISSION, identity=urn:0-1-1193327080216) setting status to
>> Completed
>> END dir=first-20071025-1044-zo4kzfjg host=UCANL
>> Swift finished - workflow had no errors
>>
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>>
>
>
More information about the Swift-devel
mailing list