[Swift-user] Exitcode file not found

David Kelly dk0966 at cs.ship.edu
Fri Aug 20 11:04:30 CDT 2010


Hello,

While running Mike's MODIS demo on PADS with pbs and coasters, I receive the
following error:

Worker task failed:
org.globus.cog.abstraction.impl.scheduler.common.ProcessException: Exitcode
file not found 5 queue polls after the job was reported done
    at
org.globus.cog.abstraction.impl.scheduler.common.Job.close(Job.java:66)
    at
org.globus.cog.abstraction.impl.scheduler.common.Job.setState(Job.java:177)
    at
org.globus.cog.abstraction.impl.scheduler.pbs.QueuePoller.processStdout(QueuePoller.java:126)
    at
org.globus.cog.abstraction.impl.scheduler.common.AbstractQueuePoller.pollQueue(AbstractQueuePoller.java:169)
    at
org.globus.cog.abstraction.impl.scheduler.common.AbstractQueuePoller.run(AbstractQueuePoller.java:82)
    at java.lang.Thread.run(Thread.java:619)

I also receive errors relating to qdel:

Canceling job
Failed to shut down block
org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: Failed
to cancel task. qdel returned with an exit code of 1
    at
org.globus.cog.abstraction.impl.scheduler.common.AbstractExecutor.cancel(AbstractExecutor.java:159)
    at
org.globus.cog.abstraction.impl.scheduler.common.AbstractJobSubmissionTaskHandler.cancel(AbstractJobSubmissionTaskHandler.java:85)
    at
org.globus.cog.abstraction.impl.common.AbstractTaskHandler.cancel(AbstractTaskHandler.java:70)
    at
org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.cancel(ExecutionTaskHandler.java:101)
    at
org.globus.cog.abstraction.impl.common.task.ExecutionTaskHandler.cancel(ExecutionTaskHandler.java:90)
    at
org.globus.cog.abstraction.coaster.service.job.manager.BlockTaskSubmitter.cancel(BlockTaskSubmitter.java:44)
    at
org.globus.cog.abstraction.coaster.service.job.manager.Block.forceShutdown(Block.java:293)
    at
org.globus.cog.abstraction.coaster.service.job.manager.Block.shutdown(Block.java:274)
    at
org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.shutdownBlocks(BlockQueueProcessor.java:518)
    at
org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.shutdown(BlockQueueProcessor.java:510)
    at
org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.shutdown(JobQueue.java:108)
    at
org.globus.cog.abstraction.coaster.service.CoasterService.shutdown(CoasterService.java:249)
    at
org.globus.cog.abstraction.coaster.service.ServiceShutdownHandler.requestComplete(ServiceShutdownHandler.java:28)
    at
org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
    at
org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:387)
    at
org.globus.cog.karajan.workflow.service.channels.AbstractPipedChannel.actualSend(AbstractPipedChannel.java:86)
    at
org.globus.cog.karajan.workflow.service.channels.AbstractPipedChannel$Sender.run(AbstractPipedChannel.java:115)
Canceling job

Checking through the mailing list archives, I found an instance where this
was happening when the work directory was /var/tmp and not consistent across
all nodes. The work directory in my configuration is /home/davidk/swiftwork,
so I'm not sure what's causing it. Attached are the sites.xml, tc.data,
swift.properties and the script I'm using. The full log can be found in
/home/davidk/modis/run.0019.

Thanks,
David
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20100820/f6f0154b/attachment.html>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: modis.swift
Type: application/octet-stream
Size: 1270 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20100820/f6f0154b/attachment.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: sites.xml
Type: text/xml
Size: 730 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20100820/f6f0154b/attachment.xml>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: swift.properties
Type: application/octet-stream
Size: 11600 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20100820/f6f0154b/attachment-0001.obj>
-------------- next part --------------
A non-text attachment was scrubbed...
Name: tc.data
Type: application/octet-stream
Size: 2026 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/swift-user/attachments/20100820/f6f0154b/attachment-0002.obj>


More information about the Swift-user mailing list