[Swift-devel] swift error ( gridftp problem)

Michael Wilde wilde at mcs.anl.gov
Wed Jul 1 07:00:07 CDT 2009


Yi, I don't have an answer for you, but this certainly seems to be a
problem at the GridFTP level, not a Swift problem.

Do you have GLOBUS_TCP_PORT_RANGE and GLOBUS_TCP_SOURCE_RANGE set in
your client environment (i.e., on the "local computer")?
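
A minimal sketch of setting and checking these two variables on the
client, assuming the "min,max" format that the Globus Toolkit documents
for them. The range 50000,51000 is a made-up example, and the helper
parse_port_range is hypothetical; whatever range is chosen also has to
be allowed through the firewalls involved.

```shell
# Hypothetical port window; pick one your firewall actually permits.
GLOBUS_TCP_PORT_RANGE=50000,51000
GLOBUS_TCP_SOURCE_RANGE=50000,51000
export GLOBUS_TCP_PORT_RANGE GLOBUS_TCP_SOURCE_RANGE

# Print "min max" for a "min,max" range string; fail on malformed input.
parse_port_range() {
    case "$1" in
        *[!0-9,]*|*,*,*|,*|*,|'') return 1 ;;  # bad chars, extra or dangling commas, empty
        *,*) ;;                                 # exactly one comma: looks fine
        *) return 1 ;;                          # no comma at all
    esac
    min=${1%,*}
    max=${1#*,}
    [ "$min" -le "$max" ] || return 1
    echo "$min $max"
}

# Report what the current shell would hand to the Globus client tools.
for name in GLOBUS_TCP_PORT_RANGE GLOBUS_TCP_SOURCE_RANGE; do
    eval "value=\${$name}"
    if range=$(parse_port_range "$value"); then
        echo "$name covers ports $range"
    else
        echo "$name is unset or malformed: '$value'"
    fi
done
```

Run this in the same shell that launches swift or globus-url-copy, so
the exported values are inherited by those commands.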

From that local computer, with an ordinary (e.g., DOEGrids or NCSA)
certificate, can you access files on, for example, TeraPort?
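
One way to phrase that check as a command, assuming the standard
globus-url-copy client and a valid proxy. The host gridftp.example.org
and both paths are placeholders, not TeraPort's actual endpoint, and
the helper gridftp_fetch_cmd is hypothetical. It only prints the
command, so nothing is transferred until you run the printed line
yourself.

```shell
# Print (without running) a globus-url-copy command that downloads one
# remote file over GridFTP to a local path.
# $1 = GridFTP host (placeholder), $2 = absolute remote path,
# $3 = absolute local path
gridftp_fetch_cmd() {
    printf 'globus-url-copy gsiftp://%s%s file://%s\n' "$1" "$2" "$3"
}

# Placeholder host and paths; substitute a site that accepts your certificate.
gridftp_fetch_cmd gridftp.example.org /etc/hostname /tmp/gridftp-test.txt
```

If a transfer like this works against a known-good site while the EC2
transfer still stalls, that would suggest the problem is on the EC2
side rather than in the client setup.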

- Mike


On 7/1/09 4:33 AM, yizhu wrote:
> Hi,
> 
> I have a problem when trying to run Swift against Amazon EC2 from Swift
> on my local computer.
> 
> The EC2 side is configured as a PBS cluster with Globus installed, with
> one head node and several worker nodes that share the /home/ directory
> via NFS. I've used simpleCA to create credentials for both the head node
> (host certificate) and the user (user certificate).
> 
> After getting simpleCA working, I finally got rid of the "Authentication
> Failure" when running Swift, but a new problem occurs: it gets stuck on
> "Progress:  Initializing site shared directory:1" and finally fails
> after several tries. After that, I checked the Swift work directory and
> found that a new directory had been created containing a 0-byte file,
> "_swiftwrap".
> 
> I also tried running globus-url-copy on the client side. It failed, with
> the named file created at the remote site but with a size of 0 bytes. It
> seems that GridFTP can successfully create the directory and the file
> name, but cannot actually transfer the data.
> 
> For the firewall settings on EC2, I opened tcp/udp 2119 (gram2),
> tcp/udp 2811 (gridftp), tcp/udp 8443 (gram4), plus ssh, https, and http.
> 
> 
> -Yi
> [1]  Swift failed
> -bash-3.2$ swift -tc.file ../tc.test.data -sites.file ../sites.test.xml 
> first.swift
> Swift 0.9 swift-r2860 cog-r2388
> 
> RunID: 20090701-0344-zn2a66ub
> Progress:
> Progress:  Initializing site shared directory:1
> Progress:  Initializing site shared directory:1
> Progress:  Initializing site shared directory:1
> Progress:  Initializing site shared directory:1
> Progress:  Initializing site shared directory:1
> Progress:  Failed:1
> Execution failed:
>     Could not initialize shared directory on ec2_basecluster
> Caused by:
>     Reply wait timeout. (error code 4)
> -bash-3.2$
> 
> [2]  Grid-ftp-failed
> -bash-3.2$ globus-url-copy file:////home/yizhu/firstswift/hello.txt 
> gsiftp://ec2-174-129-90-225.compute-1.amazonaws.com/rec_data.txt
> -bash-3.2$ globus-url-copy file:////home/yizhu/firstswift/hello.txt 
> gsiftp://ec2-174-129-90-225.compute-1.amazonaws.com:2811/home/torqueuser/rec_data.txt 
> 
> GlobusUrlCopy error: UrlCopy transfer failed. [Caused by: Server refused 
> performing the request. Custom message:  (error code 1) [Nested 
> exception message:  Custom message: Unexpected reply: 500-Command 
> failed. : globus_gridftp_server_file.c:globus_l_gfs_file_recv:1770:
> 500-globus_l_gfs_file_open failed.
> 500-globus_gridftp_server_file.c:globus_l_gfs_file_open:1694:
> 500-globus_xio_register_open failed.
> 500-globus_xio_file_driver.c:globus_l_xio_file_open:438:
> 500-Unable to open file /home/torqueuser/home/torqueuser/rec_data.txt
> 500-globus_xio_file_driver.c:globus_l_xio_file_open:381:
> 500-System error in open: No such file or directory
> 500-globus_xio: A system call failed: No such file or directory
> 500 End.]]
> -bash-3.2$
> -bash-3.2$
> 
> 
> [3]-bash-3.2$ cat tc.test.data
> 
> ...
> ...
> 
> ec2_basecluster  echo            /bin/echo       INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  cat             /bin/cat        INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  ls              /bin/ls         INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  grep            /bin/grep       INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  sort            /bin/sort       INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  paste           /bin/paste      INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  wc              /bin/wc         INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  touch           /bin/touch      INSTALLED 
> INTEL32::LINUX  null
> ec2_basecluster  sleep           /bin/sleep      INSTALLED 
> INTEL32::LINUX  null
> 
> ...
> ...
> 
> [4] -bash-3.2$ cat sites.test.xml
> <config>
> 
> ...
> 
> 
>   <pool handle="ec2_basecluster">
>     <gridftp url="gsiftp://ec2-174-129-90-225.compute-1.amazonaws.com" />
>     <jobmanager universe="vanilla" 
> url="ec2-174-129-90-225.compute-1.amazonaws.com/jobmanager-pbs" 
> major="2" />
>     <workdirectory >/home/torqueuser</workdirectory>
>   </pool>
> ...
> </config>
> 
> [5] debug version of swift run
> -bash-3.2$ swift -tc.file ../tc.test.data -sites.file ../sites.test.xml 
> first.swift -debug
> Max heap: 268435456
> kmlversion is >85d4b03e-7b73-49b7-81aa-096255181491<
> build version is >85d4b03e-7b73-49b7-81aa-096255181491<
> Recompilation suppressed.
> Stack dump:
> Level 1
> [iA = 0, iB = 0, bA = false, bB = false]
>     vdl:instanceconfig = Swift configuration []
>     vdl:operation = run
>     vds.home = /home/yizhu/swift-0.9/bin/..
> 
> 
> Using sites file: ../sites.test.xml
> Using tc.data: ../tc.test.data
> Setting resources to: {ec2_basecluster=ec2_basecluster}
> Swift 0.9 swift-r2860 cog-r2388
> 
> Swift 0.9 swift-r2860 cog-r2388
> 
> RUNID id=tag:benc at ci.uchicago.edu,2007:swift:run:20090701-0348-vrb1yxl6
> RunID: 20090701-0348-vrb1yxl6
> closed org.griphyn.vdl.mapping.RootDataNode identifier 
> tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000001 
> type string value=Hello, world! dataset=unnamed SwiftScript value (closed)
> ROOTPATH 
> dataset=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000001 
> path=$
> VALUE 
> dataset=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000001 
> VALUE=Hello, world!
> NEW 
> id=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000001 
> 
> Found mapped data org.griphyn.vdl.mapping.RootDataNode identifier 
> tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000002 
> type messagefile with no value at dataset=outfile (not closed).$
> NEW 
> id=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000002 
> 
> Progress:
> PROCEDURE line=3 thread=0 name=greeting
> PARAM thread=0 direction=output variable=t 
> provenanceid=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000002 
> 
> closed org.griphyn.vdl.mapping.RootDataNode identifier 
> tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000003 
> type string value=hello.txt dataset=unnamed SwiftScript value (closed)
> ROOTPATH 
> dataset=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000003 
> path=$
> VALUE 
> dataset=tag:benc at ci.uchicago.edu,2008:swift:dataset:20090701-0348-xecqa0zc:720000000003 
> VALUE=hello.txt
> START thread=0 tr=echo
> Sorted: [ec2_basecluster:0.000(1.000):0/1 overload: 0]
> Rand: 0.8176156212454151, sum: 1.0
> Next contact: ec2_basecluster:0.000(1.000):0/1 overload: 0
> START host=ec2_basecluster - Initializing shared directory
> multiplyScore(ec2_basecluster:0.000(1.000):1/1 overload: 0, -0.01)
> Old score: 0.000, new score: -0.010
> No global submit throttle set. Using default (100)
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105282) setting status 
> to Submitting
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105282) setting status 
> to Submitted
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105282) setting status 
> to Active
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105282) setting status 
> to Completed
> multiplyScore(ec2_basecluster:-0.010(0.994):1/1 overload: 0, 0.01)
> Old score: -0.010, new score: 0.000
> multiplyScore(ec2_basecluster:0.000(1.000):1/1 overload: 0, 0.1)
> Old score: 0.000, new score: 0.100
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105282) Completed. 
> Waiting: 0, Running: 0. Heap size: 64M, Heap free: 30M, Max heap: 256M
> multiplyScore(ec2_basecluster:0.100(1.060):1/1 overload: 0, -0.2)
> Old score: 0.100, new score: -0.100
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105285) setting status 
> to Submitting
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105285) setting status 
> to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105285) setting status 
> to Active
> 
> 
> 
> 
> 
> 
> 
> 
> Progress:  Initializing site shared directory:1
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105285) setting status 
> to Failed null
> multiplyScore(ec2_basecluster:-0.100(0.943):1/1 overload: 0, -0.5)
> Old score: -0.100, new score: -0.600
> Releasing contact 2
> commitDelayedScore(ec2_basecluster:-0.600(0.705):0/1 overload: 0, 0.1
> Sorted: [ec2_basecluster:-0.500(0.747):0/1 overload: 0]
> Rand: 0.4103224563240889, sum: 1.0
> Next contact: ec2_basecluster:-0.500(0.747):0/1 overload: 0
> Progress:  Initializing site shared directory:1
> START host=ec2_basecluster - Initializing shared directory
> multiplyScore(ec2_basecluster:-0.500(0.747):1/1 overload: -140, -0.01)
> Old score: -0.500, new score: -0.510
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105288) setting status 
> to Submitting
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105288) setting status 
> to Submitted
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105288) setting status 
> to Active
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105288) setting status 
> to Completed
> multiplyScore(ec2_basecluster:-0.510(0.742):1/1 overload: 0, 0.01)
> Old score: -0.510, new score: -0.500
> multiplyScore(ec2_basecluster:-0.500(0.747):1/1 overload: 0, 0.1)
> Old score: -0.500, new score: -0.400
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105288) Completed. 
> Waiting: 0, Running: 0. Heap size: 64M, Heap free: 28M, Max heap: 256M
> multiplyScore(ec2_basecluster:-0.400(0.791):1/1 overload: 0, -0.2)
> Old score: -0.400, new score: -0.600
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105291) setting status 
> to Submitting
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105291) setting status 
> to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105291) setting status 
> to Active
> 
> 
> Progress:  Initializing site shared directory:1
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105291) setting status 
> to Failed null
> multiplyScore(ec2_basecluster:-0.600(0.705):1/1 overload: 0, -0.5)
> Old score: -0.600, new score: -1.100
> Releasing contact 3
> commitDelayedScore(ec2_basecluster:-1.100(0.530):0/1 overload: 0, 0.1
> Sorted: [ec2_basecluster:-1.000(0.561):0/1 overload: 0]
> Rand: 0.653323366777857, sum: 1.0
> Next contact: ec2_basecluster:-1.000(0.561):0/1 overload: 0
> Progress:  Initializing site shared directory:1
> START host=ec2_basecluster - Initializing shared directory
> multiplyScore(ec2_basecluster:-1.000(0.561):1/1 overload: -199, -0.01)
> Old score: -1.000, new score: -1.010
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105294) setting status 
> to Submitting
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105294) setting status 
> to Submitted
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105294) setting status 
> to Active
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105294) setting status 
> to Completed
> multiplyScore(ec2_basecluster:-1.010(0.557):1/1 overload: 0, 0.01)
> Old score: -1.010, new score: -1.000
> multiplyScore(ec2_basecluster:-1.000(0.561):1/1 overload: 0, 0.1)
> Old score: -1.000, new score: -0.900
> Task(type=FILE_OPERATION, identity=urn:0-1-1246438105294) Completed. 
> Waiting: 0, Running: 0. Heap size: 64M, Heap free: 27M, Max heap: 256M
> multiplyScore(ec2_basecluster:-0.900(0.593):1/1 overload: 0, -0.2)
> Old score: -0.900, new score: -1.100
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105297) setting status 
> to Submitting
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105297) setting status 
> to Submitted
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105297) setting status 
> to Active
> Progress:  Initializing site shared directory:1
> Task(type=FILE_TRANSFER, identity=urn:0-1-1246438105297) setting status 
> to Failed null
> multiplyScore(ec2_basecluster:-1.100(0.530):1/1 overload: 0, -0.5)
> Old score: -1.100, new score: -1.600
> Releasing contact 4
> commitDelayedScore(ec2_basecluster:-1.600(0.403):0/1 overload: 0, 0.1
> END_FAILURE thread=0 tr=echo
> Progress:  Failed:1
> Could not initialize shared directory on ec2_basecluster
> Could not initialize shared directory on ec2_basecluster
> Caused by: null
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.KException.function(KException.java:29) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:45) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.childCompleted(AbstractSequentialWithArguments.java:192) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.notificationEvent(Sequential.java:33) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:332)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:125)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:99) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.fireNotificationEvent(FlowNode.java:176) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:296)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:46) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:51) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:27) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.executeChildren(AbstractFunction.java:40) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:233)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:278)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.controlEvent(FlowNode.java:391) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:329)
>     at 
> org.globus.cog.karajan.workflow.FlowElementWrapper.event(FlowElementWrapper.java:227) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:125)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:99) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.EventWorker.run(EventWorker.java:69)
> Caused by: null
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.cog.karajan.workflow.events.FailureNotificationEvent.<init>(FailureNotificationEvent.java:36) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.FailureNotificationEvent.<init>(FailureNotificationEvent.java:42) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.failImmediately(FlowNode.java:151) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.grid.AbstractGridNode.taskFailed(AbstractGridNode.java:314) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.grid.AbstractGridNode.statusChanged(AbstractGridNode.java:276) 
> 
>     at 
> org.globus.cog.karajan.scheduler.AbstractScheduler.fireJobStatusChangeEvent(AbstractScheduler.java:168) 
> 
>     at 
> org.globus.cog.karajan.scheduler.LateBindingScheduler.statusChanged(LateBindingScheduler.java:656) 
> 
>     at 
> org.globus.cog.karajan.scheduler.WeightedHostScoreScheduler.statusChanged(WeightedHostScoreScheduler.java:421) 
> 
>     at 
> org.griphyn.vdl.karajan.VDSAdaptiveScheduler.statusChanged(VDSAdaptiveScheduler.java:410) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.task.TaskImpl.notifyListeners(TaskImpl.java:236) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.task.TaskImpl.setStatus(TaskImpl.java:224) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.AbstractDelegatedTaskHandler.failTask(AbstractDelegatedTaskHandler.java:54) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.transferFailed(DelegatedFileTransferHandler.java:581) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.run(DelegatedFileTransferHandler.java:505) 
> 
>     at java.lang.Thread.run(Thread.java:595)
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
>     at 
> org.globus.cog.abstraction.impl.file.ftp.AbstractFTPFileResource.translateException(AbstractFTPFileResource.java:44) 
> 
>     at 
> org.globus.cog.abstraction.impl.file.ftp.AbstractFTPFileResource.translateException(AbstractFTPFileResource.java:33) 
> 
>     at 
> org.globus.cog.abstraction.impl.file.gridftp.old.FileResourceImpl.putFile(FileResourceImpl.java:430) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.doDestination(DelegatedFileTransferHandler.java:355) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.CachingDelegatedFileTransferHandler.doDestination(CachingDelegatedFileTransferHandler.java:47) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.run(DelegatedFileTransferHandler.java:492) 
> 
>     ... 1 more
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.ftp.vanilla.FTPServerFacade$LocalControlChannel.waitFor(FTPServerFacade.java:511) 
> 
>     at org.globus.ftp.vanilla.TransferMonitor.run(TransferMonitor.java:129)
>     ... 1 more
> Execution failed:
>     Could not initialize shared directory on ec2_basecluster
> Caused by:
>     Reply wait timeout. (error code 4)
> Detailed exception:
> Could not initialize shared directory on ec2_basecluster
> Caused by: null
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.KException.function(KException.java:29) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:45) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.AbstractSequentialWithArguments.childCompleted(AbstractSequentialWithArguments.java:192) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.notificationEvent(Sequential.java:33) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:332)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:125)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:99) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.fireNotificationEvent(FlowNode.java:176) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.complete(FlowNode.java:296)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowContainer.post(FlowContainer.java:58) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.post(AbstractFunction.java:46) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.startNext(Sequential.java:51) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.Sequential.executeChildren(Sequential.java:27) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.functions.AbstractFunction.executeChildren(AbstractFunction.java:40) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowContainer.execute(FlowContainer.java:63) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.restart(FlowNode.java:233)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.start(FlowNode.java:278)
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.controlEvent(FlowNode.java:391) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.event(FlowNode.java:329)
>     at 
> org.globus.cog.karajan.workflow.FlowElementWrapper.event(FlowElementWrapper.java:227) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.send(EventBus.java:125)
>     at 
> org.globus.cog.karajan.workflow.events.EventBus.sendHooked(EventBus.java:99) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.EventWorker.run(EventWorker.java:69)
> Caused by: null
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.cog.karajan.workflow.events.FailureNotificationEvent.<init>(FailureNotificationEvent.java:36) 
> 
>     at 
> org.globus.cog.karajan.workflow.events.FailureNotificationEvent.<init>(FailureNotificationEvent.java:42) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.FlowNode.failImmediately(FlowNode.java:151) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.grid.AbstractGridNode.taskFailed(AbstractGridNode.java:314) 
> 
>     at 
> org.globus.cog.karajan.workflow.nodes.grid.AbstractGridNode.statusChanged(AbstractGridNode.java:276) 
> 
>     at 
> org.globus.cog.karajan.scheduler.AbstractScheduler.fireJobStatusChangeEvent(AbstractScheduler.java:168) 
> 
>     at 
> org.globus.cog.karajan.scheduler.LateBindingScheduler.statusChanged(LateBindingScheduler.java:656) 
> 
>     at 
> org.globus.cog.karajan.scheduler.WeightedHostScoreScheduler.statusChanged(WeightedHostScoreScheduler.java:421) 
> 
>     at 
> org.griphyn.vdl.karajan.VDSAdaptiveScheduler.statusChanged(VDSAdaptiveScheduler.java:410) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.task.TaskImpl.notifyListeners(TaskImpl.java:236) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.task.TaskImpl.setStatus(TaskImpl.java:224) 
> 
>     at 
> org.globus.cog.abstraction.impl.common.AbstractDelegatedTaskHandler.failTask(AbstractDelegatedTaskHandler.java:54) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.transferFailed(DelegatedFileTransferHandler.java:581) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.run(DelegatedFileTransferHandler.java:505) 
> 
>     at java.lang.Thread.run(Thread.java:595)
> Caused by: 
> org.globus.cog.abstraction.impl.file.IrrecoverableResourceException
>     at 
> org.globus.cog.abstraction.impl.file.ftp.AbstractFTPFileResource.translateException(AbstractFTPFileResource.java:44) 
> 
>     at 
> org.globus.cog.abstraction.impl.file.ftp.AbstractFTPFileResource.translateException(AbstractFTPFileResource.java:33) 
> 
>     at 
> org.globus.cog.abstraction.impl.file.gridftp.old.FileResourceImpl.putFile(FileResourceImpl.java:430) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.doDestination(DelegatedFileTransferHandler.java:355) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.CachingDelegatedFileTransferHandler.doDestination(CachingDelegatedFileTransferHandler.java:47) 
> 
>     at 
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.run(DelegatedFileTransferHandler.java:492) 
> 
>     ... 1 more
> Caused by: org.globus.ftp.exception.ServerException: Reply wait timeout. 
> (error code 4)
>     at 
> org.globus.ftp.vanilla.FTPServerFacade$LocalControlChannel.waitFor(FTPServerFacade.java:511) 
> 
>     at org.globus.ftp.vanilla.TransferMonitor.run(TransferMonitor.java:129)
>     ... 1 more
> Swift finished with errors
> -bash-3.2$
> -bash-3.2$