[Swift-devel] swift not working

Michael Andric andric at uchicago.edu
Tue Apr 21 12:41:58 CDT 2009


also failed again on ucanl64
[...]
Progress:  Submitting:1  Submitted:1 Failed but can retry:1
Failed to transfer wrapper log from AFNIsnr-20090421-1231-4h6oa8k5/info/e on
ANLUCTERAGRID64
Progress:  Submitting:1 Failed but can retry:2
Progress:  Submitting:1 Failed but can retry:2
Progress:  Submitting:1 Failed but can retry:2
Progress:  Stage in:1  Submitting:1 Failed but can retry:1
Progress:  Submitting:1  Submitted:1 Failed but can retry:1
Progress:  Submitting:1 Failed but can retry:2
Failed to transfer wrapper log from AFNIsnr-20090421-1231-4h6oa8k5/info/g on
ANLUCTERAGRID64
Execution failed:
        Exception in AFNI_3dvolreg:
Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run5_trim, -base,
ts.5_trim+orig[92], -prefix, volreg.RFL2.run5_trim, ts.5_trim+orig.BRIK]
Host: ANLUCTERAGRID64
Directory: AFNIsnr-20090421-1231-4h6oa8k5/jobs/g/AFNI_3dvolreg-gbrmkp9j
stderr.txt:

stdout.txt:

----

Caused by:
        Cannot submit job: ; nested exception is:
        java.net.SocketTimeoutException: Read timed out
gwynn 7%






On Tue, Apr 21, 2009 at 12:29 PM, Michael Andric <andric at uchicago.edu>wrote:

> just tried on bigred - also failed
> [...]
> Progress: Failed but can retry:3
> Progress:  Stage in:1 Failed but can retry:2
> Exception occured in the exception handling code, so it cannot be properly
> propagated to the user
> java.net.SocketException: Broken pipe
>         at java.net.SocketOutputStream.socketWrite0(Native Method)
>         at
> java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:92)
>         at java.net.SocketOutputStream.write(SocketOutputStream.java:115)
>         at java.io.DataOutputStream.writeByte(DataOutputStream.java:136)
>         at
> org.globus.ftp.dc.EBlockImageDCWriter.endOfData(EBlockImageDCWriter.java:63)
>         at
> org.globus.ftp.dc.GridFTPTransferSourceThread.shutdown(GridFTPTransferSourceThread.java:62)
>         at
> org.globus.ftp.dc.TransferSourceThread.run(TransferSourceThread.java:87)
> Progress: Failed but can retry:3
> Failed to transfer wrapper log from AFNIsnr-20090421-1220-q9598ll1/info/o
> on BIGRED
> Progress:  Failed:1 Failed but can retry:2
> Execution failed:
>         Exception in AFNI_3dvolreg:
> Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run4_trim, -base,
> ts.4_trim+orig[92], -prefix, volreg.RFL2.run4_trim, ts.4_trim+orig.BRIK]
> Host: BIGRED
> Directory: AFNIsnr-20090421-1220-q9598ll1/jobs/o/AFNI_3dvolreg-od58kp9j
> stderr.txt:
>
> stdout.txt:
>
> ----
>
> Caused by:
>         Server refused performing the request. Custom message:  (error code
> 1) [Nested exception message:  Custom message: Unexpected reply: 451 ocurred
> during retrieve()
> org.globus.ftp.exception.ServerException: Refusing to start transfer before
> previous transfer completes (error code 5)
> org.globus.ftp.exception.ServerException: Refusing to start transfer before
> previous transfer completes (error code 5)
>         at
> org.globus.ftp.dc.TransferThreadManager.startTransfer(TransferThreadManager.java:129)
>         at
> org.globus.ftp.extended.GridFTPServerFacade.retrieve(GridFTPServerFacade.java:431)
>         at org.globus.ftp.FTPClient.put(FTPClient.java:1289)
>         at
> org.globus.cog.abstraction.impl.file.gridftp.old.FileResourceImpl.putFile(FileResourceImpl.java:427)
>         at
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.doDestination(DelegatedFileTransferHandler.java:355)
>         at
> org.globus.cog.abstraction.impl.fileTransfer.CachingDelegatedFileTransferHandler.doDestination(CachingDelegatedFileTransferHandler.java:47)
>         at
> org.globus.cog.abstraction.impl.fileTransfer.DelegatedFileTransferHandler.run(DelegatedFileTransferHandler.java:492)
>         at java.lang.Thread.run(Thread.java:595)
> ]
> gwynn 4%
>
> On Tue, Apr 21, 2009 at 12:25 PM, Mihael Hategan <hategan at mcs.anl.gov>wrote:
>
>> Ok. My point was, in general, that if there are such catastrophic
>> failures, spacing out in time a couple of attempts generally raises the
>> confidence that the problem is not some spooky transient issue.
>>
>> On Tue, 2009-04-21 at 17:19 +0000, Ben Clifford wrote:
>> > I can recreate non-swift brokenness on both of those sites.
>> >
>> > On Tue, 21 Apr 2009, Mihael Hategan wrote:
>> >
>> > > Did you also try that one again?
>> > >
>> > > On Tue, 2009-04-21 at 12:14 -0500, Michael Andric wrote:
>> > > > what about ucanl?
>> > > >
>> > > > On Tue, Apr 21, 2009 at 12:11 PM, Ben Clifford <benc at hawaga.org.uk>
>> > > > wrote:
>> > > >
>> > > >         On Tue, 21 Apr 2009, Michael Andric wrote:
>> > > >
>> > > >
>> > > >         > /disks/ci-gpfs/fmri/cnari/config/sites_bsd.xml
>> > > >
>> > > >
>> > > >         I cannot make a manual submission to gwynn.bsd.uchicago.edu
>> > > >         using
>> > > >         globus-job-run, so this is not a Swift problem. I think this
>> > > >         is something
>> > > >         that support at ci should deal with.
>> > > >
>> > > >         --
>> > > >
>> > > >
>> > > > _______________________________________________
>> > > > Swift-devel mailing list
>> > > > Swift-devel at ci.uchicago.edu
>> > > > http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>> > >
>> > >
>>
>>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20090421/c4d1c94b/attachment.html>


More information about the Swift-devel mailing list