[Swift-devel] swift not working

Michael Wilde wilde at mcs.anl.gov
Tue Apr 21 11:12:52 CDT 2009


What do these log messages mean:

2009-04-21 09:44:05,634-0500 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
jobid=AFNI_3dvolreg-f0rydp9j - Application exception: Cannot submit job
Caused by: 
org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: 
Cannot submit job
Caused by: org.globus.gram.GramException: Data transfer to the server 
failed [Caused by: Token length 1248813600 > 33554432]
2009-04-21 09:44:05,635-0500 DEBUG vdl:execute2 APPLICATION_EXCEPTION 
jobid=AFNI_3dvolreg-g0rydp9j - Application exception: Cannot submit job
Caused by: 
org.globus.cog.abstraction.impl.common.task.TaskSubmissionException: 
Cannot submit job
Caused by: org.globus.gram.GramException: Data transfer to the server 
failed [Caused by: Token length 1248813600 > 33554432]

Googling for Token length  > 33554432 gets a lot of hits including:

http://bugzilla.globus.org/globus/show_bug.cgi?id=2210



On 4/21/09 11:08 AM, Mihael Hategan wrote:
> I don't see anything wrong with your setup.
> 
> Is this a persistent problem (i.e. can you try to run these again)?
> 
> On Tue, 2009-04-21 at 10:31 -0500, Michael Andric wrote:
>> /disks/ci-gpfs/fmri/cnari/config/sites_ucanl64.xml
>>
>>
>> and 
>>
>>
>> /disks/ci-gpfs/fmri/cnari/config/sites_bsd.xml
>>
>> On Tue, Apr 21, 2009 at 10:10 AM, Ben Clifford <benc at hawaga.org.uk>
>> wrote:
>>         
>>         Please can you send the sites.xml files for the below two
>>         sites. They both
>>         look like network errors of some kind.
>>         
>>         
>>         On Tue, 21 Apr 2009, Michael Andric wrote:
>>         
>>         > Normally, I would hit up Sarah for a fix on this, but since
>>         she's on
>>         > vacation I'm hoping someone else out there could help with
>>         this.  I'm unable
>>         > to get swift jobs submitted.  I've tried submitting to both
>>         the ucanl64 and
>>         > bsd clusters.  The run dir (with log files) is here:
>>         > /disks/ci-gpfs/fmri/cnari/swift/projects/andric/SNR/RFL2
>>         >
>>         >
>>         > Here's what I get from ucanl:
>>         >
>>         > [...]Progress:  Submitting:1  Submitted:1 Failed but can
>>         retry:1
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0930-q403bn99/info/e on
>>         > ANLUCTERAGRID64
>>         > Progress:  Submitting:1 Failed but can retry:2
>>         > Progress:  Submitting:1 Failed but can retry:2
>>         > Progress:  Submitting:1 Failed but can retry:2
>>         > Progress:  Stage in:1  Submitting:1 Failed but can retry:1
>>         > Progress:  Submitting:2 Failed but can retry:1
>>         > Progress:  Submitting:1  Submitted:1 Failed but can retry:1
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0930-q403bn99/info/g on
>>         > ANLUCTERAGRID64
>>         > Progress:  Submitting:1  Failed:1 Failed but can retry:1
>>         > Execution failed:
>>         >         Exception in AFNI_3dvolreg:
>>         > Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run6_trim,
>>         -base,
>>         > ts.6_trim+orig[92], -prefix, volreg.RFL2.run6_trim,
>>         ts.6_trim+orig.BRIK]
>>         > Host: ANLUCTERAGRID64
>>         > Directory:
>>         AFNIsnr-20090421-0930-q403bn99/jobs/g/AFNI_3dvolreg-gxefdp9j
>>         > stderr.txt:
>>         >
>>         > stdout.txt:
>>         >
>>         > ----
>>         >
>>         > Caused by:
>>         >         Cannot submit job: ; nested exception is:
>>         >         java.net.SocketTimeoutException: Read timed out
>>         > gwynn 5%
>>         >
>>         >
>>         >
>>         > and this is what I get on bsd:
>>         >
>>         > RunID: 20090421-0943-o1bb0081
>>         > Progress:
>>         > Progress:  Selecting site:1  Initializing site shared
>>         directory:1  Stage
>>         > in:1
>>         > Progress:  Stage in:2  Submitting:1
>>         > 2009.04.21 09:44:04.848 CDT: [ERROR] Parsing profiles on
>>         line 1800 Illegal
>>         > character ':'at position 60 :Illegal character ':'
>>         > Progress:  Submitted:1 Failed but can retry:2
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/f on
>>         > BSD
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/g on
>>         > BSD
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/e on
>>         > BSD
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/i on
>>         > BSD
>>         > Progress: Failed but can retry:3
>>         > Progress:  Stage in:1 Failed but can retry:2
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/m on
>>         > BSD
>>         > Progress: Failed but can retry:3
>>         > Progress: Failed but can retry:3
>>         > Progress: Failed but can retry:3
>>         > Progress:  Stage in:1 Failed but can retry:2
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/o on
>>         > BSD
>>         > Progress: Failed but can retry:3
>>         > Progress: Failed but can retry:3
>>         > Progress: Failed but can retry:3
>>         > Progress: Failed but can retry:3
>>         > Progress:  Stage in:1 Failed but can retry:2
>>         > Failed to transfer wrapper log from
>>         AFNIsnr-20090421-0943-o1bb0081/info/q on
>>         > BSD
>>         > Progress:  Failed:1 Failed but can retry:2
>>         > Execution failed:
>>         >         Exception in AFNI_3dvolreg:
>>         > Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run5_trim,
>>         -base,
>>         > ts.5_trim+orig[92], -prefix, volreg.RFL2.run5_trim,
>>         ts.5_trim+orig.BRIK]
>>         > Host: BSD
>>         > Directory:
>>         AFNIsnr-20090421-0943-o1bb0081/jobs/q/AFNI_3dvolreg-q0rydp9j
>>         > stderr.txt:
>>         >
>>         > stdout.txt:
>>         >
>>         > ----
>>         >
>>         > Caused by:
>>         >         Cannot submit job
>>         > Caused by:
>>         >         Data transfer to the server failed [Caused by: Token
>>         length
>>         > 1248813600 > 33554432]
>>         >
>>         
>>
>>
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
> 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel



More information about the Swift-devel mailing list