[Swift-devel] swift not working
Michael Wilde
wilde at mcs.anl.gov
Tue Apr 21 11:12:52 CDT 2009
What do these log messages mean:
2009-04-21 09:44:05,634-0500 DEBUG vdl:execute2 APPLICATION_EXCEPTION
jobid=AFNI_3dvolreg-f0rydp9j - Application exception: Cannot submit job
Caused by:
org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
Cannot submit job
Caused by: org.globus.gram.GramException: Data transfer to the server
failed [Caused by: Token length 1248813600 > 33554432]
2009-04-21 09:44:05,635-0500 DEBUG vdl:execute2 APPLICATION_EXCEPTION
jobid=AFNI_3dvolreg-g0rydp9j - Application exception: Cannot submit job
Caused by:
org.globus.cog.abstraction.impl.common.task.TaskSubmissionException:
Cannot submit job
Caused by: org.globus.gram.GramException: Data transfer to the server
failed [Caused by: Token length 1248813600 > 33554432]
Googling for Token length > 33554432 gets a lot of hits including:
http://bugzilla.globus.org/globus/show_bug.cgi?id=2210
On 4/21/09 11:08 AM, Mihael Hategan wrote:
> I don't see anything wrong with your setup.
>
> Is this a persistent problem (i.e. can you try to run these again)?
>
> On Tue, 2009-04-21 at 10:31 -0500, Michael Andric wrote:
>> /disks/ci-gpfs/fmri/cnari/config/sites_ucanl64.xml
>>
>>
>> and
>>
>>
>> /disks/ci-gpfs/fmri/cnari/config/sites_bsd.xml
>>
>> On Tue, Apr 21, 2009 at 10:10 AM, Ben Clifford <benc at hawaga.org.uk>
>> wrote:
>>
>> Please can you send the sites.xml files for the below two
>> sites. They both
>> look like network errors of some kind.
>>
>>
>> On Tue, 21 Apr 2009, Michael Andric wrote:
>>
>> > Normally, I would hit up Sarah for a fix on this, but since
>> she's on
>> > vacation I'm hoping someone else out there could help with
>> this. I'm unable
>> > to get swift jobs submitted. I've tried submitting to both
>> the ucanl64 and
>> > bsd clusters. The run dir (with log files) is here:
>> > /disks/ci-gpfs/fmri/cnari/swift/projects/andric/SNR/RFL2
>> >
>> >
>> > Here's what I get from ucanl:
>> >
>> > [...]Progress: Submitting:1 Submitted:1 Failed but can
>> retry:1
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0930-q403bn99/info/e on
>> > ANLUCTERAGRID64
>> > Progress: Submitting:1 Failed but can retry:2
>> > Progress: Submitting:1 Failed but can retry:2
>> > Progress: Submitting:1 Failed but can retry:2
>> > Progress: Stage in:1 Submitting:1 Failed but can retry:1
>> > Progress: Submitting:2 Failed but can retry:1
>> > Progress: Submitting:1 Submitted:1 Failed but can retry:1
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0930-q403bn99/info/g on
>> > ANLUCTERAGRID64
>> > Progress: Submitting:1 Failed:1 Failed but can retry:1
>> > Execution failed:
>> > Exception in AFNI_3dvolreg:
>> > Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run6_trim,
>> -base,
>> > ts.6_trim+orig[92], -prefix, volreg.RFL2.run6_trim,
>> ts.6_trim+orig.BRIK]
>> > Host: ANLUCTERAGRID64
>> > Directory:
>> AFNIsnr-20090421-0930-q403bn99/jobs/g/AFNI_3dvolreg-gxefdp9j
>> > stderr.txt:
>> >
>> > stdout.txt:
>> >
>> > ----
>> >
>> > Caused by:
>> > Cannot submit job: ; nested exception is:
>> > java.net.SocketTimeoutException: Read timed out
>> > gwynn 5%
>> >
>> >
>> >
>> > and this is what I get on bsd:
>> >
>> > RunID: 20090421-0943-o1bb0081
>> > Progress:
>> > Progress: Selecting site:1 Initializing site shared
>> directory:1 Stage
>> > in:1
>> > Progress: Stage in:2 Submitting:1
>> > 2009.04.21 09:44:04.848 CDT: [ERROR] Parsing profiles on
>> line 1800 Illegal
>> > character ':'at position 60 :Illegal character ':'
>> > Progress: Submitted:1 Failed but can retry:2
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/f on
>> > BSD
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/g on
>> > BSD
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/e on
>> > BSD
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/i on
>> > BSD
>> > Progress: Failed but can retry:3
>> > Progress: Stage in:1 Failed but can retry:2
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/m on
>> > BSD
>> > Progress: Failed but can retry:3
>> > Progress: Failed but can retry:3
>> > Progress: Failed but can retry:3
>> > Progress: Stage in:1 Failed but can retry:2
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/o on
>> > BSD
>> > Progress: Failed but can retry:3
>> > Progress: Failed but can retry:3
>> > Progress: Failed but can retry:3
>> > Progress: Failed but can retry:3
>> > Progress: Stage in:1 Failed but can retry:2
>> > Failed to transfer wrapper log from
>> AFNIsnr-20090421-0943-o1bb0081/info/q on
>> > BSD
>> > Progress: Failed:1 Failed but can retry:2
>> > Execution failed:
>> > Exception in AFNI_3dvolreg:
>> > Arguments: [-twopass, -twodup, -dfile, mot_RFL2.run5_trim,
>> -base,
>> > ts.5_trim+orig[92], -prefix, volreg.RFL2.run5_trim,
>> ts.5_trim+orig.BRIK]
>> > Host: BSD
>> > Directory:
>> AFNIsnr-20090421-0943-o1bb0081/jobs/q/AFNI_3dvolreg-q0rydp9j
>> > stderr.txt:
>> >
>> > stdout.txt:
>> >
>> > ----
>> >
>> > Caused by:
>> > Cannot submit job
>> > Caused by:
>> > Data transfer to the server failed [Caused by: Token
>> length
>> > 1248813600 > 33554432]
>> >
>>
>>
>>
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
More information about the Swift-devel
mailing list