[Swift-devel] coaster one-liner bootstrap script

Michael Wilde wilde at mcs.anl.gov
Thu Feb 12 18:36:13 CST 2009


I updated to 2300. Now I get the error below 
(java.lang.RuntimeException: Failed to register service)

Im also a bit confused why I see "which: no gmd5sum in 
(/soft/java-1.5.0_06-sun-r1/bin: etc etc" on stdout - that should be 
going to /dev/null, but its reproducible in a normal interactive shell. 
Something subtle in eval?

gram log is in ~osg/gram_job_mgr_17585.log

swift log is in ~wilde/oops7-20090212-1510-kk6i43og.log

(on ci network)

- Mike

On 2/12/09 2:06 PM, Mihael Hategan wrote:
> On Thu, 2009-02-12 at 01:05 -0600, Michael Wilde wrote:
>> I got: coaster-bootstrap.list not found in classpath
> 
> Should be fixed in swift r2300.
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel

com$ ls -l ~osg/coaster-bootstrap-167563000.log
-rw-r--r--  1 osg osgvo 2744 Feb 12 15:11 
/home/osgvo/osg/coaster-bootstrap-167563000.log
com$ cat ~osg/coaster-bootstrap-167563000.log
BS: http://communicado.ci.uchicago.edu:50001
Expected checksum: c6dbde30e69462446c06e15a46fba6eb
Computed checksum: c6dbde30e69462446c06e15a46fba6eb
JAVA=/soft/java-1.5.0_06-sun-r1/bin/java
/soft/java-1.5.0_06-sun-r1/bin/java 
-Djava=/soft/java-1.5.0_06-sun-r1/bin/java -DGLOBUS_TCP_PORT_RANGE= 
-DX509_USER_PROXY=/home/osgvo/osg/.globus/job/tp-grid1.ci.uchicago.edu/16700.1234473069/x509_up 
-DX509_CERT_DIR= -DGLOBUS_HOSTNAME=tp-grid1.ci.uchicago.edu -jar 
/tmp/bootstrap.N16834 http://communicado.ci.uchicago.edu:50001 
b3d581fddd49e3d1166f52f6077ddcc5 https://128.135.125.17:50000 167563000
java.lang.RuntimeException: Failed to register service
         at 
org.globus.cog.abstraction.coaster.service.CoasterService.start(CoasterService.java:111)
         at 
org.globus.cog.abstraction.coaster.service.CoasterService.main(CoasterService.java:226)
Caused by: 
org.globus.cog.karajan.workflow.service.channels.ChannelException: 
Failed to start channel 
GSSCChannel-https://b3d581fddd49e3d1166f52f6077ddcc5:1984(1)
         at 
org.globus.cog.karajan.workflow.service.channels.GSSChannel.reconnect(GSSChannel.java:104)
         at 
org.globus.cog.karajan.workflow.service.channels.GSSChannel.start(GSSChannel.java:63)
         at 
org.globus.cog.karajan.workflow.service.ChannelFactory.newChannel(ChannelFactory.java:43)
         at 
org.globus.cog.karajan.workflow.service.Client.connect(Client.java:115)
         at 
org.globus.cog.karajan.workflow.service.Client.newClient(Client.java:72)
         at 
org.globus.cog.karajan.workflow.service.channels.ChannelManager.connect(ChannelManager.java:211)
         at 
org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:230)
         at 
org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:186)
         at 
org.globus.cog.abstraction.coaster.service.CoasterService.start(CoasterService.java:100)
         ... 1 more
Caused by: java.net.UnknownHostException: 
b3d581fddd49e3d1166f52f6077ddcc5: b3d581fddd49e3d1166f52f6077ddcc5
         at java.net.InetAddress.getAllByName0(InetAddress.java:1128)
         at java.net.InetAddress.getAllByName0(InetAddress.java:1098)
         at java.net.InetAddress.getAllByName(InetAddress.java:1061)
         at java.net.InetAddress.getByName(InetAddress.java:958)
         at org.globus.net.SocketFactory.createSocket(SocketFactory.java:53)
         at org.globus.gsi.gssapi.net.GssSocket.<init>(GssSocket.java:56)
         at 
org.globus.gsi.gssapi.net.impl.GSIGssSocket.<init>(GSIGssSocket.java:29)
         at 
org.globus.gsi.gssapi.net.impl.GSIGssSocketFactory.createSocket(GSIGssSocketFactory.java:38)
         at 
org.globus.cog.karajan.workflow.service.channels.GSSChannel.reconnect(GSSChannel.java:90)
         ... 9 more

EC: 1
BS: http://communicado.ci.uchicago.edu:50001
Failed to download bootstrap jar from 
http://communicado.ci.uchicago.edu:50001
com$

---- and on stdout/stderr:

com$ cat swift.out
/home/wilde/swift/tools/swiftrun: Swift script oops7.swift starting at 
Thu Feb 12 15:10:57 CST 2009
running on sites: teraport.coaster.gt2.osg

Swift svn swift-r2532 cog-r2300

RunID: 20090212-1510-kk6i43og
Progress:
Progress:  Stage in:1 Initializing site shared directory:1
Progress:  Stage in:1 Submitting:1
Progress:  Submitting:1 Submitted:1
Failed to transfer wrapper log from oops7-20090212-1510-kk6i43og/info/j 
on teraport
Execution failed:
         Exception in runoops:
Arguments: [input/fasta/T1af7.fasta, input/secseq/T1af7.secseq, 
input/native/T1af7.pdb, output/T1af7.0.pdt, output/T1af7.0.rmsd, 0, TEMP 
UPDATE INTERVAL = 10, SMOOTH DEVIATION COEFFICIENT = 0.80001]
Host: teraport
Directory: oops7-20090212-1510-kk6i43og/jobs/j/runoops-j1j9zi6j
stderr.txt:

stdout.txt:

----

Caused by:
         Could not submit job
Caused by:
         Could not start coaster service
Caused by:
         Task ended before registration was received.
STDOUT: which: no gmd5sum in 
(/soft/java-1.5.0_06-sun-r1/bin:/soft/java-1.5.0_06-sun-r1/jre/bin:/usr/kerberos/bin:/bin:/usr/bin:/usr/X11R6/bin:/usr/local/bin:/software/common/softenv-1.6.0-r1/bin:/home/osgvo/osg/bin/linux-rhel4-x86_64:/home/osgvo/osg/bin:/soft/xcat-1.2.0-r1/bin:/soft/xcat-1.2.0-r1/sbin:/soft/xcat-1.2.0-r1/x86_64/bin:/soft/xcat-1.2.0-r1/x86_64/sbin:/soft/xcat-1.2.0-r1/contrib/bin:/soft/xcat-1.2.0-r1/contrib/sbin:/soft/xcat-1.2.0-r1/contrib/x86_64/bin:/soft/xcat-1.2.0-r1/contrib/x86_64/sbin)


STDERR: null
Cleaning up...
  Done

/home/wilde/swift/tools/swiftrun: Swift Script oops7.swift ended at Thu 
Feb 12 15:11:24 CST 2009 with exit code 0
com$




More information about the Swift-devel mailing list