[Swift-devel] Failed to start channel GSSCChannel (trunk, coasters, ssh-cl:pbs)
Mihael Hategan
hategan at mcs.anl.gov
Fri Feb 3 14:29:52 CST 2012
Ok, so maybe the ssh-cl provider doesn't properly forward environment
variables. I'll double check that.
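
As a rough way to check the "is the variable actually reaching the process" question, here is a minimal sketch. It only tests a local child process and does not exercise the ssh-cl provider itself; the helper name value_seen_by_child is made up for illustration. The same one-line child program could be run on the far end of an ssh session to see what arrives there.

```python
import os
import subprocess
import sys


def value_seen_by_child(name, env):
    """Return the value of variable `name` as a child process started with `env` sees it.

    An empty string means the child did not receive the variable.
    """
    # The child prints the variable it was asked about (passed as argv[1]).
    code = "import os, sys; sys.stdout.write(os.environ.get(sys.argv[1], ''))"
    result = subprocess.run(
        [sys.executable, "-c", code, name],
        env=env,
        capture_output=True,
        text=True,
        check=True,
    )
    return result.stdout


# Two environments: one without GLOBUS_HOSTNAME, one with the value from this thread.
base_env = dict(os.environ)
base_env.pop("GLOBUS_HOSTNAME", None)
env_with_var = dict(base_env, GLOBUS_HOSTNAME="fl.ci.uchicago.edu")

if __name__ == "__main__":
    print("without:", repr(value_seen_by_child("GLOBUS_HOSTNAME", base_env)))
    print("with:   ", repr(value_seen_by_child("GLOBUS_HOSTNAME", env_with_var)))
```

If the second call comes back empty in a given setup, the variable is being dropped somewhere between the caller and the child, which is the behaviour suspected above.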
On Fri, 2012-02-03 at 14:02 -0600, Thomas Uram wrote:
> I have done this without success:
>
> GLOBUS_HOSTNAME=fl.ci.uchicago.edu
> GLOBUS_TCP_PORT_RANGE=50000,50100
> swiftt -sites.file sites.coasters.xml -tc.file tc.data hostname.swift
> Swift trunk swift-r5501 (swift modified locally) cog-r3350 (cog modified locally)
>
> RunID: 20120203-1357-8tekc3f7
> Progress: time: Fri, 03 Feb 2012 13:57:24 -0600
> Progress: time: Fri, 03 Feb 2012 13:57:31 -0600 Selecting site:4 Initializing site shared directory:1 Stage in:1
> ssh not set, setting to 'gsissh'
> ssh=gsissh
> Find: https://206.12.24.2:38675
> Find: keepalive(120), reconnect - https://206.12.24.2:38675
> Progress: time: Fri, 03 Feb 2012 13:57:35 -0600 Selecting site:4 Submitting:1 Submitted:1
> Failed to transfer wrapper log for job hostname-1jnudkmk
> Progress: time: Fri, 03 Feb 2012 13:57:38 -0600 Selecting site:3 Stage in:1 Failed but can retry:2
> Failed to transfer wrapper log for job hostname-2jnudkmk
> Failed to transfer wrapper log for job hostname-4jnudkmk
> Progress: time: Fri, 03 Feb 2012 13:57:54 -0600 Selecting site:3 Failed but can retry:3
> Progress: time: Fri, 03 Feb 2012 13:57:57 -0600 Selecting site:2 Stage in:1 Failed but can retry:3
> Failed to transfer wrapper log for job hostname-7jnudkmk
> No events in 10s.
>
> Registered futures:
> ----
>
> Waiting threads:
> ----
>
> No events in 10s.
>
> Registered futures:
> ----
>
> Waiting threads:
> ----
>
> ** Ctrl-C here **
>
> Progress: time: Fri, 03 Feb 2012 13:58:24 -0600 Selecting site:2 Failed but can retry:4
> Failed to shut down service https://206.12.24.2:38675
> org.globus.cog.karajan.workflow.service.channels.ChannelException: Failed to start channel GSSCChannel-https://206.12.24.2:38675(6)[69518356: {}]
> at org.globus.cog.karajan.workflow.service.channels.GSSChannel.reconnect(GSSChannel.java:103)
> at org.globus.cog.karajan.workflow.service.channels.GSSChannel.start(GSSChannel.java:62)
> at org.globus.cog.karajan.workflow.service.ChannelFactory.newChannel(ChannelFactory.java:55)
> at org.globus.cog.karajan.workflow.service.Client.connect(Client.java:116)
> at org.globus.cog.karajan.workflow.service.Client.newClient(Client.java:72)
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.connect(ChannelManager.java:236)
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:256)
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:217)
> at org.globus.cog.abstraction.impl.execution.coaster.ServiceManager$ServiceReaper.run(ServiceManager.java:430)
> Caused by: java.net.NoRouteToHostException: No route to host
> at java.net.PlainSocketImpl.socketConnect(Native Method)
> at java.net.PlainSocketImpl.doConnect(PlainSocketImpl.java:351)
> at java.net.PlainSocketImpl.connectToAddress(PlainSocketImpl.java:213)
> at java.net.PlainSocketImpl.connect(PlainSocketImpl.java:200)
> at java.net.SocksSocketImpl.connect(SocksSocketImpl.java:366)
> at java.net.Socket.connect(Socket.java:529)
> at java.net.Socket.connect(Socket.java:478)
> at java.net.Socket.<init>(Socket.java:375)
> at java.net.Socket.<init>(Socket.java:276)
> at org.globus.net.SocketFactory.createSocket(SocketFactory.java:74)
> at org.globus.net.SocketFactory.createSocket(SocketFactory.java:53)
> at org.globus.gsi.gssapi.net.GssSocket.<init>(GssSocket.java:56)
> at org.globus.gsi.gssapi.net.impl.GSIGssSocket.<init>(GSIGssSocket.java:29)
> at org.globus.gsi.gssapi.net.impl.GSIGssSocketFactory.createSocket(GSIGssSocketFactory.java:38)
> at org.globus.cog.karajan.workflow.service.channels.GSSChannel.reconnect(GSSChannel.java:89)
> ... 8 more
>
>
> Full log here:
> http://www.mcs.anl.gov/~turam/20120203-1401/hostname-20120203-1357-8tekc3f7.log
>
> On Feb 3, 2012, at 1:54 PM, Mihael Hategan wrote:
>
> > On Fri, 2012-02-03 at 13:44 -0600, Thomas Uram wrote:
> >> No I didn't set GLOBUS_HOSTNAME. The address it complains about
> >> (206.12.24.2) is publicly reachable. So is the hostname of the machine
> >> on which I'm running Swift (fl.ci.uchicago.edu).
> >
> > They should be the same! (i.e. the coaster service tries to connect back
> > to the machine you're running Swift on).
> >
> > Can you try setting GLOBUS_HOSTNAME and see what happens?
> >
> >>
> >>
> >> I was wondering about the jumble that follows the hostname:port in
> >> that URL:
> >>
> >>
> >>>> Failed to start channel
> >>>> GSSCChannel-https://206.12.24.2:35836(2)[1544213635: {}]
> >
> > (2) is the channel ID
> > [15...] is the channel context
> > They are not part of the IP address, but part of GSSChannel.toString().
> >
> >
>
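
For reading channel strings like the one quoted above, a small sketch that splits one into the URL, the channel ID and the channel context. The pattern is inferred only from the two strings that appear in this thread, not from the GSSChannel source, so treat the layout as an assumption; parse_channel_string is a hypothetical helper name.

```python
import re
from urllib.parse import urlsplit

# Assumed layout, based on the strings in this thread:
#   <name>-<url>(<channel id>)[<channel context>]
CHANNEL_RE = re.compile(
    r"^(?P<name>\w+)-(?P<url>\S+?)\((?P<channel_id>\d+)\)\[(?P<context>.*)\]$"
)


def parse_channel_string(text):
    """Split a channel description into its parts, or return None if it does not match."""
    match = CHANNEL_RE.match(text.strip())
    if match is None:
        return None
    url = urlsplit(match.group("url"))
    return {
        "name": match.group("name"),
        "host": url.hostname,          # the address the connection is made to
        "port": url.port,
        "channel_id": int(match.group("channel_id")),
        "context": match.group("context"),
    }


if __name__ == "__main__":
    sample = "GSSCChannel-https://206.12.24.2:35836(2)[1544213635: {}]"
    print(parse_channel_string(sample))
```

Only the host and port are part of the address being contacted; the remaining fields are identifiers added when the channel is printed.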