[Swift-devel] Cant get auto-coasters to run from midway to beagle

Michael Wilde wilde at mcs.anl.gov
Sat Mar 9 16:11:24 CST 2013


Now Im getting the error below (from running 317 simple MODIS apps concurrently).  Im going to dial down the throttle first to see if the staging load is overwhelming either coasters or the midway-beagle path.

- Mike


----- Original Message -----
> From: "Michael Wilde" <wilde at mcs.anl.gov>
> To: "Mihael Hategan" <hategan at mcs.anl.gov>
> Cc: "Swift Devel" <swift-devel at ci.uchicago.edu>
> Sent: Saturday, March 9, 2013 3:59:22 PM
> Subject: Re: [Swift-devel] Cant get auto-coasters to run from midway to	beagle
> 
> I think we just got this working. Problems may have included the need
> to pre-create the workdirectory and to specify a dotted IP address
> on the external network for GLOBUS_HOSTNAME.  Will need to
> experiment.  So likely that proxy expiration time was not a problem
> (although its confusing).
> 
> Will report back on this once the needed steps are clear.
> 
> Thanks,
> 
> - Mike
> 
> ----- Original Message -----
> > From: "Mihael Hategan" <hategan at mcs.anl.gov>
> > To: "Michael Wilde" <wilde at mcs.anl.gov>
> > Cc: "Swift Devel" <swift-devel at ci.uchicago.edu>
> > Sent: Saturday, March 9, 2013 3:56:36 PM
> > Subject: Re: Cant get auto-coasters to run from midway to beagle
> > 
> > Can you post ,globus/coasters/coaster.log from beagle?
> > 
> > On Sat, 2013-03-09 at 15:46 -0600, Michael Wilde wrote:
> > > Mihael, can you advise on this problem?
> > > 
> > > David and I are trying to run automatic coaster jobs from midway
> > > login hosts and swift.rcc to beagle using ssh-cl:pbs.
> > > 
> > > My failed attempts are on midway under
> > > /home/wilde/osgdemo/modis/svn, see eg run020 (which has complete
> > > logs).
> > > 
> > > Quick question about the proxy files that get copied. Does this
> > > look OK? :
> > > 
> > >   2013-03-09 21:24:46,895+0000 INFO  AutoCA Checking certificate
> > >   /home/wilde/.globus/coasters/proxy.0.pem
> > > 2013-03-09 21:24:46,967+0000 INFO  AutoCA Using certificate
> > > /home/wilde/.globus/coasters/proxy.0.pem with expiration date Sat
> > > Mar 23\
> > >  19:25:53 GMT 2013
> > > 
> > > The proxy expiration time listed above is two hours *earlier*
> > > than
> > > the current time (as seen in the message's UTC timestamp).  Is
> > > that correct, or a possible cause of this problem?
> > > 
> > > The main symptom seems to be this:
> > > 
> > > Execution failed:
> > > 	Exception in getlanduse:
> > >     Arguments: [../data/modis/2002/h00v09.rgb]
> > >     Host: beagle
> > >     Directory:
> > >     modis01-20130309-2124-7ua3bde3/jobs/d/getlanduse-d24rhd6l
> > > 
> > > Caused by:
> > > 	Could not submit job
> > > Caused by:
> > > 	Could not start coaster service
> > > Caused by:
> > > 	Task ended before registration was received.
> > > Failed to download bootstrap jar from
> > > http://midway001.rcc.uchicago.edu:50001
> > > ---
> > > 
> > > Yet Ive verified that midway login4 (which is the target system)
> > > can connect to this hostname and port (with nc -l and telnet)
> > > 
> > > - Mike
> > > 
> > > 
> > 
> > 
> > 
> _______________________________________________
> Swift-devel mailing list
> Swift-devel at ci.uchicago.edu
> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
> 



More information about the Swift-devel mailing list