[Swift-devel] coasters won't start

Jonathan Monette jonmon at mcs.anl.gov
Fri Oct 21 14:02:31 CDT 2011


Thanks.  That was the next item on my checklist, but I wasn't sure how to check it, or how to specify a port range for coasters to use.  Do coasters use the GLOBUS_TCP_PORT_RANGE and the GLOBUS_SOURCE_PORT_RANGE environment variables for this?  My sites.xml is below.

<config>
  <pool handle="localhost">
    <execution provider="local" />
    <filesystem provider="local" />
    <workdirectory>/gpfs/pads/swift/jonmon/Swift/work/localhost</workdirectory>

    <profile namespace="karajan" key="jobThrottle">.05</profile>

    <profile namespace="env" key="SWIFT_GEN_SCRIPTS">KEEP</profile>
  </pool>

  <pool handle="pads">
    <execution provider="coaster" jobmanager="ssh:pbs" url="login.pads.ci.uchicago.edu" />
    <filesystem provider="local" />
    <workdirectory>/gpfs/pads/swift/jonmon/Swift/work/pads</workdirectory>

    <profile namespace="globus" key="project">CI-CCR000013</profile>
    <profile namespace="globus" key="maxtime">3600</profile>
    <profile namespace="globus" key="jobsPerNode">1</profile>
    <profile namespace="globus" key="slots">192</profile> <!-- Max number of jobs for the fast queue on PADS => 192 -->
    <profile namespace="globus" key="nodeGranularity">1</profile>
    <profile namespace="globus" key="maxNodes">1</profile>
    <profile namespace="globus" key="queue">fast</profile>

    <profile namespace="karajan" key="jobThrottle">5</profile>
    <profile namespace="karajan" key="initialScore">10000</profile>

    <profile namespace="env" key="SWIFT_GEN_SCRIPTS">KEEP</profile>
  </pool>

  <pool handle="beagle">
    <execution provider="coaster" jobmanager="ssh:pbs" url="login.beagle.ci.uchicago.edu" />
    <profile namespace="globus" key="project">CI-CCR000013</profile>
    <filesystem provider="local" />
    <workdirectory>/gpfs/pads/swift/jonmon/Swift/work/beagle</workdirectory>

    <profile namespace="globus" key="ppn">24</profile>
    <profile namespace="globus" key="providerAttributes">pbs.aprun;pbs.mpp;depth=24</profile>
    <profile namespace="globus" key="jobsPerNode">24</profile>
    <profile namespace="globus" key="maxTime">1000</profile>
    <profile namespace="globus" key="slots">1</profile>
    <profile namespace="globus" key="nodeGranularity">1</profile>
    <profile namespace="globus" key="maxNodes">1</profile>

    <profile namespace="karajan" key="jobThrottle">.63</profile>
    <profile namespace="karajan" key="initialScore">10000</profile>

    <profile namespace="env" key="SWIFT_GEN_SCRIPTS">KEEP</profile>
  </pool>
</config>

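
On the question of how to check the ports: one generic way to see whether a firewall lets TCP connections through on a given range is to try connecting to each port. The Python sketch below is only an illustration and is not part of Swift or coasters. The function names are made up for this example, and it assumes the range is written as "min,max", which is the format GLOBUS_TCP_PORT_RANGE uses. Whether coasters actually read that variable is still the open question above. Note that a port only counts as reachable if something is listening on it, so a listener has to be running on the far side for the test to mean anything.

```python
import socket


def parse_port_range(value):
    """Parse a "min,max" string, the format used by GLOBUS_TCP_PORT_RANGE."""
    lo_text, hi_text = value.split(",")
    lo, hi = int(lo_text), int(hi_text)
    if not 0 < lo <= hi <= 65535:
        raise ValueError("invalid port range: %r" % value)
    return lo, hi


def port_is_reachable(host, port, timeout=2.0):
    """Return True if a TCP connection to host:port succeeds within timeout."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        # Covers refused connections, timeouts (often a firewall silently
        # dropping packets), and name resolution failures.
        return False


def reachable_ports(host, lo, hi, timeout=2.0):
    """List the ports in [lo, hi] on host that accept a TCP connection."""
    return [p for p in range(lo, hi + 1) if port_is_reachable(host, p, timeout)]
```

For example, reachable_ports("login.pads.ci.uchicago.edu", *parse_port_range("50000,50010")) would probe eleven ports on the PADS login node; the range here is invented for illustration. The call should be made from the machine that has to open the connection, since firewall rules often differ by direction.
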
On Oct 21, 2011, at 1:58 PM, Ketan Maheshwari wrote:

> Jon,
> 
> There were some changes in the firewall rules regarding which ports are allowed to be open on various ci machines. It's a long shot, but maybe you want to check on that.
> 
> Can you paste your sites.xml so I can take a look and see if I find something?
> 
> Ketan
> 
> 
> On Fri, Oct 21, 2011 at 1:50 PM, Jonathan Monette <jonmon at mcs.anl.gov> wrote:
> Does anyone have a thought on this?  I'm not sure what is wrong.  I can't seem to get coasters registered from PADS or Beagle.  The log also reports a FileNotFoundException when trying to transfer back the wrapper log.  Does this have something to do with the problem?  I have been assuming that this error was thrown because the coaster service is not connecting.
> 
> On Oct 20, 2011, at 3:04 PM, Jonathan Monette wrote:
> 
>> Here is a log indicating that the coaster service isn't starting.  This is on PADS with automatic coasters using 0.93RC3.
>> http://www.ci.uchicago.edu/~jonmon/logs/coasters_wont_start.log
>> 
>> And here is the coaster log in zipped form
>> http://www.ci.uchicago.edu/~jonmon/logs/coasters.tar.gz
>> 
>> All the files used for this run are located in ~jonmon/PADS/Swift/SwiftMontage/m101_tutorial/run.0039 on the ci network.
>> 
>> 
>> 
>> _______________________________________________
>> Swift-devel mailing list
>> Swift-devel at ci.uchicago.edu
>> https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel
> 
> 
> -- 
> Ketan
> 
> 
