[Swift-devel] coasters-hosts.pl script

Justin M Wozniak wozniak at mcs.anl.gov
Fri Mar 2 11:01:02 CST 2012


Yes, I must have tested this with a different log file.  I just checked in 
and installed in ~wozniak/Public a fix for this that launches 
WORKER_INIT_CMD after the reconnect().  I am a little worried about 
timeouts, but it works so far.  I will continue testing...
 	Justin

On Thu, 1 Mar 2012, Jonathan Monette wrote:

> Justin,
>   I have been trying to help Emalayan get the host list file for the worker-init.pl script.  It seems the cps log file is not providing the IP addresses for the coasters-hosts.pl script.  I thought this might be because we did not have the correct log4j setting, but we do have the Coaster service Cpu logger set to DEBUG.  So for some reason the workers are not connecting to the service.  When I comment out the export WORKER_ENVIRONMENT="…" line in the coaster-service.conf file, the workers connect and the cps log file shows their IP addresses.  However, when this line is set, they do not seem to connect.
>
> Emalayan thought there might be some sort of circular dependency between the host-list file and the worker.  The worker requires the host-list file so that it can run the worker-init.pl script and then connect, but the host-list file cannot be generated because the workers cannot connect.  I noticed that in your swift-test directory the cps files did contain the IP addresses, and coasters-hosts.pl found and reported them.  Did you run that test with the WORKER_ENVIRONMENT variable set in the coaster-service.conf file?  Any idea what may be happening?  cqstat shows the job as running.
>
> A side note: on the mosaswift site, your example talks about running coasters-hosts.pl on the cps log, but the example you provide runs it on logs/coasters.log.  This may need to be changed.  Also, the example should provide the log4j setting required to generate the Cpu line with the worker IP address, to make clear that this setting is needed for the script to work.
>
> For reference, this line: log4j.logger.org.globus.cog.abstraction.coaster.service.job.manager.Cpu=DEBUG
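
For anyone checking a cps log by hand, here is a rough sketch of the kind of scan discussed above: keep only lines that mention the Cpu logger and pull out anything that looks like an IPv4 address.  This is not the coasters-hosts.pl implementation, and the exact format of the Cpu DEBUG lines is an assumption; the function name extract_worker_ips and the "Cpu" marker string are hypothetical and only for illustration.

```python
import re

# Matches dotted-quad candidates; the octet range is checked separately below.
IPV4 = re.compile(r"\b(?:\d{1,3}\.){3}\d{1,3}\b")


def extract_worker_ips(lines, marker="Cpu"):
    """Return unique IPv4 addresses, in first-seen order, from lines containing marker.

    The marker and the log line layout are assumptions, not the real cps format.
    """
    seen = []
    for line in lines:
        if marker not in line:
            continue  # skip lines not produced by the (assumed) Cpu logger
        for candidate in IPV4.findall(line):
            octets_ok = all(0 <= int(part) <= 255 for part in candidate.split("."))
            if octets_ok and candidate not in seen:
                seen.append(candidate)
    return seen
```

If this returns an empty list for a log where the Cpu logger is set to DEBUG, that would be consistent with the workers never having connected, rather than with a logging misconfiguration.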

-- 
Justin M Wozniak

