[Swift-devel] Re: SCEC postproc workflow unresponsive after first 2 tasks

Allan Espinosa aespinosa at cs.uchicago.edu
Wed Jun 1 13:02:43 CDT 2011


Hi Ketan,

Could you add debugging for Swift's vdl:stagein calls?  Also, is this
using the stable branch?

Here's the log4j.properties I always use:

# Set root category priority to WARN and its appenders to CONSOLE and FILE.
log4j.rootCategory=INFO, CONSOLE, FILE

log4j.appender.CONSOLE=org.apache.log4j.ConsoleAppender
log4j.appender.CONSOLE.layout=org.apache.log4j.PatternLayout
log4j.appender.CONSOLE.Threshold=INFO
log4j.appender.CONSOLE.layout.ConversionPattern=%m%n

log4j.appender.FILE=org.apache.log4j.FileAppender
log4j.appender.FILE.File=swift.log
log4j.appender.FILE.layout=org.apache.log4j.PatternLayout
log4j.appender.FILE.layout.ConversionPattern=%d{yyyy-MM-dd
HH:mm:ss,SSSZZZZZ} %-5p %c{1} %m%n

log4j.logger.swift=DEBUG

log4j.logger.org.apache.axis.utils=ERROR

log4j.logger.org.globus.swift.trace=INFO

log4j.logger.org.griphyn.vdl.karajan.Loader=DEBUG
log4j.logger.org.globus.cog.karajan.workflow.events.WorkerSweeper=WARN
log4j.logger.org.globus.cog.karajan.workflow.nodes.FlowNode=WARN
log4j.logger.org.globus.cog.karajan.scheduler.WeightedHostScoreScheduler=DEBUG
log4j.logger.org.griphyn.vdl.toolkit.VDLt2VDLx=DEBUG
log4j.logger.org.griphyn.vdl.karajan.VDL2ExecutionContext=DEBUG
log4j.logger.org.globus.cog.abstraction.impl.common.task.TaskImpl=INFO
log4j.logger.org.griphyn.vdl.karajan.lib.GetFieldValue=DEBUG
log4j.logger.org.griphyn.vdl.engine.Karajan=INFO
log4j.logger.org.globus.cog.abstraction.coaster.rlog=DEBUG

# log4j.logger.org.globus.swift.data.Director=DEBUG
log4j.logger.org.griphyn.vdl.karajan.lib=INFO


log4j.logger.org.griphyn.vdl.karajan.lib.SetFieldValue=OFF
log4j.logger.org.griphyn.vdl.mapping.AbstractDataNode=OFF

# Transfer
#log4j.logger.org.globus.ftp=DEBUG
#log4j.logger.org.globus.gridftp=DEBUG


-Allan

2011/6/1 ketan <ketancmaheshwari at gmail.com>:
> Allan,
>
> I tried to run the posproc workflow on the OSG whitelisted resources.
> However, the workflow seems not to respond after completing the first two
> tasks:
>
> I get something like this:
>
> Progress:  Selecting site:248  Stage in:2  Finished successfully:2
> Progress:  Selecting site:248  Stage in:2  Finished successfully:2
> Progress:  Selecting site:248  Stage in:2  Finished successfully:2
> Progress:  Selecting site:248  Stage in:2  Finished successfully:2
> Progress:  Selecting site:248  Stage in:2  Finished successfully:2
> ..
> ..
> ..
>
>
> The sites.xml, tc.data and the log files are on bridled as follows:
>
> /home/ketan/osg-tg-effort/cybershake/condor_osg.xml
>
> /home/ketan/osg-tg-effort/cybershake/tc.data
>
> /home/ketan/osg-tg-effort/cybershake/postproc-20110601-0951-43n3a22g.log
>
> Swift is:
>
> [bridled.ci.uchicago.edu:cybershake]$ which swift
> swift is /home/ketan/swift-0.92.1/bin/swift
>
> I have memcached on, sourced /opt/osg-1.x.x/setup.sh and have a valid proxy.
>
> Could you indicate what are the first debugging steps that I should be
> taking on osg in such condition?
>
>
> Thanks,
> Ketan
>
>



-- 
Allan M. Espinosa <http://amespinosa.wordpress.com>
PhD student, Computer Science
University of Chicago <http://people.cs.uchicago.edu/~aespinosa>



More information about the Swift-devel mailing list