[Swift-devel] NPE when cpus register to persistent coaster service

Michael Wilde wilde at mcs.anl.gov
Mon Sep 19 23:01:22 CDT 2011


I'm seeing this error in the service log (in trunk) when workers register CPUs with the service:

(Swift svn swift-r5131 (swift modified locally) cog-r3286)

2011-09-19 22:52:57,882-0500 INFO  AbstractStreamKarajanChannel nullChannel started
2011-09-19 22:52:57,882-0500 INFO  AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for addition
2011-09-19 22:52:57,882-0500 INFO  AbstractStreamKarajanChannel nullChannel started
2011-09-19 22:52:57,907-0500 INFO  LocalTCPService Received registration: blockid = swork3, url = f1
2011-09-19 22:52:57,917-0500 INFO  AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for addition
2011-09-19 22:52:57,917-0500 INFO  AbstractStreamKarajanChannel nullChannel started
2011-09-19 22:52:57,949-0500 INFO  MetaChannel MetaChannel: 381531395[1729747990: {}] -> null.bind -> SC-null
2011-09-19 22:52:57,952-0500 DEBUG Cpu worker started: block=swork3 host=f1 id=0
2011-09-19 22:52:57,952-0500 DEBUG Cpu ready for work: block=swork3 id=0
2011-09-19 22:52:57,953-0500 INFO  Block Started CPU 0:1316490777s
2011-09-19 22:52:57,953-0500 TRACE Cpu swork3:0 pull
2011-09-19 22:52:57,953-0500 INFO  Block Started worker swork3:000000
2011-09-19 22:52:57,953-0500 DEBUG Cpu requesting work: block=swork3 id=0 Cpus sleeping: 1
2011-09-19 22:52:57,953-0500 DEBUG Cpu swork3:0 sleeping
2011-09-19 22:52:57,954-0500 DEBUG PullThread sleep: 0:1316490777s
2011-09-19 22:52:57,955-0500 WARN  BlockQueueProcessor Failed to send worker status update to client
java.lang.NullPointerException
        at org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
        at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
        at org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.java:72)
        at org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
        at org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
        at org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
        at org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
        at org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
        at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
        at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChannel.java:375)
2011-09-19 22:52:57,959-0500 INFO  LocalTCPService Received registration: blockid = swork6, url = f1
2011-09-19 22:52:57,959-0500 INFO  MetaChannel MetaChannel: 558519794[723566380: {}] -> null.bind -> SC-null
2011-09-19 22:52:57,959-0500 DEBUG Cpu worker started: block=swork6 host=f1 id=0
2011-09-19 22:52:57,959-0500 DEBUG Cpu ready for work: block=swork6 id=0
2011-09-19 22:52:57,959-0500 INFO  Block Started CPU 0:1316490777s
2011-09-19 22:52:57,959-0500 TRACE Cpu swork6:0 pull
2011-09-19 22:52:57,960-0500 INFO  Block Started worker swork6:000000
2011-09-19 22:52:57,960-0500 DEBUG Cpu requesting work: block=swork6 id=0 Cpus sleeping: 2
2011-09-19 22:52:57,960-0500 WARN  BlockQueueProcessor Failed to send worker status update to client
java.lang.NullPointerException
        at org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
        at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
        at org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.java:72)
        at org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
        at org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
        at org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
        at org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
        at org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
        at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
        at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChannel.java:375)

The service still seems to work.

- Mike

-- 
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory



