[Swift-devel] NPE when cpus register to persistent coaster service
Mihael Hategan
hategan at mcs.anl.gov
Tue Sep 20 21:08:46 CDT 2011
Trunk or branch?
On Mon, 2011-09-19 at 23:01 -0500, Michael Wilde wrote:
> Im seeing this error in the service log (in trunk) when workers register CPUs with the service:
>
> (Swift svn swift-r5131 (swift modified locally) cog-r3286)
>
> 2011-09-19 22:52:57,882-0500 INFO AbstractStreamKarajanChannel nullChannel started
> 2011-09-19 22:52:57,882-0500 INFO AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for addition
> 2011-09-19 22:52:57,882-0500 INFO AbstractStreamKarajanChannel nullChannel started
> 2011-09-19 22:52:57,907-0500 INFO LocalTCPService Received registration: blockid = swork3, url = f1
> 2011-09-19 22:52:57,917-0500 INFO AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for addition
> 2011-09-19 22:52:57,917-0500 INFO AbstractStreamKarajanChannel nullChannel started
> 2011-09-19 22:52:57,949-0500 INFO MetaChannel MetaChannel: 381531395[1729747990: {}] -> null.bind -> SC-null
> 2011-09-19 22:52:57,952-0500 DEBUG Cpu worker started: block=swork3 host=f1 id=0
> 2011-09-19 22:52:57,952-0500 DEBUG Cpu ready for work: block=swork3 id=0
> 2011-09-19 22:52:57,953-0500 INFO Block Started CPU 0:1316490777s
> 2011-09-19 22:52:57,953-0500 TRACE Cpu swork3:0 pull
> 2011-09-19 22:52:57,953-0500 INFO Block Started worker swork3:000000
> 2011-09-19 22:52:57,953-0500 DEBUG Cpu requesting work: block=swork3 id=0 Cpus sleeping: 1
> 2011-09-19 22:52:57,953-0500 DEBUG Cpu swork3:0 sleeping
> 2011-09-19 22:52:57,954-0500 DEBUG PullThread sleep: 0:1316490777s
> 2011-09-19 22:52:57,955-0500 WARN BlockQueueProcessor Failed to send worker status update to client
> java.lang.NullPointerException
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
> at org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.j
> ava:72)
> at org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
> at org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
> at org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
> at org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
> at org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
> at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
> at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChanne
> l.java:375)
> 2011-09-19 22:52:57,959-0500 INFO LocalTCPService Received registration: blockid = swork6, url = f1
> 2011-09-19 22:52:57,959-0500 INFO MetaChannel MetaChannel: 558519794[723566380: {}] -> null.bind -> SC-null
> 2011-09-19 22:52:57,959-0500 DEBUG Cpu worker started: block=swork6 host=f1 id=0
> 2011-09-19 22:52:57,959-0500 DEBUG Cpu ready for work: block=swork6 id=0
> 2011-09-19 22:52:57,959-0500 INFO Block Started CPU 0:1316490777s
> 2011-09-19 22:52:57,959-0500 TRACE Cpu swork6:0 pull
> 2011-09-19 22:52:57,960-0500 INFO Block Started worker swork6:000000
> 2011-09-19 22:52:57,960-0500 DEBUG Cpu requesting work: block=swork6 id=0 Cpus sleeping: 2
> 2011-09-19 22:52:57,960-0500 WARN BlockQueueProcessor Failed to send worker status update to client
> java.lang.NullPointerException
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
> at org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
> at org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.j
> ava:72)
> at org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
> at org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
> at org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
> at org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
> at org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
> at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
> at org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChanne
> l.java:375)
>
> The service still seems to work.
>
> - Mike
>
More information about the Swift-devel
mailing list