[Swift-devel] NPE when cpus register to persistent coaster service
Michael Wilde
wilde at mcs.anl.gov
Tue Sep 20 23:01:28 CDT 2011
trunk.
----- Original Message -----
> From: "Mihael Hategan" <hategan at mcs.anl.gov>
> To: "Michael Wilde" <wilde at mcs.anl.gov>
> Cc: "Swift Devel" <swift-devel at ci.uchicago.edu>
> Sent: Tuesday, September 20, 2011 9:08:46 PM
> Subject: Re: NPE when cpus register to persistent coaster service
> Trunk or branch?
>
> On Mon, 2011-09-19 at 23:01 -0500, Michael Wilde wrote:
> > Im seeing this error in the service log (in trunk) when workers
> > register CPUs with the service:
> >
> > (Swift svn swift-r5131 (swift modified locally) cog-r3286)
> >
> > 2011-09-19 22:52:57,882-0500 INFO AbstractStreamKarajanChannel
> > nullChannel started
> > 2011-09-19 22:52:57,882-0500 INFO
> > AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for
> > addition
> > 2011-09-19 22:52:57,882-0500 INFO AbstractStreamKarajanChannel
> > nullChannel started
> > 2011-09-19 22:52:57,907-0500 INFO LocalTCPService Received
> > registration: blockid = swork3, url = f1
> > 2011-09-19 22:52:57,917-0500 INFO
> > AbstractStreamKarajanChannel$Multiplexer (0) Scheduling SC-null for
> > addition
> > 2011-09-19 22:52:57,917-0500 INFO AbstractStreamKarajanChannel
> > nullChannel started
> > 2011-09-19 22:52:57,949-0500 INFO MetaChannel MetaChannel:
> > 381531395[1729747990: {}] -> null.bind -> SC-null
> > 2011-09-19 22:52:57,952-0500 DEBUG Cpu worker started: block=swork3
> > host=f1 id=0
> > 2011-09-19 22:52:57,952-0500 DEBUG Cpu ready for work: block=swork3
> > id=0
> > 2011-09-19 22:52:57,953-0500 INFO Block Started CPU 0:1316490777s
> > 2011-09-19 22:52:57,953-0500 TRACE Cpu swork3:0 pull
> > 2011-09-19 22:52:57,953-0500 INFO Block Started worker swork3:000000
> > 2011-09-19 22:52:57,953-0500 DEBUG Cpu requesting work: block=swork3
> > id=0 Cpus sleeping: 1
> > 2011-09-19 22:52:57,953-0500 DEBUG Cpu swork3:0 sleeping
> > 2011-09-19 22:52:57,954-0500 DEBUG PullThread sleep: 0:1316490777s
> > 2011-09-19 22:52:57,955-0500 WARN BlockQueueProcessor Failed to send
> > worker status update to client
> > java.lang.NullPointerException
> > at
> > org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
> > at
> > org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
> > at
> > org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.j
> > ava:72)
> > at
> > org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
> > at
> > org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
> > at
> > org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
> > at
> > org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChanne
> > l.java:375)
> > 2011-09-19 22:52:57,959-0500 INFO LocalTCPService Received
> > registration: blockid = swork6, url = f1
> > 2011-09-19 22:52:57,959-0500 INFO MetaChannel MetaChannel:
> > 558519794[723566380: {}] -> null.bind -> SC-null
> > 2011-09-19 22:52:57,959-0500 DEBUG Cpu worker started: block=swork6
> > host=f1 id=0
> > 2011-09-19 22:52:57,959-0500 DEBUG Cpu ready for work: block=swork6
> > id=0
> > 2011-09-19 22:52:57,959-0500 INFO Block Started CPU 0:1316490777s
> > 2011-09-19 22:52:57,959-0500 TRACE Cpu swork6:0 pull
> > 2011-09-19 22:52:57,960-0500 INFO Block Started worker swork6:000000
> > 2011-09-19 22:52:57,960-0500 DEBUG Cpu requesting work: block=swork6
> > id=0 Cpus sleeping: 2
> > 2011-09-19 22:52:57,960-0500 WARN BlockQueueProcessor Failed to send
> > worker status update to client
> > java.lang.NullPointerException
> > at
> > org.globus.cog.karajan.workflow.service.channels.ChannelManager.getMetaChannel(ChannelManager.java:433)
> > at
> > org.globus.cog.karajan.workflow.service.channels.ChannelManager.reserveChannel(ChannelManager.java:226)
> > at
> > org.globus.cog.abstraction.coaster.service.job.manager.PassiveQueueProcessor.registrationReceived(PassiveQueueProcessor.j
> > ava:72)
> > at
> > org.globus.cog.abstraction.coaster.service.job.manager.JobQueue.registrationReceived(JobQueue.java:143)
> > at
> > org.globus.cog.abstraction.coaster.service.LocalTCPService.registrationReceived(LocalTCPService.java:64)
> > at
> > org.globus.cog.abstraction.coaster.service.local.RegistrationHandler.requestComplete(RegistrationHandler.java:57)
> > at
> > org.globus.cog.karajan.workflow.service.handlers.RequestHandler.receiveCompleted(RequestHandler.java:84)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractKarajanChannel.handleRequest(AbstractKarajanChannel.java:416)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel.step(AbstractStreamKarajanChannel.java:157)
> > at
> > org.globus.cog.karajan.workflow.service.channels.AbstractStreamKarajanChannel$Multiplexer.run(AbstractStreamKarajanChanne
> > l.java:375)
> >
> > The service still seems to work.
> >
> > - Mike
> >
--
Michael Wilde
Computation Institute, University of Chicago
Mathematics and Computer Science Division
Argonne National Laboratory
More information about the Swift-devel
mailing list