<div dir="ltr"><div>I've been experimenting with the coaster C client on midway and have been seeing a lot of warning messages from the coaster service. I wanted to know if this is a known problem, or get some guidance on how to try to resolve it.<br>
<br></div><div>I'm running the latest github master version.<br><br></div><div>I've been starting the coaster service with active coasters:<br></div><div><br> export GLOBUS_HOSTNAME=172.25.180.72<br> coaster-service -nosec -p 65001<br>
<br></div><div>I then have the coaster C client connect to it and submit jobs, with the following settings: jobManager=slurm,jobQueue=sandyb,tasksperworker=16.<br><br></div><div>The jobs have mostly been completing successfully, but I've also seen some instability and failures. I don't know if it's related to the many warnings in the service log (attached) , e.g. this one:<br>
<br>2014-08-28 01:06:34,174+0000 WARN TaskNotifier Client could not properly process notification: null<br>java.net.SocketException: Broken pipe<br> at java.net.SocketOutputStream.socketWrite0(Native Method)<br> at java.net.SocketOutputStream.socketWrite(SocketOutputStream.java:113)<br>
at java.net.SocketOutputStream.write(SocketOutputStream.java:147)<br> at org.globus.cog.coaster.channels.Sender.send(Sender.java:149)<br> at org.globus.cog.coaster.channels.Sender.run(Sender.java:85)<br>
<br></div><div>or this one:<br></div><div><br>2014-08-28 01:06:51,872+0000 WARN RemoteLogger Failed to send remote log message: BLOCK_SHUTDOWN id=0828-0601150-000000<br>org.globus.cog.coaster.channels.ChannelException: Channel died and no contact available<br>
at org.globus.cog.coaster.channels.ChannelManager.connect(ChannelManager.java:253)<br> at org.globus.cog.coaster.channels.ChannelManager.reserveChannel(ChannelManager.java:274)<br> at org.globus.cog.coaster.channels.ChannelManager.reserveChannel(ChannelManager.java:245)<br>
at org.globus.cog.abstraction.coaster.rlog.RemoteLogger.log(RemoteLogger.java:53)<br> at org.globus.cog.abstraction.coaster.service.job.manager.Block.shutdown(Block.java:303)<br> at org.globus.cog.abstraction.coaster.service.job.manager.Block.shutdownIfEmpty(Block.java:236)<br>
at org.globus.cog.abstraction.coaster.service.job.manager.Block.suspend(Block.java:576)<br> at org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.removeIdleBlocks(BlockQueueProcessor.java:472)<br>
at org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.updatePlan(BlockQueueProcessor.java:750)<br> at org.globus.cog.abstraction.coaster.service.job.manager.BlockQueueProcessor.run(BlockQueueProcessor.java:161)<br>
<br><br></div><div>Any guidance or thoughts would be appreciated.<br><br></div><div>- Tim<br></div><div><br></div></div>