Emalayan,<div><br></div><div>From your symptoms, it seems you are facing the same issue as I've been. Could you tell more about the amount of data that needs to be staged to run the Montage stages during which these warnings turn up? How much time elapses since the start of your workflow after which you see these messages?<br>
<br>Also, what version of Swift is this?</div><div><br></div><div>Regards,</div><div>Ketan</div><div><br><div class="gmail_quote">On Thu, Jan 19, 2012 at 5:51 PM, Emalayan Vairavanathan <span dir="ltr"><<a href="mailto:svemalayan@yahoo.com">svemalayan@yahoo.com</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex"><div><div style="color:#000;background-color:#fff;font-family:times new roman,new york,times,serif;font-size:12pt"><div>
<span>Dear All,</span></div>

<div><br>
  <span></span></div>

<div><span>I have a problem in running Montage with Coasters (<span style="font-style:italic">in our local cluster - no batch schedulers</span>). After few stages the swift run-time continuously prints the warnings below. Any ideas ? Should I increase the heartbeat count ?<br>
</span></div><div><span><br></span></div><div><span>Everything works fine when I try to run the same montage-scripts with swift on a single machine.<br></span></div><div><br><span></span></div><div><span>Thank you</span></div>
<div><span>Emalayan<br></span></div><div><span><br>
  </span></div>

<div><br>
  <span></span></div>

<div style="font-style:italic"><span>2012-01-19
 15:38:09,207-0800 WARN  Command Command(119, HEARTBEAT): handling reply
 timeout; sendReqTime=120119-153609.206, sendTime=120119-153609.206, 
now=120119-153809.207<br>
<a href="tel:2012-01-19%2015" value="+12012011915" target="_blank">2012-01-19 15</a>:38:09,207-0800 INFO  Command Command(119, HEARTBEAT): re-sending<br>
<a href="tel:2012-01-19%2015" value="+12012011915" target="_blank">2012-01-19 15</a>:38:09,209-0800 WARN  Command Command(119, HEARTBEAT)fault was: Reply timeout<br>
org.globus.cog.karajan.workflow.service.ReplyTimeoutException<br>
        at org.globus.cog.karajan.workflow.service.commands.Command.handleReplyTimeout(Command.java:288)<br>
        at org.globus.cog.karajan.workflow.service.commands.Command$Timeout.run(Command.java:293)<br>
        at java.util.TimerThread.mainLoop(Timer.java:534)<br>
        at java.util.TimerThread.run(Timer.java:484)</span></div>
</div></div><br>_______________________________________________<br>
Swift-user mailing list<br>
<a href="mailto:Swift-user@ci.uchicago.edu">Swift-user@ci.uchicago.edu</a><br>
<a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user" target="_blank">https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-user</a><br></blockquote></div><br><br clear="all"><div><br></div>-- <br>
Ketan<br><br><br>
</div>