<html><head><style type='text/css'>p { margin: 0; }</style></head><body>Thanks. I sent mail to beagle-support already but I will subscribe to that list and respond to beagle-support about it. Thanks again. <br><br><div id="htc_header" style="">----- Reply message -----<br>From: "Michael Wilde" <wilde@mcs.anl.gov><br>Date: Sat, Aug 20, 2011 9:03 pm<br>Subject: [Swift-devel] Index out of bounds<br>To: "Jonathan Monette" <jonmon@mcs.anl.gov><br>Cc: <swift-devel@ci.uchicago.edu>, "Daniel S. Katz" <dsk@ci.uchicago.edu><br><br><br></div><div style='font-family: Times New Roman; font-size: 12pt; color: #000000'>Jon, the list you want for Beagle issue notifications is beagle-users. You can subscribe via the link:<div><br></div><div><div>https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/beagle-users</div><div><br></div><div>- Mike</div><div><br></div><div><div>----- Forwarded Message -----</div><div>From: "Greg Cross" <grog@ci.uchicago.edu></div><div>To: beagle-users@ci.uchicago.edu</div><div>Sent: Saturday, August 20, 2011 2:12:45 PM</div><div>Subject: [beagle-users] Outage update</div><div><br></div><div>Lustre is mounting properly but there is a communication failure between the Moab and ALPS scheduler components. This issue is under investigation and has been escalated to Cray.</div><div><br></div><div>As a reminder, please DO NOT attempt to log into the system during this or any other maintenance period. While logins should be denied at this time, any user processes found running on login or sandbox nodes will be terminated without warning. Users who do not respect this may be contacted individually.</div><div><br></div><div>Definitive notification will be sent to this mailing list when the system is available for use.</div><div><br></div><div><br></div><div>_______________________________________________</div><div>beagle-users mailing list</div><div>beagle-users@ci.uchicago.edu</div><div>https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/beagle-users</div><div><br></div><br><hr id="zwchr"><blockquote style="border-left:2px solid rgb(16, 16, 255);margin-left:5px;padding-left:5px;"><b>From: </b>"Jonathan Monette" <jonmon@mcs.anl.gov><br><b>To: </b>"Daniel S. Katz" <dsk@ci.uchicago.edu><br><b>Cc: </b>swift-devel@ci.uchicago.edu<br><b>Sent: </b>Saturday, August 20, 2011 4:20:35 PM<br><b>Subject: </b>Re: [Swift-devel] Index out of bounds<br><br>Thanks. In the meantime could someone let me know when beagle is back in production so I can check my run? <br><br><div id="htc_header" style="">----- Reply message -----<br>From: "Daniel S. Katz" <dsk@ci.uchicago.edu><br>Date: Sat, Aug 20, 2011 3:14 pm<br>Subject: [Swift-devel] Index out of bounds<br>To: "Jonathan Monette" <jonmon@mcs.anl.gov><br>Cc: "Ketan Maheshwari" <ketancmaheshwari@gmail.com>, "swift-devel@ci.uchicago.edu" <swift-devel@ci.uchicago.edu><br><br><br></div><div>Yes, write to beagle-support. <br><br>On Aug 20, 2011, at 14:52, "Jonathan Monette" <<a href="mailto:jonmon@mcs.anl.gov" target="_blank">jonmon@mcs.anl.gov</a>> wrote:<br><br></div><div></div><blockquote><div>Ok thanks. It seems that I was not added to the beagle-notify list. Could someone point me to a link I can subscribe to? Or do I subscribe by sending mail to beagle-support?<br><br><div id="htc_header" style="">----- Reply message -----<br>From: "Ketan Maheshwari" <<a href="mailto:ketancmaheshwari@gmail.com" target="_blank">ketancmaheshwari@gmail.com</a>><br>Date: Sat, Aug 20, 2011 7:45 am<br>Subject: [Swift-devel] Index out of bounds<br>To: "Jonathan Monette" <<a href="mailto:jonmon@mcs.anl.gov" target="_blank">jonmon@mcs.anl.gov</a>><br>Cc: <<a href="mailto:swift-devel@ci.uchicago.edu" target="_blank">swift-devel@ci.uchicago.edu</a>><br><br><br></div>Yes, Beagle went down yesterday. There was a notice.<div><br></div><div>Current status as of Aug 19, 5.30PM:</div><div><br></div><div>==</div><div><span class="Apple-style-span" style="font-family: monospace; background-color: rgb(255, 255, 255); font-size: medium; ">At this time, Lustre is not starting properly on Beagle. This may be related to a configuration change that was made during the last outage. The effort to restore system availability is still in active progress.</span></div>
<div><font class="Apple-style-span" face="monospace" size="3">==<br></font><br></div><div><br></div><div>Ketan</div><div><br><div class="gmail_quote">On Sat, Aug 20, 2011 at 12:03 AM, Jonathan Monette <span dir="ltr"><<a href="mailto:jonmon@mcs.anl.gov" target="_blank"></a><a href="mailto:jonmon@mcs.anl.gov" target="_blank">jonmon@mcs.anl.gov</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin:0 0 0 .8ex;border-left:1px #ccc solid;padding-left:1ex;">I updated and rebuilt and added that line to my log4j properties. Does anyone know if Beagle is down? showq says there is no service listening to sdb:<number>. qstat shows that I have a job sitting in the queue but it doesn't look like jobs are running.<br>
<br>
I am using both PADS and Beagle for this execution. In this case where jobs are not executing on Beagle shouldn't Swift start submitting jobs to PADS? I do not see that behavior.<br>
<br>
This run is still executing. But if you would like to look at the log it is at <a href="http://www.ci.uchicago.edu/~jonmon/logs/montage-2.log" target="_blank"></a><a href="http://www.ci.uchicago.edu/~jonmon/logs/montage-2.log" target="_blank">www.ci.uchicago.edu/~jonmon/logs/montage-2.log</a>. Only 23 tasks have finished before it just sits there waiting for Beagle to run.<br>
<div><div></div><div class="h5">On Aug 19, 2011, at 2:46 PM, Jonathan Monette wrote:<br>
<br>
> Sure can. I add that line to the log4j file or in a different properties file.<br>
><br>
> ----- Reply message -----<br>
> From: "Mihael Hategan" <<a href="mailto:hategan@mcs.anl.gov" target="_blank"></a><a href="mailto:hategan@mcs.anl.gov" target="_blank">hategan@mcs.anl.gov</a>><br>
> Date: Fri, Aug 19, 2011 2:03 pm<br>
> Subject: Index out of bounds<br>
> To: "Jonathan Monette" <<a href="mailto:jonmon@mcs.anl.gov" target="_blank"></a><a href="mailto:jonmon@mcs.anl.gov" target="_blank">jonmon@mcs.anl.gov</a>><br>
> Cc: <<a href="mailto:swift-devel@ci.uchicago.edu" target="_blank"></a><a href="mailto:swift-devel@ci.uchicago.edu" target="_blank">swift-devel@ci.uchicago.edu</a>><br>
><br>
><br>
> Hmm. So I can't see how this manages to happen.<br>
><br>
> I added some checks and debugging statements. Can you update, set log<br>
> level of org.globus.cog.abstraction.impl.file.local to DEBUG, re-run and<br>
> then post the log when the exception pops up?<br>
><br>
> Mihael<br>
><br>
> On Thu, 2011-08-18 at 23:14 -0500, Jonathan Monette wrote:<br>
> > Ok. The log is at<br>
> > <a href="http://www.ci.uchicago.edu/~jonmon/logs/montage-1.log" target="_blank"></a><a href="http://www.ci.uchicago.edu/~jonmon/logs/montage-1.log" target="_blank">www.ci.uchicago.edu/~jonmon/logs/montage-1.log</a><br>
> > On Aug 18, 2011, at 5:56 PM, Mihael Hategan wrote:<br>
> ><br>
> > > It's probably a good idea to post the stack trace of that exception now<br>
> > > rather than later.<br>
> > ><br>
> > > On Thu, 2011-08-18 at 13:09 -0500, Jonathan Monette wrote:<br>
> > >> Hello,<br>
> > >> I was running 0.93 with one a relatively small run, a 350 task run.<br>
> > >> The run failed on one of the final tasks. I checked the log file and<br>
> > >> saw some index out of bounds errors. I tried with a smaller run and<br>
> > >> didn't see the error.<br>
> > >><br>
> > >> This run was using beagle, pads, and communicado. I was also using<br>
> > >> cdm. I will post the log in a bit. I am seeing if I cam replicate it<br>
> > >> without using cdm and with a smaller site pool.<br>
> > >><br>
> > ><br>
> > ><br>
> ><br>
><br>
><br>
><br>
><br>
</div></div><div><div></div><div class="h5">> _______________________________________________<br>
> Swift-devel mailing list<br>
> <a href="mailto:Swift-devel@ci.uchicago.edu" target="_blank"></a><a href="mailto:Swift-devel@ci.uchicago.edu" target="_blank">Swift-devel@ci.uchicago.edu</a><br>
> <a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel" target="_blank"></a><a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel" target="_blank">https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel</a><br>
<br>
_______________________________________________<br>
Swift-devel mailing list<br>
<a href="mailto:Swift-devel@ci.uchicago.edu" target="_blank"></a><a href="mailto:Swift-devel@ci.uchicago.edu" target="_blank">Swift-devel@ci.uchicago.edu</a><br>
<a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel" target="_blank"></a><a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel" target="_blank">https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel</a><br>
</div></div></blockquote></div><br><br clear="all"><div><br></div>-- <br>Ketan<br><br><br>
</div>
</div></blockquote><blockquote><div><span>_______________________________________________</span><br><span>Swift-devel mailing list</span><br><span><a href="mailto:Swift-devel@ci.uchicago.edu" target="_blank">Swift-devel@ci.uchicago.edu</a></span><br><span><a href="https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel" target="_blank">https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel</a></span><br></div></blockquote><br>_______________________________________________<br>Swift-devel mailing list<br>Swift-devel@ci.uchicago.edu<br>https://lists.ci.uchicago.edu/cgi-bin/mailman/listinfo/swift-devel<br></blockquote><br><span><br><br>-- <br><span name="x"></span>Michael Wilde<br>Computation Institute, University of Chicago<br>Mathematics and Computer Science Division<br>Argonne National Laboratory<br><span name="x"></span><br></span></div></div></div></b