[Swift-devel] [Bug 72] Campaign for scaling wf up to 244 molecules

bugzilla-daemon at mcs.anl.gov bugzilla-daemon at mcs.anl.gov
Thu Jun 28 16:12:48 CDT 2007


http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=72





------- Comment #1 from wilde at mcs.anl.gov  2007-06-28 16:12 -------
I've reviewed the email thread on this bug and am moving the discussion to
bugzilla.

I am uncertain about the following. Can the people involved (Nika, Ioan,
Mihael) clarify:

- did Mihael discover an error in the Falkon mutex code?

- if so, was it fixed, and did the fix correct the problem of missed completion
notifications?

- what's the state of the "unable to write output file" problem?

- do we still have a bad node in UC-Teraport w/ NFS stale file handles? If so,
was that reported? (This raises interesting issues in troubleshooting and
working around such trouble.)

- do we have a plan for how to run this WF at scale? That is: how to get 244
nodes for several days, whether we can scale up beyond
1-processor-per-molecule, what the expected runtime is, how to deal with
errors/restarts, etc. (We should detail this here in bugzilla.)




