[Swift-devel] [Bug 72] Campaign for scaling wf up to 244 molecules
bugzilla-daemon at mcs.anl.gov
bugzilla-daemon at mcs.anl.gov
Thu Jun 28 16:12:48 CDT 2007
http://bugzilla.mcs.anl.gov/swift/show_bug.cgi?id=72
------- Comment #1 from wilde at mcs.anl.gov 2007-06-28 16:12 -------
Ive reviewed this email thread on this bug, and am moving this discussion to
bugzilla.
I and am uncertain about the following - can people involved (Nika, Ioan,
Mihael) clarify:
- did Mihael discover an error in Falkon mutex code?
- if so was it fixed, and did it correct the problem of missed completion
notifications?
- whats the state of the "unable to write output file" problem?
- do we still have a bad node in UC-Teraport w/ NSF Stale file handles? If so,
was that reported? (This raises interesting issues in troubleshooting and
trouble workaround)
- do we have a plan for how to run this WF at scale? Meaning how to get 244
nodes for several days, whether we can scale up beyond
1-processor-per-molecule, what the expected runtime is, how to deal with
errors/restarts, etc? (Should detail this here in bugz).
--
Configure bugmail: http://bugzilla.mcs.anl.gov/swift/userprefs.cgi?tab=email
------- You are receiving this mail because: -------
You are on the CC list for the bug, or are watching someone who is.
More information about the Swift-devel
mailing list