[Swift-devel] bug 53

Ioan Raicu iraicu at cs.uchicago.edu
Thu Sep 13 10:27:12 CDT 2007


If you mean bug 53, why not... it would be great if that allows MolDyn 
to scale better than in the past, and we could do some comparison runs 
between Falkon and GRAM/PBS for such a large workflow!

Ioan

Mihael Hategan wrote:
> May I still fix that bug though?
>
> On Thu, 2007-09-13 at 09:54 -0500, Ioan Raicu wrote:
>   
>> Hi,
>> I am still working on the new feature for Falkon to avoid submitting
>> tasks to known bad nodes, and to perhaps do its own retries for failed
>> jobs with certain known errors (i.e. stale NFS handle).  I should have
>> that ready for next week to try out.  Once this new feature is in, we
>> could try MolDyn again to see how it behaves.
>>
>> About avoiding Falkon of MolDyn, I recall something about the
>> scalability/policies of GRAM/PBS to handle many con current jobs,
>> having to throttle job submissions to something around 1 job every 10
>> seconds (for sustained periods of time, short bursts could send
>> faster), and the fact that only a few 10s of nodes would be used
>> concurrently, even though the sites that it was running on had more
>> free nodes.  I also think that MolDyn through GRAM/PBS was running
>> only 1 job per node, in essence only using 1 processor of the 2 per
>> node.  I think the largest workflow Nika was able to run over GRAM/PBS
>> was 5 molecules, 421 jobs (but only 340 jobs in the large stage).
>> Nika, were there other problems you encountered?
>>
>> Ioan
>>
>> Mihael Hategan wrote: 
>>     
>>> Very well Sir. I shall see to the priority of the issue being raised.
>>>
>>> On Thu, 2007-09-13 at 14:09 +0000, Ben Clifford wrote:
>>>   
>>>       
>>>> I think one of the main impediments to moldyn running with GRAM directly 
>>>> is bug 53 which is a request for sumission rate limiting.
>>>>
>>>> It might be relatively easy to implement that and see how the MolDyn 
>>>> workflow behaves then.
>>>>
>>>> I'm interested to see if Falkon can be avoided for this workflow.
>>>>
>>>>     
>>>>         
>>> _______________________________________________
>>> Swift-devel mailing list
>>> Swift-devel at ci.uchicago.edu
>>> http://mail.ci.uchicago.edu/mailman/listinfo/swift-devel
>>>
>>>   
>>>       
>> -- 
>> ============================================
>> Ioan Raicu
>> Ph.D. Student
>> ============================================
>> Distributed Systems Laboratory
>> Computer Science Department
>> University of Chicago
>> 1100 E. 58th Street, Ryerson Hall
>> Chicago, IL 60637
>> ============================================
>> Email: iraicu at cs.uchicago.edu
>> Web:   http://www.cs.uchicago.edu/~iraicu
>>        http://dsl.cs.uchicago.edu/
>> ============================================
>> ============================================
>>     
>
>
>   

-- 
============================================
Ioan Raicu
Ph.D. Student
============================================
Distributed Systems Laboratory
Computer Science Department
University of Chicago
1100 E. 58th Street, Ryerson Hall
Chicago, IL 60637
============================================
Email: iraicu at cs.uchicago.edu
Web:   http://www.cs.uchicago.edu/~iraicu
       http://dsl.cs.uchicago.edu/
============================================
============================================

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/swift-devel/attachments/20070913/179068fe/attachment.html>


More information about the Swift-devel mailing list