[mpich-discuss] Hydra issues

Scott Atchley atchley at myri.com
Wed Aug 26 15:09:42 CDT 2009


No, the ring starts fast enough. It is connecting 1024 processes that  
is slow (allgatherv).

By contrast, Intel MPI launched in < 10 seconds.

Scott

On Aug 26, 2009, at 4:07 PM, Rusty Lusk wrote:

> I assume that you mean launching the MPD ring is slow.  Once the MPD  
> ring is up, launching should be quick.   The original idea was that  
> the MPD ring would be persistent across jobs, even from different  
> people, as long as the jobs used the same nodes.
>
> Rusty
>
> On Wednesday,Aug 26, 2009, at 2:45 PM, Scott Atchley wrote:
>
>> On Aug 26, 2009, at 3:39 PM, Pavan Balaji wrote:
>>
>>>>> However you could use one of the various workarounds for this,  
>>>>> such as an LD_PRELOADed setvbuf call: http://lists.gnu.org/archive/html/bug-coreutils/2008-11/msg00164.html
>>>> This does not change the behavior.
>>>> I am still stumped as to why there is no delay when using  
>>>> persistent (launch-mode=2) versus a delay with no proxies (launch- 
>>>> mode=1).
>>>
>>> This works for me. We need to figure out how to make this portable  
>>> now.
>>>
>>> -- Pavan
>>
>> Thanks for your persistence (no pun intended).
>>
>> When running with 1,024 ranks, launching via MPD can take several  
>> minutes. I am assuming that hydra will launch in seconds.
>>
>> Scott
>



More information about the mpich-discuss mailing list