[MPICH] MPICH2 hangs on diskless SuSE 10.2 based cluster

Reuti reuti at staff.uni-marburg.de
Wed Dec 20 00:25:56 CST 2006


Hi,

Am 20.12.2006 um 02:03 schrieb Shaun Qualheim:

> Hey everyone...
>
> I'm trying to figure out why I'm having an issue with running a job  
> on multiple machines here.
>
> It worked fine with a 32-bit SuSE 9.3 based setup.
>
> I started it up with a 64-bit base distro and kernel...
>
> I can start up the process with:
> mpd &; mpiexec -n 2 process &
> and it starts with no issues...
>
> When I try doing that with 2 machines though...
> mpdboot -n 2 &; mpiexec -n 2 process &
> It hangs for about 5 minutes and then starts up.

maybe putting both commands in the background leads to a race  
conditilon. Can you try the mpdboot without putting it in the  
background?

-- Reuti




More information about the mpich-discuss mailing list