[mpich-discuss] mvapich2 on multiple nodes: 2 problems

Dave Goodell goodell at mcs.anl.gov
Fri Apr 23 11:24:18 CDT 2010


On Apr 23, 2010, at 9:42 AM, abc def wrote:

> Having now had a chance to reinstall mpich2 on the first 2  
> computers, I have tested the program trying to use the 4 cores of  
> both computers together, using:
>
> mpiexec -n 8 /home/me/software.ex > job.out 2> job.err
> and
> mpiexec -machinefile ./machinefile -n 8 /home/me/software.ex >  
> job.out 2> job.err
>
> but I get the same error again:
> Fatal error in MPI_Init: Other MPI error, error stack:
> MPIR_Init_thread(310): Initialization failed
> MPID_Init(113).......: channel initialization failed
> MPIDI_CH3_Init(244)..: process not on the same host (quad != december)

This message only comes from using ch3:shm.  There is no other way to  
obtain it.  Something is wrong with your installation.

> This is despite running configure with
>
> configure --prefix=/usr/local/mpich --with-device=ch3:nemesis -- 
> enable-sharedlibs=gcc

Please start with a fresh source tree.  You probably have some old  
compiled code hanging around.  Also make sure that you "make install".

As for your earlier MPD problems, please see this FAQ entry: http://wiki.mcs.anl.gov/mpich2/index.php/Frequently_Asked_Questions#Q 
:_My_MPD_ring_won.27t_start.2C_what.27s_wrong.3F

-Dave



More information about the mpich-discuss mailing list