[mpich-discuss] mvapich2 on multiple nodes: 2 problems
Dave Goodell
goodell at mcs.anl.gov
Fri Apr 23 11:24:18 CDT 2010
On Apr 23, 2010, at 9:42 AM, abc def wrote:
> Having now had a chance to reinstall mpich2 on the first 2
> computers, I have tested the program trying to use the 4 cores of
> both computers together, using:
>
> mpiexec -n 8 /home/me/software.ex > job.out 2> job.err
> and
> mpiexec -machinefile ./machinefile -n 8 /home/me/software.ex >
> job.out 2> job.err
>
> but I get the same error again:
> Fatal error in MPI_Init: Other MPI error, error stack:
> MPIR_Init_thread(310): Initialization failed
> MPID_Init(113).......: channel initialization failed
> MPIDI_CH3_Init(244)..: process not on the same host (quad != december)
This message only comes from using ch3:shm. There is no other way to
obtain it. Something is wrong with your installation.
> This is despite running configure with
>
> configure --prefix=/usr/local/mpich --with-device=ch3:nemesis --
> enable-sharedlibs=gcc
Please start with a fresh source tree. You probably have some old
compiled code hanging around. Also make sure that you "make install".
As for your earlier MPD problems, please see this FAQ entry: http://wiki.mcs.anl.gov/mpich2/index.php/Frequently_Asked_Questions#Q
:_My_MPD_ring_won.27t_start.2C_what.27s_wrong.3F
-Dave
More information about the mpich-discuss
mailing list