[mpich-discuss] mpich2 and IPoIB

Fabio Motezuki kenji_japanese at ig.com.br
Fri Oct 31 13:11:43 CDT 2008


Hi,

 

We got something here, now the communications goes through infiniband. The machinefile for smpd is a bit different from the mpd.

Also tested the machinefile with 1.0.7 and works too.

 

Fabio

 

 

From: Jayesh Krishna [mailto:jayesh at mcs.anl.gov] 
Sent: Friday, October 31, 2008 2:03 PM
To: 'Fabio Kenji'
Cc: mpich-discuss at mcs.anl.gov
Subject: RE: [mpich-discuss] mpich2 and IPoIB

 

 Hi,
  Can you try specifying hostnames and interface ipaddresses in your machinefile (also see the difference in the way interface ipaddr is specified) ?

==========================================
Hostname_of_10_2_5_10 -ifhn 10.2.10.10
Hostname_of_10_2_5_11 -ifhn 10.2.10.11
==========================================

Regards,
Jayesh

-----Original Message-----
From: Fabio Kenji [mailto:kenji_japanese at ig.com.br]
Sent: Friday, October 31, 2008 6:39 AM
To: Jayesh Krishna
Cc: mpich-discuss at mcs.anl.gov
Subject: Re: [mpich-discuss] mpich2 and IPoIB

Hi,

Compiled the 1.0.8 with the following options:
./configure --enable-cxx --with-pm=smpd --prefix=<path> make && make install

to started the smpd using
<path>smpd -s

to launch the program I'm using
<path>mpiexec -n 4 -machinefile mf <program>

the mf file contains:
10.2.5.10 ifhn=10.2.10.10
10.2.5.11 ifhn=10.2.10.11

where 10.2.5.xx is ethernet and 10.2.10.xx is infiniband

and the communications are going through ethernet

Fabio

2008/10/30, Jayesh Krishna <jayesh at mcs.anl.gov>:
>
>
>
>  Hi,
>    Can you try the latest stable release (1.0.8) of MPICH2 ?
>    I just tried entering a non-existant ipaddress in the machinefile using the "-ifhn" option and my program failed (expected failure). A quick look of the smpd debug output shows that smpd is setting the right (as per the "-ifhn" args) environment before launching the process.
>    Please let us know if MPICH2 1.0.8 + smpd + "-ifhn" option does not work for you.
>
>  Regards,
>  Jayesh
>
>  -----Original Message-----
>  From: Fabio Motezuki [mailto:kenji_japanese at ig.com.br]
>  Sent: Wednesday, October 29, 2008 8:36 PM
>  To: Jayesh Krishna
>  Cc: mpich-discuss at mcs.anl.gov
>  Subject: Re: [mpich-discuss] mpich2 and IPoIB
>
>  Returning, I compiled de 1.0.8rc1 source and using mpd the ifhn in the mpd.hosts file worked well, but not with interface name resolution, just using ip directly.
>
>  In 1.0.7 I tried to use ifhn in the machinefile for smpd execution, but no effect even when using the ip address of the infiniband interface.
>
>
>  Fabio
>
>
>
>  Jayesh Krishna escreveu:
>
>  >
>  >  Hi,
>  >   Yes, you can use the "-ifhn" option in the machinefile with smpd.
>  >   Let us know if you have any problems.
>  >
>  > Regards,
>  > Jayesh
>  >
>  > -----Original Message-----
>  > From: owner-mpich-discuss at mcs.anl.gov  >
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Rajeev Thakur  >
> Sent: Monday, October 27, 2008 2:22 PM  > To:
> mpich-discuss at mcs.anl.gov  > Subject: RE: [mpich-discuss] mpich2 and
> IPoIB  >  > Probably :-). From doing a grep in the smpd directory it
> looks like it  > might accept "-ifhn hostname"
>  >
>  > Rajeev
>  >
>  > > -----Original Message-----
>  > > From: owner-mpich-discuss at mcs.anl.gov  > >
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Fabio Motezuki 
> > > Sent: Monday, October 27, 2008 2:12 PM  > > To:
> mpich-discuss at mcs.anl.gov  > > Subject: Re: [mpich-discuss] mpich2 and
> IPoIB  > >  > > Thanks Rajeev,  > >  > > I'll try it, perhaps is there
> any options like this for smpd?
>  > >
>  > > Fabio
>  > >
>  > >
>  > > Rajeev Thakur escreveu:
>  > > > Try specifying the interface name using ifhn= in the  > >
> mpd.hosts file as  > > > described in Sec 5.1.5 of the installation
> guide.
>  > > >
>  > > http://www.mcs.anl.gov/research/projects/mpich2/documentation/
>  > > files/mpich2-1
>  > > > .0.8-installguide.pdf
>  > > >
>  > > > Rajeev
>  > > >
>  > > >
>  > > >> -----Original Message-----
>  > > >> From: owner-mpich-discuss at mcs.anl.gov  > > >>
> [mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of  > > Fabio
> Motezuki  > > >> Sent: Monday, October 27, 2008 5:35 AM  > > >> To:
> mpich-discuss at mcs.anl.gov  > > >> Subject: [mpich-discuss] mpich2 and
> IPoIB  > > >>  > > >> Hi all,  > > >>  > > >> I'm working on a cluster
> where each node is connected with two  > > >> networks:
>  > > >>
>  > > >> eth0 - gigabit ethernet
>  > > >> ib0 - IP over infiniband
>  > > >>
>  > > >> I would like to use the ib0 net for all mpi communications,
> but  > > >> when I start the example cpi all comunications go through
> eth0 is  > > this the  > > >> expected behavior?
>  > > >>
>  > > >> I'm launching the example program with "mpiexec  > >
> -machinefile mf -n 16  > > >> ./cpi" where the file "mf" contains the
> ip address of infiniband  > > >> cards.
>  > > >>
>  > > >> Fabio
>  > > >>
>  > > >>
>  > > >>
>  > > >
>  > > >
>  > > >
>  > >
>  > >
>  >
>

-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20081031/86059c89/attachment.htm>


More information about the mpich-discuss mailing list