[MPICH] MPICH2 v1.02 in heterogeneous enviroment

Rajeev Thakur thakur at mcs.anl.gov
Tue Mar 7 14:33:18 CST 2006


Yes, it's definitely planned, but I can't give a firm date for when it will
appear in the implementation.
 
Rajeev
 


  _____  

From: Kenneth Rempe [mailto:kenneth.rempe at studsvikscandpower.com] 
Sent: Tuesday, March 07, 2006 2:16 PM
To: Rajeev Thakur
Cc: mpich-discuss at mcs.anl.gov
Subject: Re: [MPICH] MPICH2 v1.02 in heterogeneous enviroment


Thanks for the response. I found the following in the MPICH2_1.0.3 release
notes:

- The CH3 device does not presently support heterogeneous communication.
  That is to say that the processes involved in a job must use the same
  basic type sizes and format.  The sizes and format are typically
  determined by the processor architecture, although it may also be
  influenced by compiler options.  This device does support the use of
  different executables (e.g., multiple-program-multiple-data, or MPMD,
  programming).


Which is just as you stated. Do you know if true heterogeneous support is
planned?
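
The release note quoted above requires every process in a job to use the same basic type sizes and format. One rough way to compare two hosts is to print those properties on each and compare the output by hand. The following is a minimal sketch, not part of MPICH2; the function name `platform_fingerprint` is hypothetical. It reports the sizes seen by the C compiler that built the local Python interpreter, which is an assumption: an MPI application compiled with different options (for example 32-bit versus 64-bit mode) may differ.

```python
import ctypes
import sys


def platform_fingerprint():
    """Return the byte order and the sizes (in bytes) of basic C types on this host.

    Hypothetical helper for illustration; sizes reflect the C ABI that the
    local Python interpreter was built with, not necessarily the MPI program's.
    """
    c_types = {
        "char": ctypes.c_char,
        "short": ctypes.c_short,
        "int": ctypes.c_int,
        "long": ctypes.c_long,
        "long long": ctypes.c_longlong,
        "float": ctypes.c_float,
        "double": ctypes.c_double,
        "long double": ctypes.c_longdouble,
        "void *": ctypes.c_void_p,
    }
    sizes = {name: ctypes.sizeof(ctype) for name, ctype in c_types.items()}
    return {"byte_order": sys.byteorder, "sizes": sizes}


if __name__ == "__main__":
    fp = platform_fingerprint()
    print("byte order:", fp["byte_order"])
    for name, size in fp["sizes"].items():
        print(f"sizeof({name}) = {size}")
```

If the byte order or any size differs between two hosts, they fall outside the "same basic type sizes and format" condition described in the release note.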

Thanks again.

Ken



Rajeev Thakur wrote:


MPICH2 doesn't yet work in heterogeneous environments. It will work in
heterogeneous environments that really are homogeneous, i.e., no differences
in byte ordering, sizes of datatypes, etc.



Rajeev

 



  

-----Original Message-----
From: owner-mpich-discuss at mcs.anl.gov
[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Kenneth Rempe
Sent: Monday, March 06, 2006 8:13 AM
To: mpich-discuss at mcs.anl.gov
Subject: [MPICH] MPICH2 v1.02 in heterogeneous enviroment





I'm using MPICH2 v1.02 with IBM AIX 5.2 and Red Hat Linux. My program
works fine if I just run on Linux machines or just IBM machines but
fails with the following message when trying to run on both IBM and
Linux at the same time.



Fatal error in MPI_Comm_spawn: Internal MPI error!, error stack:
MPI_Comm_spawn(128): MPI_Comm_spawn(cmd="/home/ken/DUKE/Formosa-link/New/FPSIM3/Linux/fpsim3.exe", argv=2005bc88, maxprocs=1, info=0x9c000000, root=0, MPI_COMM_WORLD, intercomm=2005b858, errors=2002fff0) failed
MPID_Comm_spawn_multiple(52):
MPIDI_CH3_Comm_spawn_multiple(212):
MPIDI_CH3_Comm_accept(102):
MPIDI_CH3_Progress_wait(209): an error occurred while handling an event returned by MPIDU_Sock_Wait()
MPIDI_CH3I_Progress_handle_sock_event(886): [ch3:sock] received packet of unknown type (385875968)
% rank 0 in job 1  ibm6e1_34162   caused collective abort of all ranks
  exit status of rank 0: killed by signal 9
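
The "unknown type" value in the error stack above looks like a small integer read with the wrong byte order. The sketch below reverses the bytes of the reported number. It assumes, without confirmation from the thread, that the field is a 32-bit unsigned integer and that the Linux hosts are little-endian (for example x86) while AIX on POWER is big-endian. The helper name `swap32` is hypothetical, and whether the resulting value is a valid CH3 packet type is not something this sketch can establish.

```python
import struct


def swap32(value):
    """Reverse the byte order of a 32-bit unsigned integer."""
    # Pack as big-endian, then reinterpret the same four bytes as little-endian.
    return struct.unpack("<I", struct.pack(">I", value))[0]


reported = 385875968          # value printed in the error stack
print(hex(reported))          # 0x17000000
print(swap32(reported))       # 23
```

A value whose only nonzero byte sits at the opposite end of the word is consistent with a byte-order mismatch between the two sides of the connection.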





Is it possible to use MPICH2 in a heterogeneous environment?



Thanks.





    



  



-- 

*** NOTE: soa.com will no longer be used after February 1, 2006, please
change your address book ***



Kenneth R. Rempe                * email:  kenneth.rempe at studsvikscandpower.com
Studsvik Scandpower, Inc.       *
504 Shoup Avenue Suite #201     * voice:  208-522-4630
Idaho Falls, ID USA 83402-3502  * fax:    208-522-1187
