[MPICH] MPICH2 v1.02 in heterogeneous enviroment
Kenneth Rempe
kenneth.rempe at studsvikscandpower.com
Tue Mar 7 14:15:55 CST 2006
Thanks for the response. I found the following in the MPICH2_1.0.3
release notes:
- The CH3 device does not presently support heterogeneous
communication. That
is to say that the processes involved in a job must use the same basic
type
sizes and format. The sizes and format are typically determined by the
processor architecture, although it may also be influenced by compiler
options. This device does support the use of different executables (e.g.,
multiple-program-multiple-data, or MPMD, programming).
Which is just as you stated. Do you know if true heterogeneous support
is planned?
Thanks again.
Ken
Rajeev Thakur wrote:
>MPICH2 doesn't yet work in heterogeneous environments. It will work in
>heterogeneous environments that really are homogoneous, ie, no differences
>in byte ordering, sizes of datatypes, etc.
>
>Rajeev
>
>
>
>
>>-----Original Message-----
>>From: owner-mpich-discuss at mcs.anl.gov
>>[mailto:owner-mpich-discuss at mcs.anl.gov] On Behalf Of Kenneth Rempe
>>Sent: Monday, March 06, 2006 8:13 AM
>>To: mpich-discuss at mcs.anl.gov
>>Subject: [MPICH] MPICH2 v1.02 in heterogeneous enviroment
>>
>>
>>I'm using MPICH2 v1.02 with IBM AIX 5.2 and Red Hat Linux. My program
>>works fine
>>if I just run on Linux machines or just IBM machines but
>>fails with the
>>following message when
>>trying to run on both IBM and Linux at the same time.
>>
>>Fatal error in MPI_Comm_spawn: Internal MPI error!, error stack:
>>MPI_Comm_spawn(128):
>>MPI_Comm_spawn(cmd="/home/ken/DUKE/Formosa-link/New/FPSIM3/Lin
>>ux/fpsim3.exe",
>>argv=2005bc88, maxprocs=1, info=0x9c000000, root=0, MPI_COMM_WORLD,
>>intercomm=2005b858, errors=2002fff0) failed
>>MPID_Comm_spawn_multiple(52):
>>MPIDI_CH3_Comm_spawn_multiple(212):
>>MPIDI_CH3_Comm_accept(102):
>>MPIDI_CH3_Progress_wait(209): an error occurred while
>>handling an event
>>returned by MPIDU_Sock_Wait()
>>MPIDI_CH3I_Progress_handle_sock_event(886): [ch3:sock]
>>received packet
>>of unknown type (385875968)
>>% rank 0 in job 1 ibm6e1_34162 caused collective abort of all ranks
>> exit status of rank 0: killed by signal 9
>>
>>
>>Is it possible to use MPICH2 in a heterogeneous environment?
>>
>>Thanks.
>>
>>
>>
>>
>
>
>
--
*** NOTE: soa.com will no longer be used after February 1, 2006, please change your address book ***
Kenneth R. Rempe * email: kenneth.rempe at studsvikscandpower.com
Studsvik Scandpower, Inc. *
504 Shoup Avenue Suite #201 * voice: 208-522-4630
Idaho Falls, ID USA 83402-3502 * fax: 208-522-1187
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/mpich-discuss/attachments/20060307/51eebcbc/attachment.htm>
More information about the mpich-discuss
mailing list