[mpich-discuss] Not able to run program parallely on cluster...

Rajeev Thakur thakur at mcs.anl.gov
Mon May 21 20:14:35 CDT 2012


You are passing the same buffer as the sendbuf and recvbuf to MPI_Gatherv, which is not allowed in MPI. Use MPI_IN_PLACE as described in the standard.


On May 21, 2012, at 2:29 PM, Albert Spade wrote:

> This is my new error after making few changes...
> Results are quite similar... No succes with cluster...
>  
> Sample run
> --------------------------------------------------------
>  
> [root at beowulf testing]# mpiexec -n 1 ./Radix
> Time taken for 16 elements using 1 processors = 4.72069e-05 seconds
> [root at beowulf testing]# mpiexec -n 2 ./Radix
> Fatal error in PMPI_Gatherv: Internal MPI error!, error stack:
> PMPI_Gatherv(398).....: MPI_Gatherv failed(sbuf=0x97d0500, scount=64, MPI_CHAR, rbuf=0x97d0500, rcnts=0x97d06b8, displs=0x97d06c8, MPI_CHAR, root=0, MPI_COMM_WORLD) failed
> MPIR_Gatherv_impl(210):
> MPIR_Gatherv(104).....:
> MPIR_Localcopy(357)...: memcpy arguments alias each other, dst=0x97d0500 src=0x97d0500 len=64
> =====================================================================================
> =   BAD TERMINATION OF ONE OF YOUR APPLICATION PROCESSES
> =   EXIT CODE: 256
> =   CLEANING UP REMAINING PROCESSES
> =   YOU CAN IGNORE THE BELOW CLEANUP MESSAGES
> =====================================================================================
> [proxy:0:1 at beowulf.node1] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:927): assert (!closed) failed
> [proxy:0:1 at beowulf.node1] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status
> [proxy:0:1 at beowulf.node1] main (./pm/pmiserv/pmip.c:221): demux engine error waiting for event
> [mpiexec at beowulf.master] HYDT_bscu_wait_for_completion (./tools/bootstrap/utils/bscu_wait.c:77): one of the processes terminated badly; aborting
> [mpiexec at beowulf.master] HYDT_bsci_wait_for_completion (./tools/bootstrap/src/bsci_wait.c:23): launcher returned error waiting for completion
> [mpiexec at beowulf.master] HYD_pmci_wait_for_completion (./pm/pmiserv/pmiserv_pmci.c:225): launcher returned error waiting for completion
> [mpiexec at beowulf.master] main (./ui/mpich/mpiexec.c:437): process manager error waiting for completion
> [root at beowulf testing]#
> 
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss



More information about the mpich-discuss mailing list