I ran the cpi example across two hosts and it completed without errors. The output is below.<br><i><br>[schaudhry@n13 examples]$ mpiexec -hosts n13,n03 -np 2 ./cpi<br>Process 0 of 2 is on n13<br>Process 1 of 2 is on n03<br>pi is approximately 3.1415926544231318, Error is 0.0000000008333387<br>
wall clock time = 0.000931<br>[schaudhry@n13 examples]$ </i><br><br><br><div class="gmail_quote">On Mon, Nov 21, 2011 at 2:57 PM, Rajeev Thakur <span dir="ltr">&lt;<a href="mailto:thakur@mcs.anl.gov">thakur@mcs.anl.gov</a>&gt;</span> wrote:<br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">See if other MPI programs run across multiple machines. For example, the cpi example in the examples directory.<br>
<div><div></div><div class="h5"><br>
<br>
On Nov 21, 2011, at 3:51 PM, Chavez, Andres wrote:<br>
<br>
> When restricted to running on one machine, my F90 program works perfectly, but when I try to run it on multiple machines the problem below occurs. I can't figure out what is going wrong. Any help will be greatly appreciated. Thank you.<br>
><br>
> Fatal error in PMPI_Gather: Other MPI error, error stack:<br>
> PMPI_Gather(863)..................: MPI_Gather(sbuf=0xeb59a0, scount=512, MPI_DOUBLE_COMPLEX, rbuf=(nil), rcount=512, MPI_DOUBLE_COMPLEX, root=0, MPI_COMM_WORLD) failed<br>
> MPIR_Gather_impl(693).............:<br>
> MPIR_Gather(655)..................:<br>
> MPIR_Gather_intra(283)............:<br>
> MPIC_Send(66).....................:<br>
> MPIC_Wait(540)....................:<br>
> MPIDI_CH3I_Progress(402)..........:<br>
> MPID_nem_mpich2_blocking_recv(905):<br>
> MPID_nem_tcp_connpoll(1838).......:<br>
> state_listening_handler(1908).....: accept of socket fd failed - Invalid argument<br>
> Fatal error in PMPI_Gather: Other MPI error, error stack:<br>
> PMPI_Gather(863)..........: MPI_Gather(sbuf=0x25d39e0, scount=512, MPI_DOUBLE_COMPLEX, rbuf=0x25bd9b0, rcount=512, MPI_DOUBLE_COMPLEX, root=0, MPI_COMM_WORLD) failed<br>
> MPIR_Gather_impl(693).....:<br>
> MPIR_Gather(655)..........:<br>
> MPIR_Gather_intra(202)....:<br>
> dequeue_and_set_error(596): Communication error with rank 1<br>
><br>
> These are all the calls to MPI_GATHER:<br>
> call MPI_GATHER(xi_dot_matrix_transp,na*n_elements*nsd/numtasks,MPI_DOUBLE_COMPLEX,xi_dot_matrix_gath,&<br>
> na*n_elements*nsd/numtasks,MPI_DOUBLE_COMPLEX,0,MPI_COMM_WORLD,ierr)<br>
> call MPI_GATHER(Matrix_A_hat_3d_transp,5*na*size_matrix*nsd/numtasks,MPI_DOUBLE_COMPLEX,&<br>
> Matrix_A_hat_3d_gath,5*na*size_matrix*nsd/numtasks,MPI_DOUBLE_COMPLEX,0,MPI_COMM_WORLD,ierr)<br>
> call MPI_GATHER(JR_matrix_transp,5*na*size_matrix*nsd/numtasks,MPI_INTEGER,JR_matrix_gath,&<br>
> 5*na*size_matrix*nsd/numtasks,MPI_INTEGER,0,MPI_COMM_WORLD,ierr)<br>
> call MPI_GATHER(JC_matrix_transp,5*na*size_matrix*nsd/numtasks,MPI_INTEGER,JC_matrix_gath,&<br>
> 5*na*size_matrix*nsd/numtasks,MPI_INTEGER,0,MPI_COMM_WORLD,ierr)<br>
><br>
</div></div>> _______________________________________________<br>
> mpich-discuss mailing list <a href="mailto:mpich-discuss@mcs.anl.gov">mpich-discuss@mcs.anl.gov</a><br>
> To manage subscription options or unsubscribe:<br>
> <a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
<br>
</blockquote></div><br>
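As a sanity check on the cpi numbers above, here is a minimal serial sketch of what I understand cpi to compute: a midpoint-rule estimate of the integral of 4/(1+x^2) on [0, 1], with the intervals dealt out cyclically to the ranks and the partial sums added at the end. The function names are my own, and the interval count of 10000 is an assumption that I have not checked against the cpi source in this build. It does fit the reported error, since the midpoint rule should overshoot pi by roughly h^2/12, which is about 8.33e-10 for h = 1/10000.

```python
import math


def cpi_midpoint(n: int) -> float:
    """Midpoint-rule estimate of the integral of 4/(1+x^2) over [0, 1]."""
    h = 1.0 / n
    total = 0.0
    for i in range(1, n + 1):
        x = h * (i - 0.5)  # midpoint of the i-th interval
        total += 4.0 / (1.0 + x * x)
    return h * total


def cpi_by_rank(n: int, numprocs: int) -> float:
    """Same sum, split cyclically over 'ranks' and then added together.

    This only imitates the work split in plain Python; no MPI is involved.
    """
    h = 1.0 / n
    partials = []
    for rank in range(numprocs):
        s = 0.0
        # rank r takes intervals r+1, r+1+numprocs, r+1+2*numprocs, ...
        for i in range(rank + 1, n + 1, numprocs):
            x = h * (i - 0.5)
            s += 4.0 / (1.0 + x * x)
        partials.append(h * s)
    return sum(partials)


if __name__ == "__main__":
    n = 10000  # assumed interval count, not confirmed from the cpi source
    approx = cpi_by_rank(n, 2)
    print(f"pi is approximately {approx:.16f}, "
          f"Error is {abs(approx - math.pi):.16f}")
```

If the two-host run had lost or corrupted a partial sum, the error would be far larger than h^2/12, so the output above suggests that basic communication between n13 and n03 works for this small case. It does not exercise MPI_Gather with large buffers, so it cannot rule out a problem specific to the F90 program.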