[mpich-discuss] Too many open files (errno 24) when using MPI_Alltoallv

Dave Goodell goodell at mcs.anl.gov
Thu Jun 7 11:22:26 CDT 2012


Your OS has too low a limit on the number of file descriptors that a single process can have open at one time (probably 1024).  Each rank may open a TCP socket to every peer during MPI_Alltoallv, so a run with 1056 processes can exceed that.  Try increasing this limit.  Some instructions for Linux can be found here (just try step 1); you'll have to search around for instructions for other operating systems:

http://www.cs.uwaterloo.ca/~brecht/servers/openfiles.html
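
As a rough sketch only (this is not part of MPICH, and the helper names current_fd_soft_limit and raise_fd_limit_to_hard are made up for illustration), a process can also inspect and raise its own soft limit with the POSIX getrlimit/setrlimit calls. It assumes a Linux-like system where the hard limit is already higher than the soft limit; if the hard limit is itself 1024, an administrator still has to raise it system-wide.

```cpp
#include <sys/resource.h>

// Hypothetical helper: returns the current soft limit on open file
// descriptors for this process, or 0 if the limit cannot be queried.
rlim_t current_fd_soft_limit() {
    struct rlimit rl;
    if (getrlimit(RLIMIT_NOFILE, &rl) != 0) {
        return 0;
    }
    return rl.rlim_cur;
}

// Hypothetical helper: raises the soft limit up to the hard limit.
// An unprivileged process may do this; going beyond the hard limit
// requires privileges. Returns true if setrlimit succeeded.
bool raise_fd_limit_to_hard() {
    struct rlimit rl;
    if (getrlimit(RLIMIT_NOFILE, &rl) != 0) {
        return false;
    }
    rl.rlim_cur = rl.rlim_max;
    return setrlimit(RLIMIT_NOFILE, &rl) == 0;
}
```

Calling raise_fd_limit_to_hard() at the top of main(), before MPI_Init, would be one place to try it. Printing current_fd_soft_limit() from each rank is also a quick way to see which limit the processes started by mpirun actually have, since it may differ from the limit in your login shell.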

-Dave

On Jun 7, 2012, at 1:16 AM CDT, jt.meng at siat.ac.cn wrote:

> I am having trouble using MPI_Alltoallv on more than 1000 cores, and I cannot work out why this error occurs. Can you help me?
> 
> TestAlltoAll.cpp is attached to this email. You can run it on 1000 cores to reproduce the error. Thanks.
> 
> 
> 
> node73:/lustrefs/home/temp/BGI/newGraph/MPIGraph # mpic++ -O2 TestAlltoAll.cpp  -o TestAlltoAll
> node73:/lustrefs/home/temp/BGI/newGraph/MPIGraph # time mpirun -np 1056 -machinefile hostfile ./TestAlltoAll
> Fatal error in PMPI_Alltoallv: Other MPI error, error stack:
> PMPI_Alltoallv(549)..............: MPI_Alltoallv(sbuf=0x2b2738837010, scnts=0x5f06e0, sdispls=0x5f89b0, MPI_LONG_LONG_INT, rbuf=0x2b276ade2010, rcnts=0x5f7920, rdispls=0x5f9a40, MPI_LONG_LONG_INT, MPI_COMM_WORLD) failed
> MPIR_Alltoallv_impl(389).........:
> MPIR_Alltoallv(355)..............:
> MPIR_Alltoallv_intra(190)........:
> MPIC_Isend(475)..................:
> MPID_nem_lmt_RndvSend(81)........:
> MPIDI_CH3_RndvSend(63)...........: failure occurred while attempting to send RTS packet
> MPID_nem_tcp_iStartContigMsg(298):
> MPID_nem_tcp_connect(849)........: unable to create a socket, Too many open files (errno 24)
> Fatal error in PMPI_Alltoallv: Other MPI error, error stack:
> PMPI_Alltoallv(549)..............: MPI_Alltoallv(sbuf=0x2b242dea0010, scnts=0x5f06e0, sdispls=0x5f89b0, MPI_LONG_LONG_INT, rbuf=0x2b246044b010, rcnts=0x5f7920, rdispls=0x5f9a40, MPI_LONG_LONG_INT, MPI_COMM_WORLD) failed
> MPIR_Alltoallv_impl(389).........:
> MPIR_Alltoallv(355)..............:
> MPIR_Alltoallv_intra(190)........:
> MPIC_Isend(475)..................:
> MPID_nem_lmt_RndvSend(81)........:
> MPIDI_CH3_RndvSend(63)...........: failure occurred while attempting to send RTS packet
> MPID_nem_tcp_iStartContigMsg(298):
> MPID_nem_tcp_connect(849)........: unable to create a socket, Too many open files (errno 24)
> Fatal error in PMPI_Alltoallv: Other MPI error, error stack:
> PMPI_Alltoallv(549)..............: MPI_Alltoallv(sbuf=0x2b1a26776010, scnts=0x5f06e0, sdispls=0x5f89b0, MPI_LONG_LONG_INT, rbuf=0x2b1a58d21010, rcnts=0x5f7920, rdispls=0x5f9a40, MPI_LONG_LONG_INT, MPI_COMM_WORLD) failed
> MPIR_Alltoallv_impl(389).........:
> MPIR_Alltoallv(355)..............:
> MPIR_Alltoallv_intra(190)........:
> MPIC_Isend(475)..................:
> MPID_nem_lmt_RndvSend(81)........:
> MPIDI_CH3_RndvSend(63)...........: failure occurred while attempting to send RTS packet
> MPID_nem_tcp_iStartContigMsg(298):
> MPID_nem_tcp_connect(849)........: unable to create a socket, Too many open files (errno 24)
> Fatal error in PMPI_Alltoallv: Other MPI error, error stack:
> PMPI_Alltoallv(549)...............: MPI_Alltoallv(sbuf=0x2b7cd63af010, scnts=0x5f06e0, sdispls=0x5f89b0, MPI_LONG_LONG_INT, rbuf=0x2b7d0895a010, rcnts=0x5f7920, rdispls=0x5f9a40, MPI_LONG_LONG_INT, MPI_COMM_WORLD) failed
> MPIR_Alltoallv_impl(389)..........:
> MPIR_Alltoallv(355)...............:
> MPIR_Alltoallv_intra(199).........:
> MPIC_Waitall_ft(852)..............:
> MPIR_Waitall_impl(121)............:
> MPIDI_CH3I_Progress(402)..........:
> MPID_nem_mpich2_blocking_recv(905):
> MPID_nem_tcp_connpoll(1838).......:
> state_listening_handler(1908).....: accept of socket fd failed - Too many open files
> Ctrl-C caught... cleaning up processes
> 
> real    13m0.684s
> user    0m13.065s
> sys     0m49.703s
> 
> 
> Jintao Meng
> High Performance Computing Center
> Shenzhen Institutes of Advanced Technology, CAS
> 
> 
> 
> 
> 
> <TestAlltoAll.cpp>


