[mpich-discuss] MPICH2 error: seeking help

songjing songjing at mail.ustc.edu.cn
Tue Dec 28 20:28:59 CST 2010


Dear Sir,
First of all, thank you for your excellent MPICH! It helps us a lot.
But I have run into a problem that puzzles me.
I have a parallel code system compiled with MPICH2 1.3.1. When it runs on a computational cluster with 2 nodes (Windows Server 2003, 64-bit), it prints the output below and stops running.
What is wrong with MPICH2 or my code?
I look forward to your quick and detailed answer.
Thank you very much!
                                                                                      SongJing

mcnp     ver=5    , ld=06212004  12/28/10 12:35:24                   
 outp already exists.  outq is created instead.
 outq already exists.  outr is created instead.
 outr already exists.  outs is created instead.
 outs already exists.  outt is created instead.
 outt already exists.  outu is created instead.
 outu already exists.  outv is created instead.
          Thread Name & Version = MCNP5_RSICC, 1.30
          Copyright LANL/UC/DOE - see output file
                                _                                      
          ._ _    _  ._   ._   |_                                      
          | | |  (_  | |  |_)   _)                                     
                          |                                            
  
 m1     $ENrich u:0.095                                               
 warning. material   1 is not used in the problem.
 m2                                                                   
 warning. material   2 is not used in the problem.
 m3                                                                   
 warning. material   3 is not used in the problem.
 cut:n 1.e20 1.e-11 .18 .09                                           
 warning. neutron energy cutoff is not zero in this kcode problem.
 warning. neutron energy cutoff >0 in this neutron-photon problem.
 mode n p                                                             
 comment. photonuclear physics may be needed (phys:p).
 srctp already exists.  srctq is created instead.
 srctq already exists.  srctr is created instead.
 srctr already exists.  srcts is created instead.

 comment. total fission nubar data are being used.
 warning.   2 materials had unnormalized fractions. print table 40.
 comment. using random number generator  1, initial seed = 19073486328125      
 warning. neutron  time cutoff is not equal to photon   time cutoff.
 imcn   is done
 runtpe already exists.  runtpf is created instead.
 runtpf already exists.  runtpg is created instead.
 runtpg already exists.  runtph is created instead.
 runtph already exists.  runtpi is created instead.
 runtpi already exists.  runtpj is created instead.
 runtpj already exists.  runtpk is created instead.
 warning.   1003.00c lacks gamma-ray production cross sections.
 warning.   2004.00c lacks gamma-ray production cross sections.
 warning.   8017.00c lacks gamma-ray production cross sections.
 warning.  31069.00c lacks gamma-ray production cross sections.
 warning.  31071.00c lacks gamma-ray production cross sections.
 warning.  33075.00c lacks gamma-ray production cross sections.
 warning.  34076.00c lacks gamma-ray production cross sections.
 warning.  34077.00c lacks gamma-ray production cross sections.
 warning.  34078.00c lacks gamma-ray production cross sections.
 warning.  34079.00c lacks gamma-ray production cross sections.
 warning.  34080.00c lacks gamma-ray production cross sections.
 warning.  34082.00c lacks gamma-ray production cross sections.
 warning.  35079.00c lacks gamma-ray production cross sections.
 additional error messages on file outv    
 dump    1 on file runtpk     nps =           0    coll =              0
                              ctm =        0.00   nrn =                 0
 source distribution to file srcts           cycle =     0
 xact   is done
 cp0 =   0.34
 master starting      15 tasks with       1 threads each  12/28/10 12:35:54 
 master sending static commons...
 master sending dynamic commons...
 master sending cross section data...
Fatal error in PMPI_Bcast: Other MPI error, error stack:
PMPI_Bcast(1306).................................: MPI_Bcast(buf=1B7F0040, count=192621760, MPI_BYTE, root=0, comm=0x84000004) failed
MPIR_Bcast_impl(1150)............................: 
MPIR_Bcast_intra(990)............................: 
MPIR_Bcast_scatter_ring_allgather(908)...........: 
MPIR_Bcast_scatter_ring_allgather(693)...........: 
scatter_for_bcast(286)...........................: 
MPIC_Recv(108)...................................: 
MPIC_Wait(528)...................................: 
MPIDI_CH3I_Progress(335).........................: 
MPID_nem_mpich2_blocking_recv(906)...............: 
MPID_nem_newtcp_module_poll(37)..................: 
MPID_nem_newtcp_module_connpoll(2669)............: 
MPID_nem_newtcp_module_recv_success_handler(2364): 
MPID_nem_newtcp_module_post_readv_ex(330)........: 
MPIU_SOCKW_Readv_ex(392).........................: read from socket failed, An operation on a socket could not be performed because the system lacked sufficient buffer space or because a queue was full.  (errno 10055)
Fatal error in PMPI_Bcast: Other MPI error, error stack:
PMPI_Bcast(1306)......................: MPI_Bcast(buf=1B7F0040, count=192621760, MPI_BYTE, root=0, comm=0x84000002) failed
MPIR_Bcast_impl(1150).................: 
MPIR_Bcast_intra(990).................: 
MPIR_Bcast_scatter_ring_allgather(908): 
MPIR_Bcast_scatter_ring_allgather(730): 
MPIC_Sendrecv(186)....................: 
MPIC_Wait(528)........................: 
MPIDI_CH3I_Progress(335)..............: 
MPID_nem_mpich2_blocking_recv(906)....: 
MPID_nem_newtcp_module_poll(37).......: 
MPID_nem_newtcp_module_connpoll(2655).: 
gen_read_fail_handler(1145)...........: read from socket failed - The specified network name is no longer available.
Fatal error in PMPI_Bcast: Other MPI error, error stack:
PMPI_Bcast(1306)......................: MPI_Bcast(buf=1B7F0040, count=192621760, MPI_BYTE, root=0, comm=0x84000004) failed
MPIR_Bcast_impl(1150).................: 
MPIR_Bcast_intra(990).................: 
MPIR_Bcast_scatter_ring_allgather(908): 
MPIR_Bcast_scatter_ring_allgather(730): 
MPIC_Sendrecv(189)....................: 
MPIC_Wait(528)........................: 
MPIDI_CH3I_Progress(335)..............: 
MPID_nem_mpich2_blocking_recv(906)....: 
MPID_nem_newtcp_module_poll(37).......: 
MPID_nem_newtcp_module_connpoll(2655).: 
gen_read_fail_handler(1145)...........: read from socket failed - The specified network name is no longer available.




                                                                                                                    
**********************************************************************************************************
University of Science and Technology of China, Nuclear Science and Technology
E-mail: songjing at mail.ustc.edu.cn
Website: www.fds.org.cn
CellPhone: 13645698849
**********************************************************************************************************
2010-12-20 



songjing 
