[mpich-discuss] p4_error issue

akshar bhosale akshar.bhosale at gmail.com
Tue Nov 22 11:24:22 CST 2011


Hi,

I have an executable compiled with mpich-1.2.7p1. It works fine when run on
a single server (node), but it gives the following error when I try to run it
across multiple nodes.


p15_4211: p4_error: net_recv read: probable EOF on socket: 1
p14_4184: p4_error: net_recv read: probable EOF on socket: 1
p13_4157: p4_error: net_recv read: probable EOF on socket: 1
p12_4130: p4_error: net_recv read: probable EOF on socket: 1
p11_4103: p4_error: net_recv read: probable EOF on socket: 1
p10_4076: p4_error: net_recv read: probable EOF on socket: 1
p8_4022: p4_error: net_recv read: probable EOF on socket: 1
p9_4049: p4_error: net_recv read: probable EOF on socket: 1
p23_9466: p4_error: net_recv read: probable EOF on socket: 1
p18_9331: p4_error: net_recv read: probable EOF on socket: 1
p22_9439: p4_error: net_recv read: probable EOF on socket: 1
p16_9277: p4_error: net_recv read: probable EOF on socket: 1
p17_9304: p4_error: net_recv read: probable EOF on socket: 1
p3_32073: p4_error: net_recv read: probable EOF on socket: 1
p6_32157: p4_error: net_recv read: probable EOF on socket: 1
p7_32185: p4_error: net_recv read: probable EOF on socket: 1
p4_32101: p4_error: net_recv read: probable EOF on socket: 1
p5_32129: p4_error: net_recv read: probable EOF on socket: 1
rm_l_2_32068: (1003.843750) net_send: could not write to fd=5, errno = 32
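
To make the output above easier to read, here is a minimal Python sketch that
lists which process numbers reported a p4_error and looks up the errno value
from the last line. It assumes each error line has the form
"p<number>_<pid>: p4_error: <message>" (my reading of the output, not something
taken from the MPICH documentation), and the names LOG and summarize are only
illustrative. Only a short excerpt of the log is embedded; the rest can be
pasted in the same way.

```python
import errno
import os
import re

# Excerpt of the output above; the remaining lines can be pasted in as-is.
LOG = """\
p15_4211: p4_error: net_recv read: probable EOF on socket: 1
p3_32073: p4_error: net_recv read: probable EOF on socket: 1
rm_l_2_32068: (1003.843750) net_send: could not write to fd=5, errno = 32
"""

# Assumed layout: "p<number>_<pid>: p4_error: <message>", with an optional
# leading "> " left over from mail quoting.
P4_ERROR = re.compile(r"^(?:>\s*)?p(\d+)_(\d+): p4_error: (.*)$")
ERRNO = re.compile(r"errno = (\d+)")


def summarize(log):
    """Return (sorted process numbers with a p4_error, sorted errno values)."""
    ranks = []
    errnos = set()
    for line in log.splitlines():
        line = line.strip()
        match = P4_ERROR.match(line)
        if match:
            ranks.append(int(match.group(1)))
        found = ERRNO.search(line)
        if found:
            errnos.add(int(found.group(1)))
    return sorted(ranks), sorted(errnos)


ranks, errnos = summarize(LOG)
print("processes reporting p4_error:", ranks)
for code in errnos:
    # errno.errorcode maps the number to its symbolic name on this platform.
    print(code, errno.errorcode.get(code, "unknown"), os.strerror(code))
```

On Linux, errno 32 is EPIPE (broken pipe), i.e. a write to a socket whose other
end has already been closed. Run over the full output, the sketch would also
show that processes 0, 1, 2, 19, 20 and 21 do not appear among the p4_error
lines.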

I have tried changing P4_GLOBMEMSIZE, but it did not help. The machine runs
Linux (RHEL 5.2) and has 48 GB of memory.
Where should I look for errors? I know that mpich-1 is no longer supported, but
I am looking for help.