Hi all,<div><br></div><div>Recently, I implemented a very naive protocol to check for error messages before actually receive the message. To accomplish this, I used the MPI_PROBE function.</div><div>I have tested and run my program successfully several times, but I got the following error:</div>
<blockquote style="margin:0 0 0 40px;border:none;padding:0px"><div><ol><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Probe: Invalid communicator, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Probe(113): MPI_Probe(src=MPI_ANY_SOURCE, tag=MPI_ANY_TAG, MPI_COMM_WORLD, status=0x8185c80) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Probe(85).: Invalid communicator</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xffc98664, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..................: MPI_Send(buf=0xfff71364, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPIDI_EagerContigShortSend(262): failure occurred while attempting to send an eager message</span></li><li><span style="font-family:'courier new',monospace">MPIDI_CH3_iStartMsg(36)........: Communication error with rank 0</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xfffc8964, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xffdb4c64, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xff9c2f64, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xfffe1d64, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xffd5e464, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xff9ae864, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xff8df4e4, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xffc94fe4, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li>
<li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li><li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xff9caae4, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li>
<li><span style="font-family:'courier new',monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</span></li><li><span style="font-family:'courier new',monospace">Fatal error in MPI_Send: Other MPI error, error stack:</span></li>
<li><span style="font-family:'courier new',monospace">MPI_Send(173)..............: MPI_Send(buf=0xfffe7be4, count=1, MPI_INTEGER, dest=0, tag=97, MPI_COMM_WORLD) failed</span></li><li><font face="'courier new', monospace">MPID_nem_tcp_connpoll(1826): Communication error with rank 0: Connection refused</font> </li>
</ol></div></blockquote><div>a) Line 1 says that the communicator is invalid but at line 2 it is possible to see that the MPI_PROBE function recognized that the communicator is MPI_COMM_WORLD. In which conditions this can happen?</div>
<div>b) What is the meaning of the numbers right after the mpi functions names that appear inside the parenthesis on error messages (<span style="font-family:'courier new',monospace">MPI_Probe(113) for example</span>)?</div>
<div><br></div><div>I re-run the case and it worked perfectly.</div><div><br></div><div>I am using MPICH2 Version: 1.4.1p1.</div><div><br></div><div>Thanks in advance,</div><div>Luiz</div><div><br></div>