Hi,<br><br>My code crashes, and I'm not sure how to debug the problem. I'm new to MPI/mpich programming, and any suggestions on debugging the problem would be appreciated. Here is the error output displayed by mpich:<br>
<br>[proxy:0:1@ComputeNodeIB101] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:906): assert (!closed) failed<br>[proxy:0:1@ComputeNodeIB101] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status<br>
[proxy:0:1@ComputeNodeIB101] main (./pm/pmiserv/pmip.c:222): demux engine error waiting for event<br>[proxy:0:2@ComputeNodeIB102] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:906): assert (!closed) failed<br>[proxy:0:2@ComputeNodeIB102] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status<br>
[proxy:0:2@ComputeNodeIB102] main (./pm/pmiserv/pmip.c:222): demux engine error waiting for event<br>[proxy:0:3@ComputeNodeIB103] HYD_pmcd_pmip_control_cmd_cb (./pm/pmiserv/pmip_cb.c:906): assert (!closed) failed<br>[proxy:0:3@ComputeNodeIB103] HYDT_dmxu_poll_wait_for_event (./tools/demux/demux_poll.c:77): callback returned error status<br>
[proxy:0:3@ComputeNodeIB103] main (./pm/pmiserv/pmip.c:222): demux engine error waiting for event<br><br>I'm using MPI_THREAD_MULTIPLE over an ib fabric. The problem doesn't occur all the time. I believe it occurs during a recv statement, but I'm not certain.<br>
<br>Thanks,<br>Jay<br>