[mpich-discuss] Error: collective abort of all ranks
Darius Buntinas
buntinas at mcs.anl.gov
Wed May 4 08:08:31 CDT 2011
Signal 11 indicates a segmentation fault. Most likely this is because of a bug in your program. If you compile your program with debugging symbols (using the -g flag) and enable core dumps (using "ulimit -c unlimited") you can use a debugger to determine the location of the fault.
-d
On May 4, 2011, at 6:33 AM, marcia nasare wrote:
> Hi, everyone!
>
> I'm finding trouble to execute my MPI programs. When execute an MPI
> program with mpiexec sometimes it terminates correctly, but sometimes
> it shows different error messages such as:
>
> rank 0 in job 1 PSGLABPE10011_53798 caused collective abort of all ranks
> exit status of rank 0: killed by signal 11
>
> I can't discover why it is happening.
>
> Some help?
>
> Thank you.
>
> Márcia
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
More information about the mpich-discuss
mailing list