[mpich-discuss] recovering from a communicator failure

Rajeev Thakur thakur at mcs.anl.gov
Mon Sep 20 21:34:04 CDT 2010


If you use multiple threads, you must initialize with MPI_Init_thread instead of MPI_Init. And you can call MPI_Init or Init_thread only once in a process running an MPI program. You have to start a new process if you want to call it again.

Rajeev


On Sep 20, 2010, at 10:21 AM, Hiatt, Dave M wrote:

> I am currently doing mpi_init in thread 1 (thread 0 is my master thread), on the main app process for what becomes node 0.  If I have a fatal MPI error, and can catch it, is terminating the initiating thread (thread 1) sufficient to allow me to execute a new MPI_Init and recover?
>  
> “People get held back by the voice inside em” – K’naan – In the Beginning
>  
> Dave M. Hiatt
> Director, Risk Analytics
> CitiMortgage
> 1000 Technology Drive
> O'Fallon, MO 63368-2240
>  
> Telephone:  636-261-1408
>  
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss



More information about the mpich-discuss mailing list