[mpich-discuss] HYDRA and kill process

Pavan Balaji balaji at mcs.anl.gov
Thu Mar 24 16:25:06 CDT 2011


This should work correctly. The problem that was fixed earlier was with 
multiple process groups (i.e., using Comm_spawn).

Can you try the nightly snapshot in any case?

  -- Pavan

On 03/24/2011 02:16 PM, Jain, Rohit wrote:
> I have seen same problem and reported earlier. It is not just ctrl-c, even if your process on one host crashes, it leaves other parallel host processes.
>
> I think Pavan has already fixed it in newer version. Right, Pavan?
>
> Regards,
> Rohit
>
>
> -----Original Message-----
> From: mpich-discuss-bounces at mcs.anl.gov [mailto:mpich-discuss-bounces at mcs.anl.gov] On Behalf Of Torquil Macdonald Sørensen
> Sent: Thursday, March 24, 2011 12:10 PM
> To: mpich-discuss at mcs.anl.gov
> Subject: [mpich-discuss] HYDRA and kill process
>
> Hi!
>
> With mpich-1.3.2, say I run a parallel job consisting of two processes running a
> hostA and hostB. They are started by running mpiexec on hostA.
>
> If I (on hostA) hit CTRL-c to kill my job, the process on hostB keeps running...
>
> With earlier MPICH2 versions, without HYDRA, the behaviour was different. The
> whole job (all processes on all hosts) were killed when I hist CTRL-c on the
> host where mpiexec was run.
>
> Any pointers on how to "restore" the old behaviour when using the newest MPICH2
> and HYDRA?
>
> Or is it a bug?
>
> Best regards
> Torquil Sørensen
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss
> _______________________________________________
> mpich-discuss mailing list
> mpich-discuss at mcs.anl.gov
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss

-- 
Pavan Balaji
http://www.mcs.anl.gov/~balaji


More information about the mpich-discuss mailing list