[mpich-discuss] BLCR library - Restart Problem

Darius Buntinas buntinas at mcs.anl.gov
Thu Sep 20 13:18:49 CDT 2012


Can you send us the command lines you used for the initial run and when restarting?

Thanks,
-d

On Sep 20, 2012, at 12:36 PM, Mehmet Kurt wrote:

> Hello,
> 
> I'm using BLCR checkpointing library with mpich2-1.4.1p1.
> 
> I have no problem with checkpointing my application, but when I want to restart it with the same set of nodes, nothing happens; it just hangs there.
> I connected the same node, which restarts the application by mpiexec, from another terminal. after running "top" command I saw that it
> creates a <DEFUNCT> process for my executable.
> 
> Any ideas about what can cause this behavior?
> 
> Thank you,
> 
> Mehmet Can Kurt
> -----------------------------
> Graduate Student
> Computer Engineering Department
> Ohio State University
> _______________________________________________
> mpich-discuss mailing list     mpich-discuss at mcs.anl.gov
> To manage subscription options or unsubscribe:
> https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss



More information about the mpich-discuss mailing list