Dear Darius,<br><br>Yes, /tmp/app.ckpoint directory is already exist. And I didn''t find anything here after the program was running, or while the program still run. <br>But the execution always print 'requesting checkpoint... checkpoint completed'.<br>
<br>Please give me a suggestion. I think it would be great if MPICH2 can communicate with BLCR :).<br>Thank you<br><br>Best regards,<br>Bagus<br><br><div class="gmail_quote">On Mon, May 17, 2010 at 11:57 PM, Darius Buntinas <span dir="ltr"><<a href="mailto:buntinas@mcs.anl.gov">buntinas@mcs.anl.gov</a>></span> wrote:<br>
<blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;">It seems that you did things correctly. Did you recompile your application (cg) with the new mpich2? The make sure the /tmp/app.ckpoint directory exists (not just /tmp but /tmp/app.ckpoint).<br>
<br>
Note that this version of MPICH2 is still an alpha version, and we're still working the bugs out of it. We appreciate that you're giving this a try.<br>
<br>
-d<div><div></div><div class="h5"><br>
<br>
On 05/16/2010 03:43 AM, Bagus Jati Santoso wrote:<br>
</div></div><blockquote class="gmail_quote" style="margin: 0pt 0pt 0pt 0.8ex; border-left: 1px solid rgb(204, 204, 204); padding-left: 1ex;"><div><div></div><div class="h5">
Hello all,<br>
<br>
I have succesfully compiled blcr-0.8.2 and the target is in /mirror/blcr.<br>
<br>
Then I Install MPICH 2-1.3a2 that support BLCR, and the target is in<br>
/mirror/mpich2blcr.<br>
I installed it by :<br>
./configure --prefix=/mirror/mpich2blcr --with-blcr=/mirror/blcr<br>
--with-blcr-include=/mirror/blcr/include --with-blcr-lib=/mirror/blcr/lib<br>
After that :<br>
make<br>
sudo make install<br>
<br>
And it shows no error.<br>
<br>
My cluster is consisted of 11 computers with debian.<br>
<br>
Then I compile. After that I execute the program my CG program with :<br>
mpiexec -ckpointlib blcr -ckpoint-interval 4 -ckpoint-prefix<br>
/tmp/app.ckpoint ./cg bcsstk18.mtx<br>
<br>
It seems that checkpoint process is success, since i found 'requested<br>
checkpointing... checkpointing completed' every 4 seconds.<br>
<br>
But, why I can't found the checkpoint file in /tmp/app.ckpoint?<br>
Is all my above procedures correct?<br>
<br>
Thanks for your answers.<br>
<br>
Best regards,<br>
Bagus<br>
<br>
<br>
<br></div></div>
_______________________________________________<br>
mpich-discuss mailing list<br>
<a href="mailto:mpich-discuss@mcs.anl.gov" target="_blank">mpich-discuss@mcs.anl.gov</a><br>
<a href="https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss" target="_blank">https://lists.mcs.anl.gov/mailman/listinfo/mpich-discuss</a><br>
</blockquote>
</blockquote></div><br>