[petsc-dev] odd log behavior

Mark Adams mfadams at lbl.gov
Tue Apr 26 11:02:33 CDT 2022


Well, Nans are a clear sign that something is very wrong.

On Tue, Apr 26, 2022 at 11:52 AM Jacob Faibussowitsch <jacob.fai at gmail.com>
wrote:

> There is an automatic warning that shows when you do run with
> `-log_view_gpu_time`, but perhaps there should also be an automatic warning
> when *not* running with it. It is unfortunate that NaN is the value printed
> as this implies a bug but AFAIK it is unavoidable (Barry can say more on
> this though).
>
> Best regards,
>
> Jacob Faibussowitsch
> (Jacob Fai - booss - oh - vitch)
>
> > On Apr 26, 2022, at 09:48, Jose E. Roman <jroman at dsic.upv.es> wrote:
> >
> > You have to add -log_view_gpu_time
> > See https://gitlab.com/petsc/petsc/-/merge_requests/5056
> >
> > Jose
> >
> >
> >> El 26 abr 2022, a las 16:39, Mark Adams <mfadams at lbl.gov> escribió:
> >>
> >> I'm seeing this on Perlmutter with Kokkos-CUDA. Nans in most log timing
> data except the two 'Solve' lines.
> >> Just cg/jacobi on snes/ex56.
> >>
> >> Any ideas?
> >>
> >> VecTDot                2 1.0   nan nan 1.20e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00 100
> >> VecNorm                2 1.0   nan nan 1.00e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00 100
> >> VecCopy                2 1.0   nan nan 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00  0
> >> VecSet                 5 1.0   nan nan 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00  0
> >> VecAXPY                4 1.0   nan nan 2.40e+01 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   1  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00 100
> >> VecPointwiseMult       1 1.0   nan nan 3.00e+00 1.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00 100
> >> KSPSetUp               1 1.0   nan nan 0.00e+00 0.0 0.0e+00 0.0e+00
> 0.0e+00  0  0  0  0  0   0  0  0  0  0  -nan    -nan      0 0.00e+00    0
> 0.00e+00  0
> >> KSPSolve               1 1.0 4.0514e-04 1.0 5.50e+01 1.0 0.0e+00
> 0.0e+00 0.0e+00  1  0  0  0  0   2  0  0  0  0     0    -nan      0
> 0.00e+00    0 0.00e+00 100
> >> SNESSolve              1 1.0 2.2128e-02 1.0 5.55e+05 1.0 0.0e+00
> 0.0e+00 0.0e+00 72 56  0  0  0 100100  0  0  0    25    -nan      0
> 0.00e+00    0 0.00e+00  0
> >
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20220426/c66080f2/attachment-0001.html>


More information about the petsc-dev mailing list