[petsc-users] CG+GAMG convergence issues in GHEP Krylov-Schur for some MPI runs

Hong hzhang at mcs.anl.gov
Fri Nov 6 09:09:02 CST 2015


Denis:
Do you use shift-and-invert method for solving eigenvalue problem?
If so, the linear problems would be extremely ill-conditioned, for which
the direct solver, such LU or Cholesky are usually the only working option.

You may run your petsc/slepc code with option '-ksp_monitor' to observe
convergence behavior.

Hong

After running in debug mode it seems that the GAMG solver indeed did not
> converge, however throwing the error leads to SIGABRT (backtrace and frames
> are below).
> It is still very suspicious why would solving for (unchanged) mass matrix
> wouldn't converge inside SLEPc's spectral transformation.
>
> p.s. valgrind takes enormous amount of time on this problem,
> will try to leave it over the weekend.
>
> Denis.
>
> ===============
> Program received signal SIGABRT, Aborted.
> 0x00007fffea87fcc9 in __GI_raise (sig=sig at entry=6)
>     at ../nptl/sysdeps/unix/sysv/linux/raise.c:56
> 56      ../nptl/sysdeps/unix/sysv/linux/raise.c: No such file or directory.
> (gdb) bt
> #0  0x00007fffea87fcc9 in __GI_raise (sig=sig at entry=6)
>     at ../nptl/sysdeps/unix/sysv/linux/raise.c:56
> #1  0x00007fffea8830d8 in __GI_abort () at abort.c:89
> #2  0x00007fffeb790c91 in PetscTraceBackErrorHandler (comm=0x2a09bd0,
>     line=798, fun=0x7fffed0e24b9 <__func__.20043> "KSPSolve",
>     file=0x7fffed0e1620
> "/home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c",
> n=91, p=PETSC_ERROR_INITIAL,
>     mess=0x7fffffffac30 "KSPSolve has not converged", ctx=0x0)
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/sys/error/errtrace.c:243
> #3  0x00007fffeb78b8b9 in PetscError (comm=0x2a09bd0, line=798,
>     func=0x7fffed0e24b9 <__func__.20043> "KSPSolve",
>     file=0x7fffed0e1620
> "/home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c",
> n=91, p=PETSC_ERROR_INITIAL,
>     mess=0x7fffed0e1e7a "KSPSolve has not converged")
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/sys/error/err.c:377
> #4  0x00007fffec75e1e7 in KSPSolve (ksp=0x367227d0, b=0x35b285c0,
> x=0x35d89250)
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c:798
> #5  0x00007fffe32a8657 in STMatSolve (st=0x3672d820, b=0x35b285c0,
>     x=0x35d89250)
>     at
> /home/davydden/.hashdist/tmp/slepc-22nb32nbgvhx/src/sys/classes/st/interface/stsles.c:166
> ---Type <return> to continue, or q <return> to quit---q
> Quit
> (gdb) f 5
> #5  0x00007fffe32a8657 in STMatSolve (st=0x3672d820, b=0x35b285c0,
>     x=0x35d89250)
>     at
> /home/davydden/.hashdist/tmp/slepc-22nb32nbgvhx/src/sys/classes/st/interface/stsles.c:166
> 166       ierr = KSPSolve(st->ksp,b,x);CHKERRQ(ierr);
> (gdb) f 4
> #4  0x00007fffec75e1e7 in KSPSolve (ksp=0x367227d0, b=0x35b285c0,
> x=0x35d89250)
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c:798
> 798       if (ksp->errorifnotconverged && ksp->reason < 0)
> SETERRQ(comm,PETSC_ERR_NOT_CONVERGED,"KSPSolve has not converged");
> (gdb) f 3
> #3  0x00007fffeb78b8b9 in PetscError (comm=0x2a09bd0, line=798,
>     func=0x7fffed0e24b9 <__func__.20043> "KSPSolve",
>     file=0x7fffed0e1620
> "/home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c",
> n=91, p=PETSC_ERROR_INITIAL,
>     mess=0x7fffed0e1e7a "KSPSolve has not converged")
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/sys/error/err.c:377
> 377       if (!eh) ierr =
> PetscTraceBackErrorHandler(comm,line,func,file,n,p,lbuf,0);
> (gdb) f 2
> #2  0x00007fffeb790c91 in PetscTraceBackErrorHandler (comm=0x2a09bd0,
>     line=798, fun=0x7fffed0e24b9 <__func__.20043> "KSPSolve",
>     file=0x7fffed0e1620
> "/home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/ksp/ksp/interface/itfunc.c",
> n=91, p=PETSC_ERROR_INITIAL,
>     mess=0x7fffffffac30 "KSPSolve has not converged", ctx=0x0)
>     at
> /home/davydden/.hashdist/tmp/petsc-hujktg3j6hq7/src/sys/error/errtrace.c:243
> 243         abort();
> (gdb) f 1
> #1  0x00007fffea8830d8 in __GI_abort () at abort.c:89
> 89      abort.c: No such file or directory.
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20151106/851f6fd4/attachment.html>


More information about the petsc-users mailing list