[petsc-users] Questions Regarding PETSc and Solving Constrained Structural Mechanics Problems

Matthew Knepley knepley at gmail.com
Thu Jun 19 21:07:28 CDT 2025


On Thu, Jun 19, 2025 at 9:18 PM hexioafeng <hexiaofeng at buaa.edu.cn> wrote:

> Hello,
>
> Here are the outputs with svd:
>
> 0 KSP unpreconditioned resid norm 2.777777777778e+01 true resid norm
> 2.777777777778e+01 ||r(i)||/||b|| 1.000000000000e+00
>     Linear fieldsplit_0_mg_levels_1_ solve converged due to CONVERGED_ITS
> iterations 2
>     Linear fieldsplit_0_mg_levels_1_ solve converged due to CONVERGED_ITS
> iterations 2
>   Linear fieldsplit_1_ solve did not converge due to DIVERGED_PC_FAILED
>

You are running ILU(0) on your Schur complement, but it looks like it is
rank-deficient. You will have to use something that works for that (like
maybe GAMG again with SVD on the coarse grid). Is S elliptic?
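For example, if S is (close to) elliptic, something along these lines might be worth trying (untested; check the exact prefixes with -ksp_view and -options_left, and note that in this PETSc version the GAMG coarse solve is wrapped in bjacobi, which is why the extra "sub_" shows up, just as in the fieldsplit_0 output below):

    -fieldsplit_1_pc_type gamg
    -fieldsplit_1_mg_coarse_sub_pc_type svd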

  Thanks,

     Matt


> iterations 0
>                  PC failed due to SUBPC_ERROR
>     Linear fieldsplit_0_mg_levels_1_ solve converged due to CONVERGED_ITS
> iterations 2
>     Linear fieldsplit_0_mg_levels_1_ solve converged due to CONVERGED_ITS
> iterations 2
> Linear solve did not converge due to DIVERGED_PC_FAILED iterations 0
>                PC failed due to SUBPC_ERROR
> KSP Object: 1 MPI processes
>   type: cg
>   maximum iterations=200, initial guess is zero
>   tolerances:  relative=1e-06, absolute=1e-12, divergence=1e+30
>   left preconditioning
>   using UNPRECONDITIONED norm type for convergence test
> PC Object: 1 MPI processes
>   type: fieldsplit
>     FieldSplit with Schur preconditioner, blocksize = 1, factorization FULL
>     Preconditioner for the Schur complement formed from Sp, an assembled
> approximation to S, which uses A00's diagonal's inverse
>     Split info:
>     Split number 0 Defined by IS
>     Split number 1 Defined by IS
>     KSP solver for A00 block
>       KSP Object: (fieldsplit_0_) 1 MPI processes
>         type: preonly
>         maximum iterations=10000, initial guess is zero
>         tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>         left preconditioning
>         using NONE norm type for convergence test
>       PC Object: (fieldsplit_0_) 1 MPI processes
>         type: gamg
>           type is MULTIPLICATIVE, levels=2 cycles=v
>             Cycles per PCApply=1
>             Using externally compute Galerkin coarse grid matrices
>             GAMG specific options
>               Threshold for dropping small values in graph on each level =
>
>               Threshold scaling factor for each level not specified = 1.
>               AGG specific options
>                 Symmetric graph false
>                 Number of levels to square graph 1
>                 Number smoothing steps 1
>               Complexity:    grid = 1.00222
>         Coarse grid solver -- level -------------------------------
>           KSP Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>             type: preonly
>             maximum iterations=10000, initial guess is zero
>             tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>             left preconditioning
>             using NONE norm type for convergence test
>           PC Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>             type: bjacobi
>               number of blocks = 1
>               Local solver is the same for all blocks, as in the following
> KSP and PC objects on rank 0:
>             KSP Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI processes
>               type: preonly
>               maximum iterations=1, initial guess is zero
>               tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>               left preconditioning
>               using NONE norm type for convergence test
>             PC Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI processes
>               type: svd
>                 All singular values smaller than 1e-12 treated as zero
>                 Provided essential rank of the matrix 0 (all other
> eigenvalues are zeroed)
>               linear system matrix = precond matrix:
>               Mat Object: 1 MPI processes
>                 type: seqaij
>                 rows=8, cols=8
>                 total: nonzeros=56, allocated nonzeros=56
>                 total number of mallocs used during MatSetValues calls=0
>                   using I-node routines: found 3 nodes, limit used is 5
>             linear system matrix = precond matrix:
>             Mat Object: 1 MPI processes
>               type: mpiaij
>               rows=8, cols=8
>               total: nonzeros=56, allocated nonzeros=56
>               total number of mallocs used during MatSetValues calls=0
>                 using nonscalable MatPtAP() implementation
>                 using I-node (on process 0) routines: found 3 nodes, limit
> used is 5
>         Down solver (pre-smoother) on level 1
> -------------------------------
>           KSP Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>             type: chebyshev
>               eigenvalue estimates used:  min = 0.0998145, max = 1.09796
>               eigenvalues estimate via gmres min 0.00156735, max 0.998145
>               eigenvalues estimated using gmres with translations  [0.
> 0.1; 0. 1.1]
>               KSP Object: (fieldsplit_0_mg_levels_1_esteig_) 1 MPI
> processes
>                 type: gmres
>                   restart=30, using Classical (unmodified) Gram-Schmidt
> Orthogonalization with no iterative refinement
>                   happy breakdown tolerance 1e-30
>                 maximum iterations=10, initial guess is zero
>                 tolerances:  relative=1e-12, absolute=1e-50,
> divergence=10000.
>                 left preconditioning
>                 using PRECONDITIONED norm type for convergence test
>               estimating eigenvalues using noisy right hand side
>             maximum iterations=2, nonzero initial guess
>             tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>             left preconditioning
>             using NONE norm type for convergence test
>           PC Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>             type: sor
>               type = local_symmetric, iterations = 1, local iterations =
> 1, omega = 1.
>             linear system matrix = precond matrix:
>             Mat Object: (fieldsplit_0_) 1 MPI processes
>               type: mpiaij
>               rows=480, cols=480
>               total: nonzeros=25200, allocated nonzeros=25200
>               total number of mallocs used during MatSetValues calls=0
>                 using I-node (on process 0) routines: found 160 nodes,
> limit used is 5
>         Up solver (post-smoother) same as down solver (pre-smoother)
>         linear system matrix = precond matrix:
>         Mat Object: (fieldsplit_0_) 1 MPI processes
>           type: mpiaij
>           rows=480, cols=480
>           total: nonzeros=25200, allocated nonzeros=25200
>           total number of mallocs used during MatSetValues calls=0
>             using I-node (on process 0) routines: found 160 nodes, limit
> used is 5
>     KSP solver for S = A11 - A10 inv(A00) A01
>       KSP Object: (fieldsplit_1_) 1 MPI processes
>         type: preonly
>         maximum iterations=10000, initial guess is zero
>         tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>         left preconditioning
>         using NONE norm type for convergence test
>       PC Object: (fieldsplit_1_) 1 MPI processes
>         type: bjacobi
>           number of blocks = 1
>           Local solver is the same for all blocks, as in the following KSP
> and PC objects on rank 0:
>         KSP Object: (fieldsplit_1_sub_) 1 MPI processes
>           type: preonly
>           maximum iterations=10000, initial guess is zero
>           tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>           left preconditioning
>           using NONE norm type for convergence test
>         PC Object: (fieldsplit_1_sub_) 1 MPI processes
>           type: bjacobi
>             number of blocks = 1
>             Local solver is the same for all blocks, as in the following
> KSP and PC objects on rank 0:
>                     KSP Object:           (fieldsplit_1_sub_sub_)
>   1 MPI processes
>                       type: preonly
>                       maximum iterations=10000, initial guess is zero
>                       tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>                       left preconditioning
>                       using NONE norm type for convergence test
>                     PC Object:           (fieldsplit_1_sub_sub_)
> 1 MPI processes
>                       type: ilu
>                         out-of-place factorization
>                         0 levels of fill
>                         tolerance for zero pivot 2.22045e-14
>                         matrix ordering: natural
>                         factor fill ratio given 1., needed 1.
>                           Factored matrix follows:
>                             Mat Object:           1 MPI processes
>                               type: seqaij
>                               rows=144, cols=144
>                               package used to perform factorization: petsc
>                               total: nonzeros=240, allocated nonzeros=240
>                                 not using I-node routines
>                       linear system matrix = precond matrix:
>                       Mat Object:           1 MPI processes
>                         type: seqaij
>                         rows=144, cols=144
>                         total: nonzeros=240, allocated nonzeros=240
>                         total number of mallocs used during MatSetValues
> calls=0
>                           not using I-node routines
>           linear system matrix = precond matrix:
>           Mat Object: 1 MPI processes
>             type: mpiaij
>             rows=144, cols=144
>             total: nonzeros=240, allocated nonzeros=240
>             total number of mallocs used during MatSetValues calls=0
>               not using I-node (on process 0) routines
>         linear system matrix followed by preconditioner matrix:
>         Mat Object: (fieldsplit_1_) 1 MPI processes
>           type: schurcomplement
>           rows=144, cols=144
>             Schur complement A11 - A10 inv(A00) A01
>             A11
>               Mat Object: (fieldsplit_1_) 1 MPI processes
>                 type: mpiaij
>                 rows=144, cols=144
>                 total: nonzeros=240, allocated nonzeros=240
>                 total number of mallocs used during MatSetValues calls=0
>                   not using I-node (on process 0) routines
>             A10
>               Mat Object: 1 MPI processes
>                 type: mpiaij
>                 rows=144, cols=480
>                 total: nonzeros=48, allocated nonzeros=48
>                 total number of mallocs used during MatSetValues calls=0
>                   using I-node (on process 0) routines: found 74 nodes,
> limit used is 5
>             KSP of A00
>               KSP Object: (fieldsplit_0_) 1 MPI processes
>                 type: preonly
>                 maximum iterations=10000, initial guess is zero
>                 tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>                 left preconditioning
>                 using NONE norm type for convergence test
>               PC Object: (fieldsplit_0_) 1 MPI processes
>                 type: gamg
>                   type is MULTIPLICATIVE, levels=2 cycles=v
>                     Cycles per PCApply=1
>                     Using externally compute Galerkin coarse grid matrices
>                     GAMG specific options
>                       Threshold for dropping small values in graph on each
> level =
>                       Threshold scaling factor for each level not
> specified = 1.
>                       AGG specific options
>                         Symmetric graph false
>                         Number of levels to square graph 1
>                         Number smoothing steps 1
>                       Complexity:    grid = 1.00222
>                 Coarse grid solver -- level -------------------------------
>                   KSP Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>                     type: preonly
>                     maximum iterations=10000, initial guess is zero
>                     tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>                     left preconditioning
>                     using NONE norm type for convergence test
>                   PC Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>                     type: bjacobi
>                       number of blocks = 1
>                       Local solver is the same for all blocks, as in the
> following KSP and PC objects on rank 0:
>                     KSP Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI
> processes
>                       type: preonly
>                       maximum iterations=1, initial guess is zero
>                       tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>                       left preconditioning
>                       using NONE norm type for convergence test
>                     PC Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI
> processes
>                       type: svd
>                         All singular values smaller than 1e-12 treated as
> zero
>                         Provided essential rank of the matrix 0 (all other
> eigenvalues are zeroed)
>                       linear system matrix = precond matrix:
>                       Mat Object: 1 MPI processes
>                         type: seqaij
>                         rows=8, cols=8
>                         total: nonzeros=56, allocated nonzeros=56
>                         total number of mallocs used during MatSetValues
> calls=0
>                           using I-node routines: found 3 nodes, limit used
> is 5
>                     linear system matrix = precond matrix:
>                     Mat Object: 1 MPI processes
>                       type: mpiaij
>                       rows=8, cols=8
>                       total: nonzeros=56, allocated nonzeros=56
>                       total number of mallocs used during MatSetValues
> calls=0
>                         using nonscalable MatPtAP() implementation
>                         using I-node (on process 0) routines: found 3
> nodes, limit used is 5
>                 Down solver (pre-smoother) on level 1
> -------------------------------
>                   KSP Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>                     type: chebyshev
>                       eigenvalue estimates used:  min = 0.0998145, max =
> 1.09796
>                       eigenvalues estimate via gmres min 0.00156735, max
> 0.998145
>                       eigenvalues estimated using gmres with translations
>  [0. 0.1; 0. 1.1]
>                       KSP Object: (fieldsplit_0_mg_levels_1_esteig_) 1 MPI
> processes
>                         type: gmres
>                           restart=30, using Classical (unmodified)
> Gram-Schmidt Orthogonalization with no iterative refinement
>                           happy breakdown tolerance 1e-30
>                         maximum iterations=10, initial guess is zero
>                         tolerances:  relative=1e-12, absolute=1e-50,
> divergence=10000.
>                         left preconditioning
>                         using PRECONDITIONED norm type for convergence test
>                       estimating eigenvalues using noisy right hand side
>                     maximum iterations=2, nonzero initial guess
>                     tolerances:  relative=1e-05, absolute=1e-50,
> divergence=10000.
>                     left preconditioning
>                     using NONE norm type for convergence test
>                   PC Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>                     type: sor
>                       type = local_symmetric, iterations = 1, local
> iterations = 1, omega = 1.
>                     linear system matrix = precond matrix:
>                     Mat Object: (fieldsplit_0_) 1 MPI processes
>                       type: mpiaij
>                       rows=480, cols=480
>                       total: nonzeros=25200, allocated nonzeros=25200
>                       total number of mallocs used during MatSetValues
> calls=0
>                         using I-node (on process 0) routines: found 160
> nodes, limit used is 5
>                 Up solver (post-smoother) same as down solver
> (pre-smoother)
>                 linear system matrix = precond matrix:
>                 Mat Object: (fieldsplit_0_) 1 MPI processes
>                   type: mpiaij
>                   rows=480, cols=480
>                   total: nonzeros=25200, allocated nonzeros=25200
>                   total number of mallocs used during MatSetValues calls=0
>                     using I-node (on process 0) routines: found 160 nodes,
> limit used is 5
>             A01
>               Mat Object: 1 MPI processes
>                 type: mpiaij
>                 rows=480, cols=144
>                 total: nonzeros=48, allocated nonzeros=48
>                 total number of mallocs used during MatSetValues calls=0
>                   using I-node (on process 0) routines: found 135 nodes,
> limit used is 5
>         Mat Object: 1 MPI processes
>           type: mpiaij
>           rows=144, cols=144
>           total: nonzeros=240, allocated nonzeros=240
>           total number of mallocs used during MatSetValues calls=0
>             not using I-node (on process 0) routines
>   linear system matrix = precond matrix:
>   Mat Object: 1 MPI processes
>     type: mpiaij
>     rows=624, cols=624
>     total: nonzeros=25536, allocated nonzeros=25536
>     total number of mallocs used during MatSetValues calls=0
>       using I-node (on process 0) routines: found 336 nodes, limit used is
> 5
>
>
> Thanks,
> Xiaofeng
>
>
>
> On Jun 20, 2025, at 00:56, Mark Adams <mfadams at lbl.gov> wrote:
>
> This is what Matt is looking at:
>
>                 PC Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI processes
>                       type: lu
>
> This should be svd, not lu
>
> If you had used -options_left, you would have caught this mistake.
>
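As a concrete sketch (./your_app is only a placeholder for the actual executable), appending -options_left to the usual command line makes PETSc report, at the end of the run, every option that was set but never used, which catches misspelled or misprefixed options:

    mpiexec -n 1 ./your_app -ksp_type cg -pc_type fieldsplit \
        -fieldsplit_0_pc_type gamg -fieldsplit_0_mg_coarse_sub_pc_type svd \
        -options_left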
> On Thu, Jun 19, 2025 at 8:06 AM Matthew Knepley <knepley at gmail.com> wrote:
>
>> On Thu, Jun 19, 2025 at 7:59 AM hexioafeng <hexiaofeng at buaa.edu.cn>
>> wrote:
>>
>>> Hello sir,
>>>
>>> I removed the duplicated "_type" and got the same error and output.
>>>
>>
>> The output cannot be the same. Please send it.
>>
>>   Thanks,
>>
>>      Matt
>>
>>
>>> Best regards,
>>> Xiaofeng
>>>
>>>
>>> On Jun 19, 2025, at 19:45, Matthew Knepley <knepley at gmail.com> wrote:
>>>
>>> This option is wrong:
>>>
>>>   -fieldsplit_0_mg_coarse_sub_pc_type_type svd
>>>
>>> Notice that "_type" is repeated.
>>>
>>>   Thanks,
>>>
>>>       Matt
>>>
>>> On Thu, Jun 19, 2025 at 7:10 AM hexioafeng <hexiaofeng at buaa.edu.cn>
>>> wrote:
>>>
>>>> Dear authors,
>>>>
>>>> Here are the options passed with fieldsplit preconditioner:
>>>>
>>>> -ksp_type cg -pc_type fieldsplit -pc_fieldsplit_detect_saddle_point
>>>>  -pc_fieldsplit_type schur -pc_fieldsplit_schur_precondition selfp
>>>> -pc_fieldsplit_schur_fact_type full -fieldsplit_0_ksp_type preonly
>>>> -fieldsplit_0_pc_type gamg -fieldsplit_0_mg_coarse_sub_pc_type_type svd
>>>> -fieldsplit_1_ksp_type preonly -fieldsplit_1_pc_type bjacobi -ksp_view
>>>>  -ksp_monitor_true_residual  -ksp_converged_reason
>>>>  -fieldsplit_0_mg_levels_ksp_monitor_true_residual
>>>>  -fieldsplit_0_mg_levels_ksp_converged_reason
>>>>  -fieldsplit_1_ksp_monitor_true_residual
>>>>  -fieldsplit_1_ksp_converged_reason
>>>>
>>>> and the output:
>>>>
>>>> 0 KSP unpreconditioned resid norm 2.777777777778e+01 true resid norm
>>>> 2.777777777778e+01 ||r(i)||/||b|| 1.000000000000e+00
>>>>     Linear fieldsplit_0_mg_levels_1_ solve converged due to
>>>> CONVERGED_ITS iterations 2
>>>>     Linear fieldsplit_0_mg_levels_1_ solve converged due to
>>>> CONVERGED_ITS iterations 2
>>>>   Linear fieldsplit_1_ solve did not converge due to DIVERGED_PC_FAILED
>>>> iterations 0
>>>>                  PC failed due to SUBPC_ERROR
>>>>     Linear fieldsplit_0_mg_levels_1_ solve converged due to
>>>> CONVERGED_ITS iterations 2
>>>>     Linear fieldsplit_0_mg_levels_1_ solve converged due to
>>>> CONVERGED_ITS iterations 2
>>>> Linear solve did not converge due to DIVERGED_PC_FAILED iterations 0
>>>>                PC failed due to SUBPC_ERROR
>>>> KSP Object: 1 MPI processes
>>>>   type: cg
>>>>   maximum iterations=200, initial guess is zero
>>>>   tolerances:  relative=1e-06, absolute=1e-12, divergence=1e+30
>>>>   left preconditioning
>>>>   using UNPRECONDITIONED norm type for convergence test
>>>> PC Object: 1 MPI processes
>>>>   type: fieldsplit
>>>>     FieldSplit with Schur preconditioner, blocksize = 1, factorization
>>>> FULL
>>>>     Preconditioner for the Schur complement formed from Sp, an
>>>> assembled approximation to S, which uses A00's diagonal's inverse
>>>>     Split info:
>>>>     Split number 0 Defined by IS
>>>>     Split number 1 Defined by IS
>>>>     KSP solver for A00 block
>>>>       KSP Object: (fieldsplit_0_) 1 MPI processes
>>>>         type: preonly
>>>>         maximum iterations=10000, initial guess is zero
>>>>         tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>         left preconditioning
>>>>         using NONE norm type for convergence test
>>>>       PC Object: (fieldsplit_0_) 1 MPI processes
>>>>         type: gamg
>>>>           type is MULTIPLICATIVE, levels=2 cycles=v
>>>>             Cycles per PCApply=1
>>>>             Using externally compute Galerkin coarse grid matrices
>>>>             GAMG specific options
>>>>               Threshold for dropping small values in graph on each
>>>> level =
>>>>               Threshold scaling factor for each level not specified = 1.
>>>>               AGG specific options
>>>>                 Symmetric graph false
>>>>                 Number of levels to square graph 1
>>>>                 Number smoothing steps 1
>>>>               Complexity:    grid = 1.00222
>>>>         Coarse grid solver -- level -------------------------------
>>>>           KSP Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>>>>             type: preonly
>>>>             maximum iterations=10000, initial guess is zero
>>>>             tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>             left preconditioning
>>>>             using NONE norm type for convergence test
>>>>           PC Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>>>>             type: bjacobi
>>>>               number of blocks = 1
>>>>               Local solver is the same for all blocks, as in the
>>>> following KSP and PC objects on rank 0:
>>>>             KSP Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI processes
>>>>               type: preonly
>>>>               maximum iterations=1, initial guess is zero
>>>>               tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>               left preconditioning
>>>>               using NONE norm type for convergence test
>>>>             PC Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI processes
>>>>               type: lu
>>>>                 out-of-place factorization
>>>>                 tolerance for zero pivot 2.22045e-14
>>>>                 using diagonal shift on blocks to prevent zero pivot
>>>> [INBLOCKS]
>>>>                 matrix ordering: nd
>>>>                 factor fill ratio given 5., needed 1.
>>>>                   Factored matrix follows:
>>>>                     Mat Object: 1 MPI processes
>>>>                       type: seqaij
>>>>                       rows=8, cols=8
>>>>                       package used to perform factorization: petsc
>>>>                       total: nonzeros=56, allocated nonzeros=56
>>>>                         using I-node routines: found 3 nodes, limit
>>>> used is 5
>>>>               linear system matrix = precond matrix:
>>>>               Mat Object: 1 MPI processes
>>>>                 type: seqaij
>>>>                 rows=8, cols=8
>>>>                 total: nonzeros=56, allocated nonzeros=56
>>>>                 total number of mallocs used during MatSetValues calls=0
>>>>                   using I-node routines: found 3 nodes, limit used is 5
>>>>             linear system matrix = precond matrix:
>>>>             Mat Object: 1 MPI processes
>>>>               type: mpiaij
>>>>               rows=8, cols=8
>>>>               total: nonzeros=56, allocated nonzeros=56
>>>>               total number of mallocs used during MatSetValues calls=0
>>>>                 using nonscalable MatPtAP() implementation
>>>>                 using I-node (on process 0) routines: found 3 nodes,
>>>> limit used is 5
>>>>         Down solver (pre-smoother) on level 1
>>>> -------------------------------
>>>>           KSP Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>>>>             type: chebyshev
>>>>               eigenvalue estimates used:  min = 0.0998145, max = 1.09796
>>>>               eigenvalues estimate via gmres min 0.00156735, max
>>>> 0.998145
>>>>               eigenvalues estimated using gmres with translations  [0.
>>>> 0.1; 0. 1.1]
>>>>               KSP Object: (fieldsplit_0_mg_levels_1_esteig_) 1 MPI
>>>> processes
>>>>                 type: gmres
>>>>                   restart=30, using Classical (unmodified) Gram-Schmidt
>>>> Orthogonalization with no iterative refinement
>>>>                   happy breakdown tolerance 1e-30
>>>>                 maximum iterations=10, initial guess is zero
>>>>                 tolerances:  relative=1e-12, absolute=1e-50,
>>>> divergence=10000.
>>>>                 left preconditioning
>>>>                 using PRECONDITIONED norm type for convergence test
>>>>               estimating eigenvalues using noisy right hand side
>>>>             maximum iterations=2, nonzero initial guess
>>>>             tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>             left preconditioning
>>>>             using NONE norm type for convergence test
>>>>           PC Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>>>>             type: sor
>>>>               type = local_symmetric, iterations = 1, local iterations
>>>> = 1, omega = 1.
>>>>             linear system matrix = precond matrix:
>>>>             Mat Object: (fieldsplit_0_) 1 MPI processes
>>>>               type: mpiaij
>>>>               rows=480, cols=480
>>>>               total: nonzeros=25200, allocated nonzeros=25200
>>>>               total number of mallocs used during MatSetValues calls=0
>>>>                 using I-node (on process 0) routines: found 160 nodes,
>>>> limit used is 5
>>>>         Up solver (post-smoother) same as down solver (pre-smoother)
>>>>         linear system matrix = precond matrix:
>>>>         Mat Object: (fieldsplit_0_) 1 MPI processes
>>>>           type: mpiaij
>>>>           rows=480, cols=480
>>>>           total: nonzeros=25200, allocated nonzeros=25200
>>>>           total number of mallocs used during MatSetValues calls=0
>>>>             using I-node (on process 0) routines: found 160 nodes,
>>>> limit used is 5
>>>>     KSP solver for S = A11 - A10 inv(A00) A01
>>>>       KSP Object: (fieldsplit_1_) 1 MPI processes
>>>>         type: preonly
>>>>         maximum iterations=10000, initial guess is zero
>>>>         tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>         left preconditioning
>>>>         using NONE norm type for convergence test
>>>>       PC Object: (fieldsplit_1_) 1 MPI processes
>>>>         type: bjacobi
>>>>           number of blocks = 1
>>>>           Local solver is the same for all blocks, as in the following
>>>> KSP and PC objects on rank 0:
>>>>         KSP Object: (fieldsplit_1_sub_) 1 MPI processes
>>>>           type: preonly
>>>>           maximum iterations=10000, initial guess is zero
>>>>           tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>           left preconditioning
>>>>           using NONE norm type for convergence test
>>>>         PC Object: (fieldsplit_1_sub_) 1 MPI processes
>>>>           type: bjacobi
>>>>             number of blocks = 1
>>>>             Local solver is the same for all blocks, as in the
>>>> following KSP and PC objects on rank 0:
>>>>                     KSP Object:           (fieldsplit_1_sub_sub_)
>>>>     1 MPI processes
>>>>                       type: preonly
>>>>                       maximum iterations=10000, initial guess is zero
>>>>                       tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>                       left preconditioning
>>>>                       using NONE norm type for convergence test
>>>>                     PC Object:           (fieldsplit_1_sub_sub_)
>>>>     1 MPI processes
>>>>                       type: ilu
>>>>                         out-of-place factorization
>>>>                         0 levels of fill
>>>>                         tolerance for zero pivot 2.22045e-14
>>>>                         matrix ordering: natural
>>>>                         factor fill ratio given 1., needed 1.
>>>>                           Factored matrix follows:
>>>>                             Mat Object:           1 MPI processes
>>>>                               type: seqaij
>>>>                               rows=144, cols=144
>>>>                               package used to perform factorization:
>>>> petsc
>>>>                               total: nonzeros=240, allocated
>>>> nonzeros=240
>>>>                                 not using I-node routines
>>>>                       linear system matrix = precond matrix:
>>>>                       Mat Object:           1 MPI processes
>>>>                         type: seqaij
>>>>                         rows=144, cols=144
>>>>                         total: nonzeros=240, allocated nonzeros=240
>>>>                         total number of mallocs used during
>>>> MatSetValues calls=0
>>>>                           not using I-node routines
>>>>           linear system matrix = precond matrix:
>>>>           Mat Object: 1 MPI processes
>>>>             type: mpiaij
>>>>             rows=144, cols=144
>>>>             total: nonzeros=240, allocated nonzeros=240
>>>>             total number of mallocs used during MatSetValues calls=0
>>>>               not using I-node (on process 0) routines
>>>>         linear system matrix followed by preconditioner matrix:
>>>>         Mat Object: (fieldsplit_1_) 1 MPI processes
>>>>           type: schurcomplement
>>>>           rows=144, cols=144
>>>>             Schur complement A11 - A10 inv(A00) A01
>>>>             A11
>>>>               Mat Object: (fieldsplit_1_) 1 MPI processes
>>>>                 type: mpiaij
>>>>                 rows=144, cols=144
>>>>                 total: nonzeros=240, allocated nonzeros=240
>>>>                 total number of mallocs used during MatSetValues calls=0
>>>>                   not using I-node (on process 0) routines
>>>>             A10
>>>>               Mat Object: 1 MPI processes
>>>>                 type: mpiaij
>>>>                 rows=144, cols=480
>>>>                 total: nonzeros=48, allocated nonzeros=48
>>>>                 total number of mallocs used during MatSetValues calls=0
>>>>                   using I-node (on process 0) routines: found 74 nodes,
>>>> limit used is 5
>>>>             KSP of A00
>>>>               KSP Object: (fieldsplit_0_) 1 MPI processes
>>>>                 type: preonly
>>>>                 maximum iterations=10000, initial guess is zero
>>>>                 tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>                 left preconditioning
>>>>                 using NONE norm type for convergence test
>>>>               PC Object: (fieldsplit_0_) 1 MPI processes
>>>>                 type: gamg
>>>>                   type is MULTIPLICATIVE, levels=2 cycles=v
>>>>                     Cycles per PCApply=1
>>>>                     Using externally compute Galerkin coarse grid
>>>> matrices
>>>>                     GAMG specific options
>>>>                       Threshold for dropping small values in graph on
>>>> each level =
>>>>                       Threshold scaling factor for each level not
>>>> specified = 1.
>>>>                       AGG specific options
>>>>                         Symmetric graph false
>>>>                         Number of levels to square graph 1
>>>>                         Number smoothing steps 1
>>>>                       Complexity:    grid = 1.00222
>>>>                 Coarse grid solver -- level
>>>> -------------------------------
>>>>                   KSP Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>>>>                     type: preonly
>>>>                     maximum iterations=10000, initial guess is zero
>>>>                     tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>                     left preconditioning
>>>>                     using NONE norm type for convergence test
>>>>                   PC Object: (fieldsplit_0_mg_coarse_) 1 MPI processes
>>>>                     type: bjacobi
>>>>                       number of blocks = 1
>>>>                       Local solver is the same for all blocks, as in
>>>> the following KSP and PC objects on rank 0:
>>>>                     KSP Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI
>>>> processes
>>>>                       type: preonly
>>>>                       maximum iterations=1, initial guess is zero
>>>>                       tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>                       left preconditioning
>>>>                       using NONE norm type for convergence test
>>>>                     PC Object: (fieldsplit_0_mg_coarse_sub_) 1 MPI
>>>> processes
>>>>                       type: lu
>>>>                         out-of-place factorization
>>>>                         tolerance for zero pivot 2.22045e-14
>>>>                         using diagonal shift on blocks to prevent zero
>>>> pivot [INBLOCKS]
>>>>                         matrix ordering: nd
>>>>                         factor fill ratio given 5., needed 1.
>>>>                           Factored matrix follows:
>>>>                             Mat Object: 1 MPI processes
>>>>                               type: seqaij
>>>>                               rows=8, cols=8
>>>>                               package used to perform factorization:
>>>> petsc
>>>>                               total: nonzeros=56, allocated nonzeros=56
>>>>                                 using I-node routines: found 3 nodes,
>>>> limit used is 5
>>>>                       linear system matrix = precond matrix:
>>>>                       Mat Object: 1 MPI processes
>>>>                         type: seqaij
>>>>                         rows=8, cols=8
>>>>                         total: nonzeros=56, allocated nonzeros=56
>>>>                         total number of mallocs used during
>>>> MatSetValues calls=0
>>>>                           using I-node routines: found 3 nodes, limit
>>>> used is 5
>>>>                     linear system matrix = precond matrix:
>>>>                     Mat Object: 1 MPI processes
>>>>                       type: mpiaij
>>>>                       rows=8, cols=8
>>>>                       total: nonzeros=56, allocated nonzeros=56
>>>>                       total number of mallocs used during MatSetValues
>>>> calls=0
>>>>                         using nonscalable MatPtAP() implementation
>>>>                         using I-node (on process 0) routines: found 3
>>>> nodes, limit used is 5
>>>>                 Down solver (pre-smoother) on level 1
>>>> -------------------------------
>>>>                   KSP Object: (fieldsplit_0_mg_levels_1_) 1 MPI
>>>> processes
>>>>                     type: chebyshev
>>>>                       eigenvalue estimates used:  min = 0.0998145, max
>>>> = 1.09796
>>>>                       eigenvalues estimate via gmres min 0.00156735,
>>>> max 0.998145
>>>>                       eigenvalues estimated using gmres with
>>>> translations  [0. 0.1; 0. 1.1]
>>>>                       KSP Object: (fieldsplit_0_mg_levels_1_esteig_) 1
>>>> MPI processes
>>>>                         type: gmres
>>>>                           restart=30, using Classical (unmodified)
>>>> Gram-Schmidt Orthogonalization with no iterative refinement
>>>>                           happy breakdown tolerance 1e-30
>>>>                         maximum iterations=10, initial guess is zero
>>>>                         tolerances:  relative=1e-12, absolute=1e-50,
>>>> divergence=10000.
>>>>                         left preconditioning
>>>>                         using PRECONDITIONED norm type for convergence
>>>> test
>>>>                       estimating eigenvalues using noisy right hand side
>>>>                     maximum iterations=2, nonzero initial guess
>>>>                     tolerances:  relative=1e-05, absolute=1e-50,
>>>> divergence=10000.
>>>>                     left preconditioning
>>>>                     using NONE norm type for convergence test
>>>>                   PC Object: (fieldsplit_0_mg_levels_1_) 1 MPI processes
>>>>                     type: sor
>>>>                       type = local_symmetric, iterations = 1, local
>>>> iterations = 1, omega = 1.
>>>>                     linear system matrix = precond matrix:
>>>>                     Mat Object: (fieldsplit_0_) 1 MPI processes
>>>>                       type: mpiaij
>>>>                       rows=480, cols=480
>>>>                       total: nonzeros=25200, allocated nonzeros=25200
>>>>                       total number of mallocs used during MatSetValues
>>>> calls=0
>>>>                         using I-node (on process 0) routines: found 160
>>>> nodes, limit used is 5
>>>>                 Up solver (post-smoother) same as down solver
>>>> (pre-smoother)
>>>>                 linear system matrix = precond matrix:
>>>>                 Mat Object: (fieldsplit_0_) 1 MPI processes
>>>>                   type: mpiaij
>>>>                   rows=480, cols=480
>>>>                   total: nonzeros=25200, allocated nonzeros=25200
>>>>                   total number of mallocs used during MatSetValues
>>>> calls=0
>>>>                     using I-node (on process 0) routines: found 160
>>>> nodes, limit used is 5
>>>>             A01
>>>>               Mat Object: 1 MPI processes
>>>>                 type: mpiaij
>>>>                 rows=480, cols=144
>>>>                 total: nonzeros=48, allocated nonzeros=48
>>>>                 total number of mallocs used during MatSetValues calls=0
>>>>                   using I-node (on process 0) routines: found 135
>>>> nodes, limit used is 5
>>>>         Mat Object: 1 MPI processes
>>>>           type: mpiaij
>>>>           rows=144, cols=144
>>>>           total: nonzeros=240, allocated nonzeros=240
>>>>           total number of mallocs used during MatSetValues calls=0
>>>>             not using I-node (on process 0) routines
>>>>   linear system matrix = precond matrix:
>>>>   Mat Object: 1 MPI processes
>>>>     type: mpiaij
>>>>     rows=624, cols=624
>>>>     total: nonzeros=25536, allocated nonzeros=25536
>>>>     total number of mallocs used during MatSetValues calls=0
>>>>       using I-node (on process 0) routines: found 336 nodes, limit used
>>>> is 5
>>>>
>>>>
>>>> Thanks,
>>>> Xiaofeng
>>>>
>>>>
>>>>
>>>> On Jun 17, 2025, at 19:05, Mark Adams <mfadams at lbl.gov> wrote:
>>>>
>>>> And don't use -pc_gamg_parallel_coarse_grid_solver
>>>> You can use that in production but for debugging use -mg_coarse_pc_type
>>>> svd
>>>> Also, use -options_left and remove anything that is not used.
>>>> (I am puzzled, I see -pc_type gamg not -pc_type fieldsplit)
>>>>
>>>> Mark
>>>>
>>>>
>>>> On Mon, Jun 16, 2025 at 6:40 AM Matthew Knepley <knepley at gmail.com>
>>>> wrote:
>>>>
>>>>> On Sun, Jun 15, 2025 at 9:46 PM hexioafeng <hexiaofeng at buaa.edu.cn>
>>>>> wrote:
>>>>>
>>>>>> Hello,
>>>>>>
>>>>>> Here are the options and outputs:
>>>>>>
>>>>>> options:
>>>>>>
>>>>>> -ksp_type cg -pc_type gamg -pc_gamg_parallel_coarse_grid_solver
>>>>>>  -pc_fieldsplit_detect_saddle_point  -pc_fieldsplit_type schur
>>>>>> -pc_fieldsplit_schur_precondition selfp
>>>>>> -fieldsplit_1_mat_schur_complement_ainv_type lump
>>>>>> -pc_fieldsplit_schur_fact_type full -fieldsplit_0_ksp_type preonly
>>>>>> -fieldsplit_0_pc_type gamg -fieldsplit_0_mg_coarse_pc_type_type svd
>>>>>> -fieldsplit_1_ksp_type preonly -fieldsplit_1_pc_type bjacobi
>>>>>> -fieldsplit_1_sub_pc_type sor -ksp_view  -ksp_monitor_true_residual
>>>>>>  -ksp_converged_reason  -fieldsplit_0_mg_levels_ksp_monitor_true_residual
>>>>>>  -fieldsplit_0_mg_levels_ksp_converged_reason
>>>>>>  -fieldsplit_1_ksp_monitor_true_residual
>>>>>>  -fieldsplit_1_ksp_converged_reason
>>>>>>
>>>>>
>>>>> This option was wrong:
>>>>>
>>>>>   -fieldsplit_0_mg_coarse_pc_type_type svd
>>>>>
>>>>> From the output, we can see that it should have been
>>>>>
>>>>>   -fieldsplit_0_mg_coarse_sub_pc_type_type svd
>>>>>
>>>>>   Thanks,
>>>>>
>>>>>      Matt
>>>>>
>>>>>
>>>>>> output:
>>>>>>
>>>>>> 0 KSP unpreconditioned resid norm 2.777777777778e+01 true resid norm
>>>>>> 2.777777777778e+01 ||r(i)||/||b|| 1.000000000000e+00
>>>>>> Linear solve did not converge due to DIVERGED_PC_FAILED iterations 0
>>>>>>                PC failed due to SUBPC_ERROR
>>>>>> KSP Object: 1 MPI processes
>>>>>>   type: cg
>>>>>>   maximum iterations=200, initial guess is zero
>>>>>>   tolerances:  relative=1e-06, absolute=1e-12, divergence=1e+30
>>>>>>   left preconditioning
>>>>>>   using UNPRECONDITIONED norm type for convergence test
>>>>>> PC Object: 1 MPI processes
>>>>>>   type: gamg
>>>>>>     type is MULTIPLICATIVE, levels=2 cycles=v
>>>>>>       Cycles per PCApply=1
>>>>>>       Using externally compute Galerkin coarse grid matrices
>>>>>>       GAMG specific options
>>>>>>         Threshold for dropping small values in graph on each level =
>>>>>>         Threshold scaling factor for each level not specified = 1.
>>>>>>         AGG specific options
>>>>>>           Symmetric graph false
>>>>>>           Number of levels to square graph 1
>>>>>>           Number smoothing steps 1
>>>>>>         Complexity:    grid = 1.00176
>>>>>>   Coarse grid solver -- level -------------------------------
>>>>>>     KSP Object: (mg_coarse_) 1 MPI processes
>>>>>>       type: preonly
>>>>>>       maximum iterations=10000, initial guess is zero
>>>>>>       tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>>>       left preconditioning
>>>>>>       using NONE norm type for convergence test
>>>>>>     PC Object: (mg_coarse_) 1 MPI processes
>>>>>>       type: bjacobi
>>>>>>         number of blocks = 1
>>>>>>         Local solver is the same for all blocks, as in the following
>>>>>> KSP and PC objects on rank 0:
>>>>>>       KSP Object: (mg_coarse_sub_) 1 MPI processes
>>>>>>         type: preonly
>>>>>>         maximum iterations=1, initial guess is zero
>>>>>>         tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>>>         left preconditioning
>>>>>>         using NONE norm type for convergence test
>>>>>>       PC Object: (mg_coarse_sub_) 1 MPI processes
>>>>>>         type: lu
>>>>>>           out-of-place factorization
>>>>>>           tolerance for zero pivot 2.22045e-14
>>>>>>           using diagonal shift on blocks to prevent zero pivot
>>>>>> [INBLOCKS]
>>>>>>           matrix ordering: nd
>>>>>>           factor fill ratio given 5., needed 1.
>>>>>>             Factored matrix follows:
>>>>>>               Mat Object: 1 MPI processes
>>>>>>                 type: seqaij
>>>>>>                 rows=7, cols=7
>>>>>>                 package used to perform factorization: petsc
>>>>>>                 total: nonzeros=45, allocated nonzeros=45
>>>>>>                   using I-node routines: found 3 nodes, limit used is
>>>>>> 5
>>>>>>         linear system matrix = precond matrix:
>>>>>>         Mat Object: 1 MPI processes
>>>>>>           type: seqaij
>>>>>>           rows=7, cols=7
>>>>>>           total: nonzeros=45, allocated nonzeros=45
>>>>>>           total number of mallocs used during MatSetValues calls=0
>>>>>>             using I-node routines: found 3 nodes, limit used is 5
>>>>>>       linear system matrix = precond matrix:
>>>>>>       Mat Object: 1 MPI processes
>>>>>>         type: mpiaij
>>>>>>         rows=7, cols=7
>>>>>>         total: nonzeros=45, allocated nonzeros=45
>>>>>>         total number of mallocs used during MatSetValues calls=0
>>>>>>           using nonscalable MatPtAP() implementation
>>>>>>           using I-node (on process 0) routines: found 3 nodes, limit
>>>>>> used is 5
>>>>>>   Down solver (pre-smoother) on level 1
>>>>>> -------------------------------
>>>>>>     KSP Object: (mg_levels_1_) 1 MPI processes
>>>>>>       type: chebyshev
>>>>>>         eigenvalue estimates used:  min = 0., max = 0.
>>>>>>         eigenvalues estimate via gmres min 0., max 0.
>>>>>>         eigenvalues estimated using gmres with translations  [0. 0.1;
>>>>>> 0. 1.1]
>>>>>>         KSP Object: (mg_levels_1_esteig_) 1 MPI processes
>>>>>>           type: gmres
>>>>>>             restart=30, using Classical (unmodified) Gram-Schmidt
>>>>>> Orthogonalization with no iterative refinement
>>>>>>             happy breakdown tolerance 1e-30
>>>>>>           maximum iterations=10, initial guess is zero
>>>>>>           tolerances:  relative=1e-12, absolute=1e-50,
>>>>>> divergence=10000.
>>>>>>           left preconditioning
>>>>>>           using PRECONDITIONED norm type for convergence test
>>>>>>         PC Object: (mg_levels_1_) 1 MPI processes
>>>>>>           type: sor
>>>>>>             type = local_symmetric, iterations = 1, local iterations
>>>>>> = 1, omega = 1.
>>>>>>           linear system matrix = precond matrix:
>>>>>>           Mat Object: 1 MPI processes
>>>>>>             type: mpiaij
>>>>>>             rows=624, cols=624
>>>>>>             total: nonzeros=25536, allocated nonzeros=25536
>>>>>>             total number of mallocs used during MatSetValues calls=0
>>>>>>               using I-node (on process 0) routines: found 336 nodes,
>>>>>> limit used is 5
>>>>>>         estimating eigenvalues using noisy right hand side
>>>>>>       maximum iterations=2, nonzero initial guess
>>>>>>       tolerances:  relative=1e-05, absolute=1e-50, divergence=10000.
>>>>>>       left preconditioning
>>>>>>       using NONE norm type for convergence test
>>>>>>     PC Object: (mg_levels_1_) 1 MPI processes
>>>>>>       type: sor
>>>>>>         type = local_symmetric, iterations = 1, local iterations = 1,
>>>>>> omega = 1.
>>>>>>       linear system matrix = precond matrix:
>>>>>>       Mat Object: 1 MPI processes
>>>>>>         type: mpiaij
>>>>>>         rows=624, cols=624
>>>>>>         total: nonzeros=25536, allocated nonzeros=25536
>>>>>>         total number of mallocs used during MatSetValues calls=0
>>>>>>           using I-node (on process 0) routines: found 336 nodes,
>>>>>> limit used is 5
>>>>>>   Up solver (post-smoother) same as down solver (pre-smoother)
>>>>>>   linear system matrix = precond matrix:
>>>>>>   Mat Object: 1 MPI processes
>>>>>>     type: mpiaij
>>>>>>     rows=624, cols=624
>>>>>>     total: nonzeros=25536, allocated nonzeros=25536
>>>>>>     total number of mallocs used during MatSetValues calls=0
>>>>>>       using I-node (on process 0) routines: found 336 nodes, limit
>>>>>> used is 5
>>>>>>
>>>>>>
>>>>>> Best regards,
>>>>>>
>>>>>> Xiaofeng
>>>>>>
>>>>>>
>>>>>> On Jun 14, 2025, at 07:28, Barry Smith <bsmith at petsc.dev> wrote:
>>>>>>
>>>>>>
>>>>>>   Matt,
>>>>>>
>>>>>>    Perhaps we should add options -ksp_monitor_debug and
>>>>>> -snes_monitor_debug that turn on all possible monitoring for the (possibly)
>>>>>> nested solvers and all of their converged reasons also? Note this is not
>>>>>> completely trivial because each preconditioner will have to supply its list
>>>>>> based on the current solver options for it.
>>>>>>
>>>>>>    Then we won't need to constantly list a big string of
>>>>>> problem-specific monitor options for the user to try.
>>>>>>
>>>>>>   Barry
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>> On Jun 13, 2025, at 9:09 AM, Matthew Knepley <knepley at gmail.com>
>>>>>> wrote:
>>>>>>
>>>>>> On Thu, Jun 12, 2025 at 10:55 PM hexioafeng <hexiaofeng at buaa.edu.cn>
>>>>>> wrote:
>>>>>>
>>>>>>> Dear authors,
>>>>>>>
>>>>>>> I tried *-pc_type gamg -pc_gamg_parallel_coarse_grid_solver* and *-pc_type
>>>>>>> fieldsplit -pc_fieldsplit_detect_saddle_point -fieldsplit_0_ksp_type
>>>>>>> preonly -fieldsplit_0_pc_type gamg -fieldsplit_0_mg_coarse_pc_type svd
>>>>>>> -fieldsplit_1_ksp_type preonly -fieldsplit_1_pc_type bjacobi
>>>>>>> -fieldsplit_1_sub_pc_type sor*, and both options got the
>>>>>>> KSP_DIVERGED_PC_FAILED error.
>>>>>>>
>>>>>>
>>>>>> With any question about convergence, we need to see the output of
>>>>>>
>>>>>>   -ksp_view -ksp_monitor_true_residual -ksp_converged_reason
>>>>>> -fieldsplit_0_mg_levels_ksp_monitor_true_residual
>>>>>> -fieldsplit_0_mg_levels_ksp_converged_reason
>>>>>> -fieldsplit_1_ksp_monitor_true_residual -fieldsplit_1_ksp_converged_reason
>>>>>>
>>>>>> and all the error output.
>>>>>>
>>>>>>   Thanks,
>>>>>>
>>>>>>      Matt
>>>>>>
>>>>>>
>>>>>>> Thanks,
>>>>>>>
>>>>>>> Xiaofeng
>>>>>>>
>>>>>>>
>>>>>>> On Jun 12, 2025, at 20:50, Mark Adams <mfadams at lbl.gov> wrote:
>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>> On Thu, Jun 12, 2025 at 8:44 AM Matthew Knepley <knepley at gmail.com>
>>>>>>> wrote:
>>>>>>>
>>>>>>>> On Thu, Jun 12, 2025 at 4:58 AM Mark Adams <mfadams at lbl.gov> wrote:
>>>>>>>>
>>>>>>>>> Adding this to the PETSc mailing list,
>>>>>>>>>
>>>>>>>>> On Thu, Jun 12, 2025 at 3:43 AM hexioafeng <hexiaofeng at buaa.edu.cn>
>>>>>>>>> wrote:
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Dear Professor,
>>>>>>>>>>
>>>>>>>>>> I hope this message finds you well.
>>>>>>>>>>
>>>>>>>>>> I am an employee at a CAE company and a heavy user of the PETSc
>>>>>>>>>> library. I would like to thank you for your contributions to PETSc and
>>>>>>>>>> express my deep appreciation for your work.
>>>>>>>>>>
>>>>>>>>>> Recently, I encountered some difficulties when using PETSc to
>>>>>>>>>> solve structural mechanics problems with Lagrange multiplier constraints.
>>>>>>>>>> After searching extensively online and reviewing several papers, I found
>>>>>>>>>> your previous paper titled "*Algebraic multigrid methods for
>>>>>>>>>> constrained linear systems with applications to contact problems in solid
>>>>>>>>>> mechanics*" seems to be the most relevant and helpful.
>>>>>>>>>>
>>>>>>>>>> The stiffness matrix I'm working with, *K*, is a block
>>>>>>>>>> saddle-point matrix of the form (A00 A01; A10 0), where *A00 is
>>>>>>>>>> singular*—just as described in your paper, unlike in many other
>>>>>>>>>> articles. I have a few questions regarding your work and would
>>>>>>>>>> greatly appreciate your insights:
>>>>>>>>>>
>>>>>>>>>> 1. Is the *AMG/KKT* method presented in your paper available in
>>>>>>>>>> PETSc? I tried using *CG+GAMG* directly but received a
>>>>>>>>>> *KSP_DIVERGED_PC_FAILED* error. I also attempted to use
>>>>>>>>>> *CG+PCFIELDSPLIT* with the following options:
>>>>>>>>>>
>>>>>>>>>
>>>>>>>>> No
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>     -pc_type fieldsplit -pc_fieldsplit_detect_saddle_point
>>>>>>>>>> -pc_fieldsplit_type schur -pc_fieldsplit_schur_precondition selfp
>>>>>>>>>> -pc_fieldsplit_schur_fact_type full -fieldsplit_0_ksp_type preonly
>>>>>>>>>> -fieldsplit_0_pc_type gamg -fieldsplit_1_ksp_type preonly
>>>>>>>>>> -fieldsplit_1_pc_type bjacobi
>>>>>>>>>>
>>>>>>>>>>    Unfortunately, this also resulted in a
>>>>>>>>>> *KSP_DIVERGED_PC_FAILED* error. Do you have any suggestions?
>>>>>>>>>>
>>>>>>>>>> 2. In your paper, you compare the method with *Uzawa*-type
>>>>>>>>>> approaches. To my understanding, Uzawa methods typically require A00 to be
>>>>>>>>>> invertible. How did you handle the singularity of A00 to construct an
>>>>>>>>>> M-matrix that is invertible?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> You add a regularization term like A01 * A10 (like springs). See
>>>>>>>>> the paper or any reference on augmented Lagrangian or Uzawa methods.
>>>>>>>>>
>>>>>>>>>
>>>>>>>>> 3. Can I implement the AMG/KKT method in your paper using existing
>>>>>>>>>>  *AMG APIs*? Implementing a production-level AMG solver from
>>>>>>>>>> scratch would be quite challenging for me, so I’m hoping to utilize
>>>>>>>>>> existing AMG interfaces within PETSc or other packages.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> You can do Uzawa and make the regularization matrix with
>>>>>>>>> matrix-matrix products. Just use AMG for the A00 block.
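A rough sketch of that matrix-matrix-product route, assuming A00, A01, and A10 are already assembled Mat objects, gamma is a user-chosen regularization weight, and error checking is omitted (this is deliberately cruder than the full augmented-Lagrangian construction in the paper):

    Mat         R, A00reg;
    PetscScalar gamma = 1.0;  /* problem-dependent weight, placeholder value */

    /* R = A01 * A10, the "spring"-like regularization term */
    MatMatMult(A01, A10, MAT_INITIAL_MATRIX, PETSC_DEFAULT, &R);

    /* A00reg = A00 + gamma * R, which makes the (0,0) block invertible
       provided the null space of A00 is controlled by the constraints */
    MatDuplicate(A00, MAT_COPY_VALUES, &A00reg);
    MatAXPY(A00reg, gamma, R, DIFFERENT_NONZERO_PATTERN);

    /* A00reg can then be inverted approximately with AMG
       (e.g. -pc_type gamg) inside an Uzawa-type outer iteration */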
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>
>>>>>>>>>> 4. For saddle-point systems where A00 is singular, can you
>>>>>>>>>> recommend any more robust or efficient solutions? Alternatively, are you
>>>>>>>>>> aware of any open-source software packages that can handle such cases
>>>>>>>>>> out-of-the-box?
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>> No, and I don't think PETSc can do this out-of-the-box, but others
>>>>>>>>> may be able to give you a better idea of what PETSc can do.
>>>>>>>>> I think PETSc can do Uzawa or other similar algorithms but it will
>>>>>>>>> not do the regularization automatically (it is a bit more complicated than
>>>>>>>>> just A01 * A10)
>>>>>>>>>
>>>>>>>>
>>>>>>>> One other trick you can use is to have
>>>>>>>>
>>>>>>>>   -fieldsplit_0_mg_coarse_pc_type svd
>>>>>>>>
>>>>>>>> This will use SVD on the coarse grid of GAMG, which can handle the
>>>>>>>> null space in A00 as long as the prolongation does not put it back in. I
>>>>>>>> have used this for the Laplacian with Neumann conditions and for freely
>>>>>>>> floating elastic problems.
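A related aside, as a hedged sketch only: for an elasticity A00 block, GAMG also benefits from being told the rigid-body modes, so the prolongation is built to preserve the (near) null space rather than relying on the coarse solver alone. Assuming coords is a Vec of nodal coordinates with its block size set to the spatial dimension:

    MatNullSpace rbm;
    /* build the rigid-body (translation + rotation) modes from coordinates */
    MatNullSpaceCreateRigidBody(coords, &rbm);
    /* attach them to the displacement block so GAMG keeps them on coarse levels */
    MatSetNearNullSpace(A00, rbm);
    MatNullSpaceDestroy(&rbm);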
>>>>>>>>
>>>>>>>>
>>>>>>> Good point.
>>>>>>> You can also use -pc_gamg_parallel_coarse_grid_solver to get GAMG to
>>>>>>> use a one-level iterative solver for the coarse grid.
>>>>>>>
>>>>>>>
>>>>>>>>   Thanks,
>>>>>>>>
>>>>>>>>      Matt
>>>>>>>>
>>>>>>>>
>>>>>>>>> Thanks,
>>>>>>>>> Mark
>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Thank you very much for taking the time to read my email. Looking
>>>>>>>>>> forward to hearing from you.
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>>> Sincerely,
>>>>>>>>>>
>>>>>>>>>> Xiaofeng He
>>>>>>>>>> -----------------------------------------------------
>>>>>>>>>>
>>>>>>>>>> Research Engineer
>>>>>>>>>>
>>>>>>>>>> Internet Based Engineering, Beijing, China
>>>>>>>>>>
>>>>>>>>>>
>>>>>>>>
>>>>>>>>
>>>>>>>
>>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>>
>>>>>
>>>>>
>>>>
>>>>
>>>
>>>
>>>
>>>
>>
>>
>
>

-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener

https://www.cse.buffalo.edu/~knepley/