[petsc-users] default orthogonalization in gmres
Karl Rupp
rupp at mcs.anl.gov
Mon Jul 15 17:49:36 CDT 2013
Hey,
>> However, for Gram-Schmidt you can just compute all the
>> necessary scalar products at the same time (VecMDot) and reuse the
>> common data vector. This gives you a speed-up of a factor of almost two.
>
> It's not a factor of 2, it's a factor of k where k is the size of the
> subspace. Classical Gram-Schmidt needs one reduction per iteration
> (normalization can be hidden), but modified needs k reductions.
well, you see the factor of k only if the communication for the
reduction is the bottleneck. The factor of almost 2 is what you get if
memory bandwidth is the bottleneck.
Best regards,
Karli
More information about the petsc-users
mailing list