[petsc-users] matsetvaluesblocked4_
Matthew Knepley
knepley at gmail.com
Wed May 27 18:36:50 CDT 2020
On Wed, May 27, 2020 at 7:34 PM Jed Brown <jed at jedbrown.org> wrote:
> Mark Adams <mfadams at lbl.gov> writes:
>
> > Nvidias's NSight with 2D Q3 and bs=10. (attached).
>
> Thanks; this is basically the same as a CPU -- the cost is searching the
> sorted rows for the next entry. I've long thought we should optimize
> the implementations to fast-path when the next column index in the
> sparse matrix equals the next index in the provided block. It'd just
> take a good CPU test to demonstrate that payoff.
>
So you first check whether the next index is the one in the set passed in,
and otherwise
fall back on the search? Good idea.
Matt
--
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener
https://www.cse.buffalo.edu/~knepley/ <http://www.cse.buffalo.edu/~knepley/>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20200527/2b5d7b29/attachment.html>
More information about the petsc-users
mailing list