[petsc-users] Obtaining bytes per second

Matthew Knepley knepley at gmail.com
Thu May 7 13:28:28 CDT 2015


On Thu, May 7, 2015 at 1:25 PM, Jed Brown <jed at jedbrown.org> wrote:

> Matthew Knepley <knepley at gmail.com> writes:
>
> > On Thu, May 7, 2015 at 9:23 AM, Justin Chang <jychang48 at gmail.com>
> wrote:
> >
> >> So to summarize, if I understand everything, I should do the following:
> >>
> >> 1) calculate the flop/byte ratio for various problem sizes and solver
> >> methods on one process and:
>
> It is impossible to define "arithmetic intensity" (flops/byte) without
> selecting a cache model.  And my "cache model", I include register
> reuse.  Assuming no reuse at all is not even close to representative
> (off by orders of magnitude).


You are making assumptions about the algorithm here. Pure streaming
computations,
like VecAXPY do not depend on the cache model.

  Matt


>
> > Should be roughly invariant to problem size.
>
> Except (a) insofar as it depends on cache sizes and (b) the algorithm
> characteristics depend on data size (e.g., the cost of supernode
> factorization for sparse direct solvers scales superlinearly, so
> arithmetic intensity will drift as you increase the problem size).
>
>
> You can see how we compare "pessimal" versus "perfect" caching here:
>
> http://59A2.org/na/MayBrownLePourhiet-pTatin3d-2014.pdf
>



-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20150507/569d0fc7/attachment.html>


More information about the petsc-users mailing list