[petsc-dev] ts_monitor_solution_vtk does not work when used with DMComposite
Jed Brown
jed at jedbrown.org
Fri Feb 7 16:40:27 CST 2014
Mani Chandra <mc0710 at gmail.com> writes:
> I have a heavy residual evaluation function which I cannot seem to
> vectorize at all, so then I tried changing the global data layout putting
> each variable in a contiguous chunk and it finally does vectorize.
I suggest locally extracting from interlaced to fill vector registers at
the cell level (e.g., group of a 4 cells). The memory performance
(number of streams and cache reuse) is better with interlaced and if you
have enough flops that local vectorization matters, packing will be
cheap.
-------------- next part --------------
A non-text attachment was scrubbed...
Name: not available
Type: application/pgp-signature
Size: 835 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20140207/1b56f4b2/attachment.sig>
More information about the petsc-dev
mailing list