[petsc-dev] ts_monitor_solution_vtk does not work when used	with DMComposite
    Matthew Knepley 
    knepley at gmail.com
       
    Fri Feb  7 16:48:15 CST 2014
    
    
  
On Fri, Feb 7, 2014 at 4:40 PM, Jed Brown <jed at jedbrown.org> wrote:
> Mani Chandra <mc0710 at gmail.com> writes:
>
> > I have a heavy residual evaluation function which I cannot seem to
> > vectorize at all, so then I tried changing the global data layout putting
> > each variable in a contiguous chunk and it finally does vectorize.
>
> I suggest locally extracting from interlaced to fill vector registers at
> the cell level (e.g., group of a 4 cells).  The memory performance
> (number of streams and cache reuse) is better with interlaced and if you
> have enough flops that local vectorization matters, packing will be
> cheap.
>
This is EXACTLY what you should do, and what PetscFE does now.
   Matt
-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20140207/2cde54cd/attachment.html>
    
    
More information about the petsc-dev
mailing list