[petsc-dev] Supporting OpenCL matrix assembly

Matthew Knepley knepley at gmail.com
Tue Sep 24 10:13:52 CDT 2013


On Tue, Sep 24, 2013 at 8:11 AM, Karl Rupp <rupp at mcs.anl.gov> wrote:

> Hi Matt,
>
>
>> Here I believe strongly that we need tests. Nathan assured me that
>> nothing is faster on the GPU than sort+reduce-by-key since
>> they are highly optimized. I think they will be hard to beat, and the
>> initial timings I had say that this is the case. I am willing to be
>> wrong, but I am not willing to overengineer based on supposition.
>>
>
> Fair enough. Is a brute-force implementation for P1 elements sufficient as
> a baseline for discussion?
>

src/ksp/ksp/examples/tutorials/ex4.c
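
For readers following the thread, here is a minimal CPU sketch of the sort+reduce-by-key assembly idea quoted above. It is not taken from ex4.c or from any GPU library. The function names `assemble_coo` and `p1_element_entries` are hypothetical, and the 1D P1 stiffness matrix on a uniform mesh is an assumption chosen only to keep the example short. A GPU version would replace the two steps with the device library's sort-by-key and reduce-by-key primitives.

```python
from itertools import groupby
from operator import itemgetter


def assemble_coo(entries):
    """Sum duplicate (row, col) contributions via sort + reduce-by-key.

    entries: iterable of (row, col, value) triples, one per element-matrix
    entry; duplicates occur wherever elements share vertices.
    Returns unique (row, col, value) triples sorted by (row, col).
    """
    # Step 1: sort by the (row, col) key so duplicates become adjacent.
    ordered = sorted(entries, key=itemgetter(0, 1))
    # Step 2: reduce each run of equal keys to a single summed value.
    return [
        (row, col, sum(v for _, _, v in run))
        for (row, col), run in groupby(ordered, key=itemgetter(0, 1))
    ]


def p1_element_entries(cells, h=1.0):
    """Emit unassembled stiffness entries for 1D P1 (linear) elements.

    Hypothetical helper: cells is a list of (i, j) vertex index pairs and
    h is an assumed uniform cell length. The local stiffness matrix of a
    1D P1 element is (1/h) * [[1, -1], [-1, 1]].
    """
    k = 1.0 / h
    for i, j in cells:
        yield (i, i, k)
        yield (i, j, -k)
        yield (j, i, -k)
        yield (j, j, k)


if __name__ == "__main__":
    # Two cells sharing vertex 1: entry (1, 1) receives two contributions.
    for triple in assemble_coo(p1_element_entries([(0, 1), (1, 2)])):
        print(triple)
```

The brute-force generator emits every element contribution without checking for duplicates, and the summation happens only after sorting. That is the property that makes the approach a candidate for the GPU: neither step needs atomics or a precomputed sparsity pattern.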

   Matt


> Best regards,
> Karli
>



-- 
What most experimenters take for granted before they begin their
experiments is infinitely more interesting than any results to which their
experiments lead.
-- Norbert Wiener