[petsc-dev] Kokkos/Crusher perforance

Mark Adams mfadams at lbl.gov
Tue Jan 25 11:53:50 CST 2022


BTW, a -device_view would be great.

On Tue, Jan 25, 2022 at 12:30 PM Mark Adams <mfadams at lbl.gov> wrote:

>
>
> On Tue, Jan 25, 2022 at 11:56 AM Jed Brown <jed at jedbrown.org> wrote:
>
>> Barry Smith <bsmith at petsc.dev> writes:
>>
>> >   Thanks Mark, far more interesting. I've improved the formatting to
>> make it easier to read (and fixed width font for email reading)
>> >
>> >   * Can you do same run with say 10 iterations of Jacobi PC?
>> >
>> >   * PCApply performance (looks like GAMG) is terrible! Problems too
>> small?
>>
>> This is -pc_type jacobi.
>>
>> >   * VecScatter time is completely dominated by SFPack! Junchao what's
>> up with that? Lots of little kernels in the PCApply? PCJACOBI run will help
>> clarify where that is coming from.
>>
>> It's all in MatMult.
>>
>> I'd like to see a run that doesn't wait for the GPU.
>>
>>
> Not sure what you mean. Can I do that?
>
>
-------------- next part --------------
An HTML attachment was scrubbed...
URL: <http://lists.mcs.anl.gov/pipermail/petsc-dev/attachments/20220125/307fe67f/attachment.html>


More information about the petsc-dev mailing list