[petsc-dev] NVIDIA cuTENSOR library for accelerating tensor operations

Jed Brown jed at jedbrown.org
Sun Apr 7 19:56:19 CDT 2019

There is already a library called cuTensor, but it is not this
"pre-release", which I can't find on the internet.


It might be useful for high order elements if they can support fusing
enough kernels, but is probably of limited utility if each contraction
implies a kernel launch.

"Smith, Barry F. via petsc-dev" <petsc-dev at mcs.anl.gov> writes:

>    Is this relevant for anything upcoming in PETSc?
>     Barry
> From: Timothy Costa tcosta at nvidia.com
> Date: April 02, 2019
> Subject: Pre-Release: NVIDIA cuTENSOR library for accelerating tensor operations
> cuTENSOR is a new library containing highly optimized tensor
> primitives for NVIDIA GPUs. It provides a set of simple, flexible APIs
> for elementwise tensor operations and tensor contractions. cuTENSOR's
> expressive API allows for elementwise operation fusion and exposes
> several tensor contraction algorithms. cuTENSOR is now available in an
> apply-for-access pre-release at developer.nvidia.com/cutensor. Apply
> today!

More information about the petsc-dev mailing list