[petsc-dev] [GPU] Performance on Fermi
Barry Smith
bsmith at mcs.anl.gov
Fri Aug 27 14:14:40 CDT 2010
PETSc-dev folks,
Please prepend all messages to petsc-dev that involve GPUs with [GPU] so they can be easily filtered.
Keita,
To run src/ksp/ksp/examples/tutorials/ex2.c with CUDA you need the flag -vec_type cuda
Note also that this example is fine for simple ONE processor tests but should not be used for parallel testing because it does not do a proper parallel partitioning for performance
Barry
On Aug 27, 2010, at 2:04 PM, Keita Teranishi wrote:
> Hi,
>
> I ran ex2.c with a matrix from 512x512 grid.
> I set CG and Jacobi for the solver and preconditioner.
> GCC-4.4.4 and CUDA-3.1 are used to compile the code.
> BLAS and LAPAKCK are not optimized.
>
> MatMult
> Fermi: 1142 MFlops
> 1 core Istanbul: 420 MFlops
>
> KSPSolve:
> Fermi: 1.5 Sec
> 1 core Istanbul: 1.7 Sec
>
>
> ================================
> Keita Teranishi
> Scientific Library Group
> Cray, Inc.
> keita at cray.com
> ================================
>
>
> -----Original Message-----
> From: petsc-dev-bounces at mcs.anl.gov [mailto:petsc-dev-bounces at mcs.anl.gov] On Behalf Of Satish Balay
> Sent: Friday, August 27, 2010 1:49 PM
> To: For users of the development version of PETSc
> Subject: Re: [petsc-dev] Problem with petsc-dev
>
> On Fri, 27 Aug 2010, Satish Balay wrote:
>
>> There was a problem with tarball creation for the past few days. Will
>> try to respin manually today - and update you.
>
> the petsc-dev tarball is now updated on the website..
>
> Satish
More information about the petsc-dev
mailing list