[petsc-users] GPU speedup in Poisson solvers
Chung Shen
chung1shen at yahoo.com
Mon Sep 22 13:57:07 CDT 2014
Dear PETSc Users,
I am new to PETSc and trying to determine if GPU speedup is possible with the 3D Poisson solvers. I configured 2 copies of 'petsc-master' on a standalone machine, one with CUDA toolkit 5.0 and one without (both without MPI):
Machine: HP Z820 Workstation, Redhat Enterprise Linux 5.0
CPU: (x2) 8-core Xeon E5-2650 2.0GHz, 128GB Memory
GPU: (x2) Tesla K20c (706MHz, 5.12GB Memory, Cuda Compatibility: 3.5, Driver: 313.09)
I used 'src/ksp/ksp/examples/tests/ex32.c' as a test and was getting about 20% speedup with GPU. Is this reasonable or did I miss something?
Attached is a comparison chart with two sample logs. The y-axis is the elapsed time in seconds and the x-axis corresponds to the size of the problem. In particular, I wonder if the numbers of calls to 'VecCUSPCopyTo' and 'VecCUSPCopyFrom' shown in the GPU log are excessive?
Thanks in advance for your reply.
Best Regards,
Chung Shen
-------------- next part --------------
A non-text attachment was scrubbed...
Name: chart.jpg
Type: image/pjpeg
Size: 122316 bytes
Desc: not available
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20140922/26a53312/attachment-0001.bin>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: ex32-m150-cpu.log
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20140922/26a53312/attachment-0002.ksh>
-------------- next part --------------
An embedded and charset-unspecified text was scrubbed...
Name: ex32-m150-gpu.log
URL: <http://lists.mcs.anl.gov/pipermail/petsc-users/attachments/20140922/26a53312/attachment-0003.ksh>
More information about the petsc-users
mailing list