Parallel Computing Experiences with CUDA 论文

2008IEEE Micro引用 537
Parallel Computing and Optimization TechniquesAlgorithms and Data CompressionLow-power high-performance VLSI design

摘要

The CUDA programming model provides a straightforward means of describing inherently parallel computations, and NVIDIA's Tesla GPU architecture delivers high computational throughput on massively parallel problems. This article surveys experiences gained in applying CUDA to a diverse set of problems and the parallel speedups over sequential codes running on traditional CPU architectures attained by executing key computations on the GPU.