A GPGPU compiler for memory optimization and parallelism management 论文
2010引用 275
Parallel Computing and Optimization TechniquesEmbedded Systems Design TechniquesAdvanced Data Storage Technologies
摘要
This paper presents a novel optimizing compiler for general purpose computation on graphics processing units (GPGPU). It addresses two major challenges of developing high performance GPGPU programs: effective utilization of GPU memory hierarchy and judicious management of parallelism.