DRTriton: Large-Scale Synthetic Data Driven Reinforcement Learning for Triton Kernel Generation 文章

ArXiv CS.CL2026-05-28NEWSen作者: Siqi Guo, Ming Lin, Tianbao Yang

摘要

arXiv:2603.21465v2 Announce Type: replace Abstract: Developing efficient CUDA kernels is a fundamental yet challenging task in the generative AI industry. Recent research leverages Large Language Models (LLMs) to automatically convert PyTorch reference implementations to CUDA kernels, significantly reducing engineering effort. State-of-the-art LLMs, such as GPT-5.2 and Claude-Sonnet-4.5, still struggle with this task. To address this challenge, we propose DRTriton, a scalable learning framework for training LLMs to convert PyTorch programs into highly optimized Triton kernels, which are then compiled to CUDA kernels at runtime. DRTriton consists of three key components: (i) a data synthetic algorithm CSP-DAG that guarantees full coverage and unbiased uniform sampling over the operator space with controlled difficulty; (ii) a curriculum RL framework with decoupled rewards that jointly optimizes conversion success rate and execution speed;

DRTriton: Large-Scale Synthetic Data Driven Reinforcement Learning for Triton Kernel Generation 文章

摘要

相关事件查看全部 (2)

相关公司

相关人物

相关产品查看全部 (14)

相关技术查看全部 (4)