dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching 事件

Name: dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching
Start: 2026-06-03

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching arXiv:2506.06295v2 Announce Type: replace-cross Abstract: Autoregressive Models (ARMs) have long dominated the landscape of Large Language Models. Recently, a new paradigm has emerged in the form of diffusion-based Large Language Models (dLLMs), which generate text by iteratively denoising masked segments. This approach has shown significant advantages and potential. However, dLLMs suffer from high inference latency.

人工智能

关系图谱

dLLM-Cache: Accelerating Diffusion Large Language Models with Adaptive Caching 事件

相关公司查看全部 (10)

相关人物查看全部 (2)

相关产品查看全部 (10)

相关技术查看全部 (10)

相关报道查看全部 (1)