Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning 事件

Name: Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning
Start: 2026-06-03

PRODUCT_LAUNCH2026-06-03影响: MEDIUM

Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning arXiv:2606.03113v1 Announce Type: new Abstract: Large Language Models suffer from slow autoregressive inference. While self-speculative decoding accelerates this process, its efficiency is hampered by static configurations like fixed exit layers and speculation lengths. We reframe this optimization as a \textbf{Markov Decision Process} and propose \textbf{LEDE}, a framework that uses offline reinforcement learning. LEDE learns

人工智能

关系图谱

Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning 事件

Experience-Driven Dynamic Exits for LLMs with Reinforcement Learning · 相关报道

相关报道