Unlocking Proactivity in Task-Oriented Dialogue
Event: PRODUCT_LAUNCH
Date: 2026-06-04
Impact: MEDIUM
arXiv:2605.22240v2 Announce Type: replace
Abstract: Proactive task-oriented dialogue (TOD), such as outbound sales, demands a persuasive agent that actively probes the user's concerns and steers the conversation toward acceptance within a bounded number of turns. Yet post-trained LLMs are inherently conservative, and reward-shaping RL (e.g., GRPO) struggles since it only re-weights what an already passive policy samples. We show that conditioning