SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search · Event

PRODUCT_LAUNCH · 2026-05-29 · Impact: MEDIUM

arXiv:2605.29796v1. Announce Type: cross. Abstract: Agentic search enables LLMs to solve complex multi-hop questions through iterative reasoning and external search. Despite their effectiveness, these systems often suffer from a critical limitation in practice: agents fail to recognize their own knowledge boundaries, blindly triggering searches when internal knowledge suffices and failing to terminate search even w

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search · Related coverage