SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search 事件

Name: SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search
Start: 2026-05-29

REGULATION2026-05-29影响: MEDIUM

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search arXiv:2605.29796v1 Announce Type: cross Abstract: Agentic search enables LLMs to solve complex multi-hop questions through iterative reasoning and external search. Despite the effectiveness, these systems often suffer from a critical limitation in practice: agents fail to recognize their own knowledge boundaries, blindly triggering searches when internal knowledge suffices and failing to terminate search even w

人工智能

关系图谱

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search · 相关公司

arXivNONPROFIT

ARENEGOVERNMENT

FrameworkCOMPANY

EARNNONPROFIT

AnisNONPROFIT

IterRESEARCH_INSTITUTE

ACTNONPROFIT

SearchNONPROFIT

UBS

iterativeCOMPANY