SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search 事件

REGULATION2026-05-29影响: MEDIUM

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search arXiv:2605.29796v1 Announce Type: cross Abstract: Agentic search enables LLMs to solve complex multi-hop questions through iterative reasoning and external search. Despite the effectiveness, these systems often suffer from a critical limitation in practice: agents fail to recognize their own knowledge boundaries, blindly triggering searches when internal knowledge suffices and failing to terminate search even w

SAAS: Self-Aware Reinforcement Learning for Over-Search Mitigation in Agentic Search · 相关公司

A
arXivNONPROFIT
A
ARENEGOVERNMENT
F
FrameworkCOMPANY
E
EARNNONPROFIT
A
AnisNONPROFIT
I
IterRESEARCH_INSTITUTE
A
ACTNONPROFIT
S
SearchNONPROFIT
I
iterativeCOMPANY