Knowing When to Ask: Segment-Level Credit Assignment for LLM Tool Use 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
Knowing When to Ask: Segment-Level Credit Assignment for LLM Tool Use arXiv:2605.27788v1 Announce Type: cross Abstract: Humans know when to reach for help e.g. $347 \times 28$ warrants a calculator while $2+2$ does not. Language models do not. Prompt-based approaches can instruct a model when to invoke tools, but this scaffolding does not teach it to recognize the boundary of its own knowledge. RL approaches that assign a single outcome reward to the whole trajectory fare no better: trajectory-