Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Do Multimodal Agents Really Benefit from Tool Use? A Systematic Study of Capability Gains arXiv:2606.02357v1 Announce Type: new Abstract: Tool-augmented multimodal agents show strong benchmark gains, often taken as evidence that agents have learned to use tools. We argue that this interpretation can be premature: a tool-call trace alone does not show whether the tool supplied answer-critical information. We study two representative ``thinking with images'' agents, Thyme and DeepEyesV2, across r