DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning 文章

ArXiv CS.AI2026-05-26NEWSen作者: Xirui Liu, Sihang Zhou, Yanning Hou, Rong Zhou, Haoyuan Chen, Maolin He, Siwei Wang, Hao Chen, Jian Huang

查看原文 →

关系图谱

摘要

arXiv:2605.23939v1 Announce Type: new Abstract: Web agents require both high-level reasoning (for task decomposition) and low-level interactions (for page elements manipulation) to conduct different tasks. However, these knowledge types differ fundamentally: reasoning knowledge (e.g., booking a flight requires first searching for routes) is abstract and transferable across websites, while interaction knowledge (e.g., clicking the Search button at a specific coordinate on Site A) depends heavily on page-specific contexts. Existing methods store experiences uniformly. This creates a dilemma: abstract representations lose executability on concrete pages, while concrete representations fail to generalize across domains. This entanglement limits capability accumulation: on new websites, agents either fail to recognize reusable task logic due to surface-level differences or attempt infeasible actions from outdated page structures.

DRIVE: Modeling Skills at the Reasoning and Interaction Levels for Web Agents under Continual Learning 文章

摘要

相关事件查看全部 (2)

相关公司查看全部 (4)

相关人物

相关产品查看全部 (14)

相关技术查看全部 (19)