AttenA+: Rectifying Action Inequality in Robotic Foundation Models 文章

ArXiv CS.AI2026-05-29NEWSen作者: Daojie Peng, Fulong Ma, Jiahang Cao, Qiang Zhang, Xupeng Xie, Jian Guo, Ping Luo, Andrew F. Luo, Boyu Zhou, Jun Ma

查看原文 →

关系图谱

详细信息

来源站点: ArXiv CS.AI
作者: Daojie Peng, Fulong Ma, Jiahang Cao, Qiang Zhang, Xupeng Xie, Jian Guo, Ping Luo, Andrew F. Luo, Boyu Zhou, Jun Ma
文章类型: NEWS
语言: en
发布日期: 2026-05-29

原文

摘要

arXiv:2605.13548v2 Announce Type: replace-cross Abstract: Existing robotic foundation models, while powerful, are predicated on an implicit assumption of temporal homogeneity: treating all actions as equally informative during optimization. This "flat" training paradigm, inherited from language modeling, remains indifferent to the underlying physical hierarchy of manipulation. In reality, robot trajectories are fundamentally heterogeneous, where low-velocity segments often dictate task success through precision-demanding interactions, while high-velocity motions serve as error-tolerant transitions. Such a misalignment between uniform loss weighting and physical criticality fundamentally limits the performance of current Vision-Language-Action (VLA) models and World-Action Models (WAM) in complex, long-horizon tasks. To rectify this, we introduce AttenA+, an architecture-agnostic framework that prioritizes kinematically critical segments via velocity-driven action attention.

AttenA+: Rectifying Action Inequality in Robotic Foundation Models 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品查看全部 (5)

相关技术查看全部 (6)