MMSkills: Towards Multimodal Skills for General Visual Agents (Event)

Name: MMSkills: Towards Multimodal Skills for General Visual Agents
Start: 2026-06-02

SHUTDOWN 2026-06-02 Impact: LOW

arXiv:2605.13527v3 Announce Type: replace

Abstract: Reusable skills have become a core substrate for improving agent capabilities, yet most existing skill packages encode reusable behavior primarily as textual prompts, executable code, or learned routines. For visual agents, however, procedural knowledge is inherently multimodal: reuse depends not only on what operation to perform, but also on recognizing the relevant state, interpre

Artificial Intelligence
