DeMaVLA: A Vision-Language-Action Foundation Model for Generalizable Deformable Manipulation
Event | ACQUISITION | 2026-06-01 | Impact: HIGH
arXiv:2605.31286v1 | Announce Type: cross
Abstract: Real-world household robots require Vision-Language-Action (VLA) foundation models that can acquire reusable manipulation skills across diverse objects, task conditions, and household environments. Deformable-object folding is a representative challenge, requiring robots to handle clothing items from random initial states across varying categories, geome