VISTA: Vision-Grounded and Physics-Validated Adaptation of UMI data for VLA Training 事件

PRODUCT_LAUNCH2026-06-04影响: MEDIUM

VISTA: Vision-Grounded and Physics-Validated Adaptation of UMI data for VLA Training arXiv:2606.04708v1 Announce Type: cross Abstract: Universal Manipulation Interface (UMI) enables scalable real-world robot data collection without hardware-specific teleoperation, yet leveraging UMI data to train large-scale Vision-Language-Action (VLA) models remains fundamentally challenging. We identify two critical mismatches: wrist-mounted fisheye views, with severe radial distortion and local gripper-cent

VISTA: Vision-Grounded and Physics-Validated Adaptation of UMI data for VLA Training · 相关报道