Uni-LaViRA: Language-Vision-Robot Actions Translation for Unified Embodied Navigation 事件
PRODUCT_LAUNCH2026-05-28影响: MEDIUM
Uni-LaViRA: Language-Vision-Robot Actions Translation for Unified Embodied Navigation arXiv:2605.27582v1 Announce Type: cross Abstract: Embodied navigation requires an agent to map language and visual observations to a stream of spatial actions that drive a real robot through environments it has never seen. The dominant approach has been to scale vision-language-action (VLA) foundation models on ever-larger collections of robot trajectories. This paper argues that, for navigation specifically,