Dual-Anchoring: Addressing State Drift in Vision-Language Navigation · Event

PRODUCT_LAUNCH · 2026-06-02 · Impact: MEDIUM

arXiv:2604.17473v3. Announce Type: replace. Abstract: Vision-Language Navigation (VLN) requires an agent to navigate through 3D environments by following natural language instructions. Although recent Video Large Language Models (Video-LLMs) have substantially advanced VLN, they remain highly susceptible to State Drift in long scenarios. In these cases, the agent's internal state drifts away from the true task execution state, leading to a
