MASER: Modality-Adaptive Specialist Routing for Embodied 3D Spatial Intelligence 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

MASER: Modality-Adaptive Specialist Routing for Embodied 3D Spatial Intelligence arXiv:2606.02463v1 Announce Type: new Abstract: In 3D environments, Embodied Agents answer spatially relevant questions through reasoning from a mixture of modalities including natural language, RGB images, point clouds, depth maps and camera poses. Existing Vision-Language models (VLMs) are fine-tuned over a single modality. This completely ignores the question semantics which may favor a different modality than t

MASER: Modality-Adaptive Specialist Routing for Embodied 3D Spatial Intelligence · 相关人物

暂无数据