Dual-Pathway Geometry-Aware MLLM for Spatial Intelligence (Event)

PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2605.25334v1. Announce Type: new.

Abstract: Spatial understanding of the physical world from 2D visual inputs hinges on two complementary forms of geometric knowledge: holistic 3D structural perception and fine-grained metric scale estimation. Existing multimodal large language models (MLLMs) typically address only one facet, ingesting either depth maps or point clouds as additional model inputs, which incurs substantial computationa