Brick-Composer: Using MLLMs for Assembly with Diverse Bricks · Event

BREAKTHROUGH · 2026-06-06 · Impact: HIGH

arXiv:2606.05445v1. Announce Type: new. Abstract: We dream of AI agents that can read arbitrary designs and construct real-world objects from reusable building blocks. As a first step toward this vision, we study whether multimodal large language models (MLLMs) possess the visual grounding and spatial reasoning capabilities required for brick assembly. We formulate brick assembly as a sequential decision-making problem, where each step

Brick-Composer: Using MLLMs for Assembly with Diverse Bricks · Related Technologies