TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos 文章

ArXiv CS.CV2026-06-02NEWSen作者: Jinpeng Liu, Yukang Xu, Yutong Li, Xingyu Liu

摘要

arXiv:2606.02350v1 Announce Type: new Abstract: Reconstructing humans and their surrounding environments in a globally consistent 4D space is essential for comprehensive perception. However, prior works typically assume single-view inputs or decouple humans, scenes, and cameras, making them unable to recover coherent geometry, stable motion, and physically aligned trajectories. These limitations motivate us to introduce a new task: unified human-scene-camera reconstruction from multi-view videos, which aims to jointly estimate dynamic humans, static scenes, and camera poses in one global coordinate frame. We propose TROPHIES--Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos-a unified framework tailored for this task. TROPHIES features a Human Branch that models humans through temporal and spatial reasoning, and a Scene Branch that reconstructs static geometry with human-aware attention.

TROPHIES: Temporal Reconstruction of Places, Humans, and Cameras from Multi-view Videos 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (3)

相关技术