Encoder-Free Human Motion Understanding via Structured Motion Descriptions 文章

ArXiv CS.CV2026-05-28NEWSen作者: Yao Zhang, Zhuchenyang Liu, Thomas Ploetz, Yu Xiao

摘要

arXiv:2604.21668v2 Announce Type: replace Abstract: The world knowledge and reasoning capabilities of text-based large language models (LLMs) are advancing rapidly, yet current approaches to human motion understanding, including motion question answering and captioning, have not fully exploited these capabilities. Existing LLM-based methods typically learn motion-language alignment through dedicated encoders that project motion features into the LLM's embedding space, remaining constrained by cross-modal representation and alignment. Inspired by biomechanical analysis, where joint angles and body-part kinematics have long served as a precise descriptive language for human movement, we propose \textbf{Structured Motion Description (SMD)}, a rule-based, deterministic approach that converts joint position sequences into structured natural language descriptions of joint angles, body part movements, and global trajectory.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据