EMMA: Extracting Multiple physical parameters from Multimodal Data 文章

ArXiv CS.CV2026-05-26NEWSen作者: Farhat Shaikh, Ayan Banerjee, Sandeep Gupta

摘要

arXiv:2605.24047v1 Announce Type: new Abstract: We introduce EMMA, a physics-informed multimodal framework that recovers all identifiable dynamical parameters of a system directly from raw video, audio, and image-based time-series observations. Unlike prior video-only approaches that struggle with occluded states, hidden actuation inputs, or assumptions about known initial conditions and coordinate frames, EMMA performs joint inference of explicit parameters, implicit dynamical components, and calibration invariants within a unified continuous-time model. EMMA leverages a Liquid Time-Constant (LTC) network to learn latent dynamics from heterogeneous modalities while a physics-constrained loss enforces consistency with the governing differential equations.

相关公司

暂无数据

相关人物

暂无数据