EMMA: Extracting Multiple physical parameters from Multimodal Data 文章

ArXiv CS.CV2026-05-26NEWSen作者: Farhat Shaikh, Ayan Banerjee, Sandeep Gupta

摘要

arXiv:2605.24047v1 Announce Type: new Abstract: We introduce EMMA, a physics-informed multimodal framework that recovers all identifiable dynamical parameters of a system directly from raw video, audio, and image-based time-series observations. Unlike prior video-only approaches that struggle with occluded states, hidden actuation inputs, or assumptions about known initial conditions and coordinate frames, EMMA performs joint inference of explicit parameters, implicit dynamical components, and calibration invariants within a unified continuous-time model. EMMA leverages a Liquid Time-Constant (LTC) network to learn latent dynamics from heterogeneous modalities while a physics-constrained loss enforces consistency with the governing differential equations.

EMMA: Extracting Multiple physical parameters from Multimodal Data 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (1)

相关技术查看全部 (1)