Head-Pose-Aware Visual Speech Recognition with FiLM Modulation 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation arXiv:2606.00751v1 Announce Type: new Abstract: Visual Speech Recognition (VSR) aims to recognize speech from visual cues such as lip movements, but its performance is fundamentally limited by viseme ambiguity and pose-induced variations that introduce geometric distortions and occlusions. Existing approaches mainly rely on linguistic context or implicit invariance, leaving visual representations insufficiently robust under non-fron

Head-Pose-Aware Visual Speech Recognition with FiLM Modulation · 相关报道