Head-Pose-Aware Visual Speech Recognition with FiLM Modulation 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
Head-Pose-Aware Visual Speech Recognition with FiLM Modulation arXiv:2606.00751v1 Announce Type: new Abstract: Visual Speech Recognition (VSR) aims to recognize speech from visual cues such as lip movements, but its performance is fundamentally limited by viseme ambiguity and pose-induced variations that introduce geometric distortions and occlusions. Existing approaches mainly rely on linguistic context or implicit invariance, leaving visual representations insufficiently robust under non-fron
Head-Pose-Aware Visual Speech Recognition with FiLM Modulation · 相关报道
相关报道
Head-Pose-Aware Visual Speech Recognition with FiLM Modulation
ArXiv CS.CV2026-06-02