FiLM-Based Speaker Conditioning of a SpeechLLM for Pathological Speech Recognition 事件

PRODUCT_LAUNCH2026-06-05影响: MEDIUM

FiLM-Based Speaker Conditioning of a SpeechLLM for Pathological Speech Recognition arXiv:2606.06211v1 Announce Type: new Abstract: Automatic speech recognition (ASR) has advanced remarkably for standard speech; however, pathological speech from neurological conditions remains a significant challenge. We investigate speaker conditioning via Feature-wise Linear Modulation (FiLM), injecting x-vector-derived information into each transformer layer of a frozen ASR encoder to adapt internal represent