Phonetic Error Analysis of Raw Waveform Acoustic Models (Event)

PRODUCT_LAUNCH | 2026-06-08 | Impact: MEDIUM

arXiv:2606.07030v1 Announce Type: cross

Abstract: We analyse error patterns of raw waveform acoustic models on TIMIT phone recognition beyond the overall phone error rate (PER). PER is decomposed across three broad phonetic class (BPC) categorisations, and confusion matrices are constructed from substitution errors. Our models combine parametric (SincNet, Sinc2Net) or non-parametric CNNs with Bidirectional LSTMs, achieving 13.9%/15.3% PER
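
The kind of analysis the abstract describes (PER split by broad phonetic class, plus a confusion table built from substitution errors) can be illustrated with a minimal sketch. Everything here is an assumption for illustration and not the paper's method: the `BPC` map is a small hypothetical grouping rather than any of the three categorisations used in the paper, the `align` and `analyse` helpers are invented names, ties between equally cheap alignments are broken arbitrarily, and insertions are reported separately because they have no reference phone to assign to a class.

```python
from collections import Counter

# Hypothetical, illustrative broad-phonetic-class map (NOT the paper's categorisation).
BPC = {
    "p": "stop", "t": "stop", "k": "stop", "b": "stop", "d": "stop", "g": "stop",
    "s": "fricative", "z": "fricative", "f": "fricative", "v": "fricative",
    "m": "nasal", "n": "nasal", "ng": "nasal",
    "iy": "vowel", "ih": "vowel", "eh": "vowel", "ae": "vowel", "aa": "vowel",
}


def align(ref, hyp):
    """Levenshtein-align two phone lists and return (ref_phone, hyp_phone) pairs.

    None marks a gap: (r, None) is a deletion, (None, h) is an insertion.
    """
    n, m = len(ref), len(hyp)
    cost = [[0] * (m + 1) for _ in range(n + 1)]
    for i in range(1, n + 1):
        cost[i][0] = i
    for j in range(1, m + 1):
        cost[0][j] = j
    for i in range(1, n + 1):
        for j in range(1, m + 1):
            diag = cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1])
            cost[i][j] = min(diag, cost[i - 1][j] + 1, cost[i][j - 1] + 1)

    # Backtrace from the bottom-right corner, preferring match/substitution.
    pairs, i, j = [], n, m
    while i > 0 or j > 0:
        if i > 0 and j > 0 and cost[i][j] == cost[i - 1][j - 1] + (ref[i - 1] != hyp[j - 1]):
            pairs.append((ref[i - 1], hyp[j - 1]))
            i, j = i - 1, j - 1
        elif i > 0 and cost[i][j] == cost[i - 1][j] + 1:
            pairs.append((ref[i - 1], None))
            i -= 1
        else:
            pairs.append((None, hyp[j - 1]))
            j -= 1
    pairs.reverse()
    return pairs


def analyse(ref, hyp, bpc=BPC):
    """Return (PER, per-class PER contribution, insertion rate, substitution counts)."""
    if not ref:
        raise ValueError("reference sequence must not be empty")
    confusions = Counter()       # (ref_phone, hyp_phone) -> substitution count
    errors_by_class = Counter()  # class of the reference phone -> subs + deletions
    insertions = 0
    for r, h in align(ref, hyp):
        if r is None:
            insertions += 1
        elif h is None or r != h:
            errors_by_class[bpc.get(r, "other")] += 1
            if h is not None:
                confusions[(r, h)] += 1
    total = sum(errors_by_class.values()) + insertions
    per_by_class = {c: e / len(ref) for c, e in errors_by_class.items()}
    return total / len(ref), per_by_class, insertions / len(ref), confusions


per, per_by_class, ins_rate, confusions = analyse(
    ["s", "ih", "t", "ae", "n"], ["z", "ih", "t", "ae"]
)
print(per, per_by_class, ins_rate, dict(confusions))
```

In this sketch the per-class contributions plus the insertion rate sum to the overall PER, so the decomposition is exact by construction. A real evaluation would typically accumulate the counts over the whole test set before dividing, and might normalise each class by its own reference count instead.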