Approaches to language identification using Gaussian mixture models and shifted delta cepstral features 论文

2002引用 393

Speech Recognition and SynthesisMusic and Audio ProcessingNatural Language Processing Techniques

Natural Language Processing Techniques Speech Recognition and Synthesis Music and Audio Processing

作者

摘要

Published results indicate that automatic language identification (LID) systems that rely on multiple-language phone recognition and n-gram language modeling produce the best performance in formal LID evaluations. By contrast, Gaussian mixture model (GMM) systems, which measure acoustic characteristics, are far more efficient computationally but have tended to provide inferior levels of performance. This paper describes two GMM-based approaches to language identification that use shifted delta cepstra (SDC) feature vectors to achieve LID performance comparable to that of the best phone-based systems. The approaches include both acoustic scoring and a recently developed GMM tokenization system that is based on a variation of phonetic recognition and language modeling. System performance is evaluated on both the CallFriend and OGI corpora. 1.

作者查看全部 (6)

J.R. Deller

Douglas A. Reynolds

Richard Greene

M.A. Kohler

Approaches to language identification using Gaussian mixture models and shifted delta cepstral features 论文

摘要

作者查看全部 (6)

相关技术查看全部 (3)

相关事件

相关文章