The Microsoft 2017 Conversational Speech Recognition System 论文

2018引用 479

Speech Recognition and SynthesisSpeech and Audio ProcessingNatural Language Processing Techniques

Natural Language Processing Techniques Speech Recognition and Synthesis Speech and Audio Processing

作者

摘要

We describe the latest version of Microsoft's conversational speech recognition system for the Switchboard and CallHome domains. The system adds a CNN-BLSTM acoustic model to the set of model architectures we combined previously, and includes character-based and dialog session aware LSTM language models in rescoring. For system combination we adopt a two-stage approach, whereby acoustic model posteriors are first combined at the senone/frame level, followed by a word-level voting via confusion networks. We also added another language model rescoring step following the confusion network combination. The resulting system yields a 5.1% word error rate on the NIST 2000 Switchboard test set, and 9.8% on the CallHome subset.

作者查看全部 (6)

Andreas Stolcke

Xuedong Huang

Jasha Droppo

F. Alleva

The Microsoft 2017 Conversational Speech Recognition System 论文

摘要

作者查看全部 (6)

相关技术查看全部 (3)

相关事件

相关文章