Computing Mel-frequency cepstral coefficients on the power spectrum (Paper)

2002, cited by 237

Speech Recognition and Synthesis, Music and Audio Processing, Time Series Analysis and Forecasting

Abstract

We present a method to derive Mel-frequency cepstral coefficients directly from the power spectrum of a speech signal. We show that omitting the filterbank in signal analysis does not affect the word error rate. The presented approach simplifies the speech recognizer's front end by merging subsequent signal analysis steps into a single one. It avoids possible interpolation and discretization problems and results in a compact implementation. We show that frequency warping schemes like vocal tract normalization can be integrated easily into our concept without additional computational effort. Recognition test results obtained with the RWTH large vocabulary speech recognition system are presented for two different corpora: the German VerbMobil II dev99 corpus and the English North American Business News 94 20k development corpus.
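The abstract does not give formulas, so the following is only a minimal sketch of one way to read "directly from the power spectrum". It assumes the cepstral coefficients are obtained as a cosine transform of the log power spectrum, evaluated at the mel-warped positions of the FFT bins, with the bin spacing on the warped axis serving as integration weights. The function names, the mel formula `2595 * log10(1 + f / 700)`, the Hamming window, and the flooring constant are illustrative assumptions, not details taken from the paper.

```python
import numpy as np


def hz_to_mel(f_hz):
    # A commonly used mel-scale formula (assumption; the abstract names none).
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)


def cepstrum_from_power(power, freqs_hz, num_ceps=13, floor=1e-10):
    """Cosine transform of the log power spectrum on a mel-warped axis.

    No triangular filterbank is applied: every FFT bin contributes directly,
    weighted by its spacing on the warped frequency axis.
    """
    log_power = np.log(np.maximum(np.asarray(power, dtype=float), floor))

    # Map bin frequencies to a warped axis spanning [0, pi].
    mel = hz_to_mel(freqs_hz)
    warped = np.pi * mel / mel[-1]

    # Trapezoidal integration weights on the non-uniform warped grid.
    weights = np.gradient(warped)
    weights[0] *= 0.5
    weights[-1] *= 0.5

    k = np.arange(num_ceps)[:, None]        # cepstral index, shape (K, 1)
    basis = np.cos(k * warped[None, :])     # shape (K, number of bins)
    return (basis * weights) @ log_power / np.pi


def mfcc_from_frame(frame, sample_rate, num_ceps=13):
    # Windowed FFT power spectrum of one frame, then the warped transform.
    frame = np.asarray(frame, dtype=float)
    windowed = frame * np.hamming(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    return cepstrum_from_power(power, freqs, num_ceps)
```

Under this reading, a different warping (for example a speaker-dependent vocal tract normalization) would only change how `warped` is computed, which is consistent with the abstract's claim that such schemes add no extra computational steps.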

Authors (4)

Hermann Ney

Ralf Schlüter

Michael Pitz

Sirko Molau
