A mixed excitation LPC vocoder model for low bit rate speech coding 论文

1995IEEE Transactions on Speech and Audio Processing引用 312

Speech and Audio ProcessingSpeech Recognition and SynthesisAdvanced Data Compression Techniques

Speech Recognition and Synthesis Speech and Audio Processing Advanced Data Compression Techniques

作者

摘要

Traditional pitch-excited linear predictive coding (LPC) vocoders use a fully parametric model to efficiently encode the important information in human speech. These vocoders can produce intelligible speech at low data rates (800-2400 b/s), but they often sound synthetic and generate annoying artifacts such as buzzes, thumps, and tonal noises. These problems increase dramatically if acoustic background noise is present at the speech input. This paper presents a new mixed excitation LPC vocoder model that preserves the low bit rate of a fully parametric model but adds more free parameters to the excitation signal so that the synthesizer can mimic more characteristics of natural human speech. The new model also eliminates the traditional requirement for a binary voicing decision so that the vocoder performs well even in the presence of acoustic background noise. A 2400-b/s LPC vocoder based on this model has been developed and implemented in simulations and in a real-time system. Formal subjective testing of this coder confirms that it produces natural sounding speech even in a difficult noise environment. In fact, diagnostic acceptability measure (DAM) test scores show that the performance of the 2400-b/s mixed excitation LPC vocoder is close to that of the government standard 4800-b/s CELP coder.< <ETX xmlns:mml="http://www.w3.org/1998/Math/MathML" xmlns:xlink="http://www.w3.org/1999/xlink">></ETX>

作者查看全部 (2)

T.P. Barnwell

Alan McCree

A mixed excitation LPC vocoder model for low bit rate speech coding 论文

详细信息

摘要

作者查看全部 (2)

相关技术查看全部 (3)

相关事件

相关文章