Comparative Evaluation of Various MFCC Implementations on the Speaker Verification Task (Paper)

2007 · Cited by 306
Speech Recognition and Synthesis · Speech and Audio Processing · Music and Audio Processing

Abstract

Without claiming to be exhaustive, this paper reviews the most popular MFCC (Mel Frequency Cepstral Coefficients) implementations. These differ mainly in the particular approximation of nonlinear human pitch perception, the filter bank design, and the compression of the filter bank output. The reviewed implementations are then compared on the task of text-independent speaker verification, using the well-known 2001 NIST SRE (speaker recognition evaluation) one-speaker detection database.
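To make the three points of variation concrete, the following is a minimal sketch of a single-frame MFCC computation in NumPy. It is not any of the implementations reviewed in the paper. The mel formula used (2595 log10(1 + f/700)) is one commonly cited approximation and is an assumption here, as are the triangular filter shape, the default of 24 filters and 13 coefficients, the Hamming window, and the function names, which are all hypothetical. The `compression` argument switches between logarithmic and cube-root compression purely to illustrate that this step can vary.

```python
import numpy as np

def hz_to_mel(f_hz):
    # Assumed mel approximation: one widely used logarithmic formula.
    return 2595.0 * np.log10(1.0 + np.asarray(f_hz, dtype=float) / 700.0)

def mel_to_hz(mel):
    # Inverse of hz_to_mel.
    return 700.0 * (10.0 ** (np.asarray(mel, dtype=float) / 2595.0) - 1.0)

def triangular_filter_bank(n_filters, n_fft, sample_rate):
    # Filter edges are equally spaced on the mel axis between 0 Hz and Nyquist.
    mel_points = np.linspace(hz_to_mel(0.0), hz_to_mel(sample_rate / 2.0),
                             n_filters + 2)
    hz_points = mel_to_hz(mel_points)
    fft_freqs = np.arange(n_fft // 2 + 1) * sample_rate / n_fft
    bank = np.zeros((n_filters, fft_freqs.size))
    for i in range(n_filters):
        left, centre, right = hz_points[i:i + 3]
        rising = (fft_freqs - left) / (centre - left)
        falling = (right - fft_freqs) / (right - centre)
        # Triangle with unit peak at the centre frequency, zero outside.
        bank[i] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def mfcc_frame(frame, sample_rate, n_filters=24, n_ceps=13, compression="log"):
    n_fft = frame.size
    power = np.abs(np.fft.rfft(frame * np.hamming(n_fft))) ** 2
    energies = triangular_filter_bank(n_filters, n_fft, sample_rate) @ power
    energies = np.maximum(energies, 1e-10)  # floor to avoid log(0)
    if compression == "log":
        compressed = np.log(energies)
    elif compression == "cuberoot":
        compressed = np.cbrt(energies)
    else:
        raise ValueError("compression must be 'log' or 'cuberoot'")
    # Unnormalised DCT-II of the compressed filter bank outputs.
    n = np.arange(n_filters)
    k = np.arange(n_ceps)[:, None]
    basis = np.cos(np.pi * k * (n + 0.5) / n_filters)
    return basis @ compressed

# Usage: one 512-sample frame of a noisy 440 Hz tone at 16 kHz.
rng = np.random.default_rng(0)
t = np.arange(512) / 16000.0
frame = np.sin(2 * np.pi * 440.0 * t) + 0.01 * rng.standard_normal(512)
ceps_log = mfcc_frame(frame, 16000, compression="log")
ceps_root = mfcc_frame(frame, 16000, compression="cuberoot")
```

Swapping `hz_to_mel` for a different approximation, changing the filter shape or count in `triangular_filter_bank`, or changing `compression` each yields a different MFCC variant, which is the kind of difference the abstract describes.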