A Discriminative Model for Polyphonic Piano Transcription 论文

2006EURASIP Journal on Advances in Signal Processing引用 263顶会
Music and Audio ProcessingMusic Technology and Sound StudiesSpeech and Audio Processing

摘要

We present a discriminative model for polyphonic piano transcription. Support vector machines trained on spectral features are used to classify frame-level note instances. The classifier outputs are temporally constrained via hidden Markov models, and the proposed system is used to transcribe both synthesized and real piano recordings. A frame-level transcription accuracy of 68% was achieved on a newly generated test set, and direct comparisons to previous approaches are provided.