Framewise phoneme classification with bidirectional lstm and other neural network architectures 论文

2005引用 354

Speech Recognition and SynthesisMusic and Audio ProcessingTopic Modeling

Topic Modeling Speech Recognition and Synthesis Music and Audio Processing

作者

摘要

Abstract — In this paper, we apply bidirectional training to a Long Short Term Memory (LSTM) network for the first time. We also present a modified, full gradient version of the LSTM learning algorithm. On the TIMIT speech database, we measure the framewise phoneme classification ability of bidirectional and unidirectional variants of both LSTM and conventional Recurrent Neural Networks (RNNs). We find that the LSTM architecture outperforms conventional RNNs and that bidirectional networks outperform unidirectional ones. I.

作者查看全部 (2)

Jürgen Schmidhuber

Alex Graves

Framewise phoneme classification with bidirectional lstm and other neural network architectures 论文

摘要

作者查看全部 (2)

相关技术查看全部 (3)

相关事件

相关文章