Strategies for training large scale neural network language models 论文

2011引用 514

Topic ModelingSpeech Recognition and SynthesisNatural Language Processing Techniques

Natural Language Processing Techniques Topic Modeling Speech Recognition and Synthesis

作者

摘要

We describe how to effectively train neural network based language models on large data sets. Fast convergence during training and better overall performance is observed when the training data are sorted by their relevance. We introduce hash-based implementation of a maximum entropy model, that can be trained as a part of the neural network model. This leads to significant reduction of computational complexity. We achieved around 10% relative reduction of word error rate on English Broadcast News speech recognition task, against large 4-gram model trained on 400M tokens.

作者查看全部 (5)

Jaň Černocký

Lukáš Burget

Daniel Povey

Anoop Deoras

Strategies for training large scale neural network language models 论文

摘要

作者查看全部 (5)

相关技术查看全部 (3)

相关事件

相关文章