Recent Development of Open-Source Speech Recognition Engine Julius 论文

2009Hokkaido University Collection of Scholarly and Academic Papers (Hokkaido University)引用 277
Speech Recognition and SynthesisSpeech and dialogue systemsNatural Language Processing Techniques

摘要

Abstract—Julius is an open-source large-vocabulary speech recognition software used for both academic research and in-dustrial applications. It executes real-time speech recognition of a 60k-word dictation task on low-spec PCs with small footprint, and even on embedded devices. Julius supports standard lan-guage models such as statistical N-gram model and rule-based grammars, as well as Hidden Markov Model (HMM) as an acoustic model. One can build a speech recognition system of his own purpose, or can integrate the speech recognition capability to a variety of applications using Julius. This article describes an overview of Julius, major features and specifications, and summarizes the developments conducted in the recent years. I.