Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 论文

2007IEICE Transactions on Information and Systems引用 224

Speech Recognition and SynthesisSpeech and dialogue systemsSpeech and Audio Processing

Speech Recognition and Synthesis Speech and dialogue systems Speech and Audio Processing

作者

摘要

In January 2005, an open evaluation of corpus-based text-to-speech synthesis systems using common speech datasets, named Blizzard Challenge 2005, was conducted. Nitech group participated in this challenge, entering an HMM-based speech synthesis system called Nitech-HTS 2005. This paper describes the technical details, building processes, and performance of our system. We first give an overview of the basic HMM-based speech synthesis system, and then describe new features integrated into Nitech-HTS 2005 such as STRAIGHT-based vocoding, HSMM-based acoustic modeling, and a speech parameter generation algorithm considering GV. Constructed Nitech-HTS 2005 voices can generate speech waveforms at 0.3 ×RT (real-time ratio) on a 1.6 GHz Pentium 4 machine, and footprints of these voices are less than 2 Mbytes. Subjective listening tests showed that the naturalness and intelligibility of the Nitech-HTS 2005 voices were much better than expected.

作者查看全部 (4)

Keiichi Tokuda

Masahide Nakamura

Tomoki Toda

Heiga Zen

Details of the Nitech HMM-Based Speech Synthesis System for the Blizzard Challenge 2005 论文

摘要

作者查看全部 (4)

相关技术查看全部 (2)

相关事件

相关文章