The Architecture of the Festival Speech Synthesis System 论文
摘要
We describe a new formalism for storing linguistic data in a text to speech system. Linguistic entities such as words and phones are stored as feature structures in a general object called an linguistic item. Items are configurable at run time and via the feature structure can contain arbitrary information. Linguistic relations are used to store the relationship between items of the same linguistic type. Relations can take any graph structure but are commonly trees or lists. Utterance structures contain all the items and relations contained in a single utterance. We first describe the design goals when building a synthesis architecture, and then describe some problems with previous architectures. We then discuss our new formalism in general along with the implementation details and consequences of our approach. 1. INTRODUCTION Speech synthesis systems require ways of storing the various types of linguistic information produced in the process of converting the input format (e.g. text...
相关事件
暂无数据
相关文章
暂无数据