Taming recognition errors with a multimodal interface (Paper)

2000, Communications of the ACM, Citations: 301

Speech and dialogue systems; Multi-Agent Systems and Negotiation; Topic Modeling

Abstract

The article focuses on recovering from speech-recognition errors by using a multimodal interface. Speech-recognition errors can result from various factors, such as diverse speaker styles or speech in noisy field settings. The recognition rate degrades whenever a user's speech style departs in some way from the training data on which a recognizer was developed. The author argues that multimodal architectures combining speech and pen input can reduce speech-recognition errors: rather than relying on the recognizer alone, a different approach to resolving the impasse created by recognition errors is to design a more flexible multimodal interface that incorporates speech as one of its input options. One motivation for developing multimodal systems has been their potential for expanding the accessibility of computing to more diverse and non-specialist users while promoting new forms of computing not available in the past. Multimodal systems support this trend, and permit users to alternate modes and switch between modalities as needed during the changing conditions of mobile use.

Authors

Sharon Oviatt
