Evaluating and Preserving Lexical Stress in English-to-Chinese Speech-to-Speech Translation

ArXiv CS.CL | 2026-06-16 | NEWS | en | Authors: Yuchen Song, Xi Chen, Mingze Li, Satoshi Nakamura

Details

Source site
ArXiv CS.CL
Authors
Yuchen Song, Xi Chen, Mingze Li, Satoshi Nakamura
Article type
NEWS
Language
en
Publication date
2026-06-16

Abstract

arXiv:2606.15266v1 (announce type: new)

Speech-to-speech translation (S2ST) systems have made substantial progress in semantic accuracy and speech naturalness. However, the cross-lingual transfer of lexical stress, a vital cue for emphasis and speaker intent, remains largely underexplored, and the problem is compounded by a lack of reliable automatic evaluation metrics for tonal languages such as Chinese. We investigate stress transfer in English-to-Chinese S2ST by constructing a stress-annotated Chinese dataset and an XLS-R-based Mandarin stress detector. Combining this detector with the English EmphAssess system, we propose a novel objective metric for cross-lingual stress evaluation. Furthermore, we fine-tune CosyVoice3 to build a stress-aware S2ST system. Experiments demonstrate that the proposed S2ST architecture significantly outperforms existing systems in stress translation capability while maintaining competitive translation quality.
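The abstract does not spell out how the proposed metric is computed. As a rough illustration only, the sketch below shows one way a cross-lingual stress score could be formed once a source-side detector and a target-side detector have each marked stressed words. The function name `stress_transfer_scores`, the word-index representation, the word alignment input, and the precision/recall/F1 formulation are all assumptions made for this example, not the authors' definition.

```python
from typing import Iterable, Set, Tuple


def stress_transfer_scores(
    src_stressed: Set[int],
    tgt_stressed: Set[int],
    alignment: Iterable[Tuple[int, int]],
) -> Tuple[float, float, float]:
    """Return (precision, recall, F1) for stress carried from source to target.

    Hypothetical formulation: indices are word positions, and `alignment`
    holds (source_index, target_index) pairs from any word aligner.
    """
    # Target positions that should be stressed: those aligned to a stressed source word.
    expected = {t for s, t in alignment if s in src_stressed}
    # Target positions that are both expected and actually detected as stressed.
    hits = len(expected & tgt_stressed)
    precision = hits / len(tgt_stressed) if tgt_stressed else 0.0
    recall = hits / len(expected) if expected else 0.0
    f1 = 2 * precision * recall / (precision + recall) if hits else 0.0
    return precision, recall, f1


# Toy example: "I ONLY asked him" -> "我 只 问了 他", aligned one-to-one.
# The source stresses word 1 ("only"); the target detector flags words 1 and 3.
scores = stress_transfer_scores({1}, {1, 3}, [(0, 0), (1, 1), (2, 2), (3, 3)])
```

In this toy case the stressed source word is carried over (recall 1.0), but one extra target word is also stressed, so precision drops to 0.5. A real evaluation would depend on the detectors' outputs and on alignment quality, neither of which is modeled here.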
