Multi-task Learning is Not Enough: Representational Entanglement in Dual-output Second Language Speech Recognition 文章

ArXiv CS.CL2026-06-05NEWSen作者: Seung Hwan Cho, Young-Min Kim

摘要

arXiv:2606.06065v1 Announce Type: new Abstract: Second-language (L2) speech recognition often requires transcriptions of pronunciations and intended meanings. Multi-task learning (MTL) is a natural approach because it assumes that shared representations benefit both outputs. However, this paper shows that this assumption does not hold across Korean and English. MTL improves meaning but degrades surface transcription, especially in English, where the degradation scales with surface-meaning divergence measured by Levenshtein edit distance.Encoder analysis links these patterns to encoder-level entanglement, with Korean preserving distinct task representations while English produces nearly identical ones. Cross-task decoder analysis shows that the meaning dual-output decoder adapts with a unique representation, while the surface dual-output decoder remains constrained by the encoder.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据