UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception
Event: PRODUCT_LAUNCH | 2026-06-01 | Impact: MEDIUM
arXiv:2605.31521v1
Announce Type: new
Abstract: Semantic speech tokenizers have become a widely used interface for Audio-LLMs, owing to their compact single-codebook design and strong linguistic alignment. However, their focus on linguistic abstraction induces acoustic blindness, limiting their applicability beyond speech-centric tasks. We propose UniAudio-Token, a framework that empowers semantic tokenizers wit
Related coverage (1)
UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception
ArXiv CS.CL, 2026-06-01