UniAudio-Token: Empowering Semantic Speech Tokenizers with General Audio Perception

Event: PRODUCT_LAUNCH | Date: 2026-06-01 | Impact: MEDIUM

arXiv:2605.31521v1 | Announce Type: new

Abstract: Semantic speech tokenizers have become a widely used interface for Audio-LLMs, owing to their compact single-codebook design and strong linguistic alignment. However, their focus on linguistic abstraction induces acoustic blindness, limiting their applicability beyond speech-centric tasks. We propose UniAudio-Token, a framework that empowers semantic tokenizers wit