Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS
Event: PRODUCT_LAUNCH | 2026-06-01 | Impact: MEDIUM
arXiv:2605.30748v1 | Announce Type: cross
Abstract: We present Chatterbox-Flash, a zero-shot text-to-speech model obtained by fine-tuning a pretrained autoregressive TTS decoder into a block-diffusion decoder, enabling parallel token generation within each block while retaining block-by-block streaming. We find that naively transferring mainstream block-diffusion decoding to discrete speech tokens degrades quality, as
Related coverage (1)
Chatterbox-Flash: Prior-Calibrated Block Diffusion for Streaming Zero-Shot TTS
ArXiv CS.AI, 2026-06-01