Data-Efficient On-Policy Distillation for Automatic Speech Recognition
Event: PRODUCT_LAUNCH | Date: 2026-05-28 | Impact: MEDIUM
arXiv:2605.28139v1 Announce Type: new
Abstract: Building competitive automatic speech recognition (ASR) models usually requires large-scale audio supervision, which makes reproduction and specialization expensive. We study Ark-ASR, a 0.6B-parameter audio-conditioned language model trained with 100k hours of speech, and examine whether a strong Qwen-ASR teacher can transfer additional recognition capability through on-poli