Building Community-Centred NLP Resources for Puno Quechua

BREAKTHROUGH | 2026-05-28 | Impact: HIGH

arXiv:2605.28253v1 (Announce Type: new)

Abstract: The preservation of under-resourced languages requires digital tools and resources shaped by and for their speakers. We present the first dedicated ASR resources for Puno Quechua (ISO 639-3: qxp): (1) the largest speech corpus for any single Quechua variety, consisting of 66 hours of recordings of scripted and spontaneous speech (including 36 hours of manually transcribed and validated dat