Scaling few-shot spoken word classification with generative meta-continual learning 文章

ArXiv CS.CL2026-06-05NEWSen作者: Louise Beyers, Batsirayi Mupamhi Ziki, Ruan van der Merwe

摘要

arXiv:2605.13075v3 Announce Type: replace Abstract: Few-shot spoken word classification has largely been developed for applications where a small number of classes is considered, and so the potential of larger-scale few-shot spoken word classification remains untapped. This paper investigates the potential of a spoken word classifier to sequentially learn to distinguish between 1000 classes when it is given only five shots per class. We demonstrate that this scaling capability exists by training a model using the Generative Meta-Continual Learning (GeMCL) algorithm and comparing it to repeatedly trained or finetuned baselines. We find that GeMCL produces exceptionally stable performance, and although it does not always outperform a repeatedly fully-finetuned HuBERT model nor a frozen HuBERT model with a repeatedly trained classifier head, it produces comparable performance to the latter while adapting 2000 times faster, having been trained less than half of the data for two orders of…

摘要可能不完整，可查看原文

Scaling few-shot spoken word classification with generative meta-continual learning 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (3)

相关技术查看全部 (2)