Activation-Based Active Learning for In-Context Learning: Challenges and Insights

arXiv cs.CL, 2026-06-04. Authors: Yaseen M. Osman, Geoff V. Merrett, Stuart E. Middleton

Abstract

arXiv:2606.05134v1 (announce type: new). Deep active learning has previously been explored for LLM in-context sample selection, but not with methods that utilise recent advances in the understanding of transformer activations. In this paper, we test the hypothesis that model activations could provide a fine-grained signal to optimise the selection of in-context examples. We present the most comprehensive analysis to date of MLP activation-based deep active learning methods applied to in-context learning, including how different attention masking strategies impact active learning across diverse classification and generative datasets, using both Llama-3.2-3B and Qwen2.5-3B base models. However, we find a negative result: MLP outputs, viewed through the lenses of massive activations or the first four moments, do not correlate with example quality or task performance. Specifically, the absolute Spearman correlation coefficient is at most 0.
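The kind of analysis the abstract describes can be illustrated with a minimal sketch: summarise each candidate example's MLP outputs by their first four moments, then rank-correlate each moment with a per-example quality score. This is not the authors' pipeline. The function names, the use of population moments and excess kurtosis, and the synthetic Gaussian "activations" and random scores are all assumptions made for illustration; a real experiment would read activations from the model and measure downstream task performance.

```python
import math
import random


def first_four_moments(values):
    """Return (mean, variance, skewness, excess kurtosis) of a flat list of floats.

    Population (biased) estimators are used; this is an illustrative choice.
    """
    n = len(values)
    mean = sum(values) / n
    centred = [v - mean for v in values]
    var = sum(c ** 2 for c in centred) / n
    if var == 0.0:
        return mean, 0.0, 0.0, 0.0
    std = math.sqrt(var)
    skew = sum((c / std) ** 3 for c in centred) / n
    kurt = sum((c / std) ** 4 for c in centred) / n - 3.0
    return mean, var, skew, kurt


def average_ranks(xs):
    """Rank values from 1..n, giving tied values their average rank."""
    order = sorted(range(len(xs)), key=lambda i: xs[i])
    ranks = [0.0] * len(xs)
    i = 0
    while i < len(order):
        j = i
        while j + 1 < len(order) and xs[order[j + 1]] == xs[order[i]]:
            j += 1
        avg = (i + j) / 2.0 + 1.0  # mean of the 1-based positions i+1..j+1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def spearman(xs, ys):
    """Spearman's rho, computed as the Pearson correlation of the ranks."""
    rx, ry = average_ranks(xs), average_ranks(ys)
    mx, my = sum(rx) / len(rx), sum(ry) / len(ry)
    cov = sum((a - mx) * (b - my) for a, b in zip(rx, ry))
    sx = math.sqrt(sum((a - mx) ** 2 for a in rx))
    sy = math.sqrt(sum((b - my) ** 2 for b in ry))
    return cov / (sx * sy) if sx and sy else float("nan")


if __name__ == "__main__":
    rng = random.Random(0)
    # Synthetic stand-ins: 50 candidate examples, each with 256 "MLP outputs"
    # and an unrelated random quality score (hypothetical data, not real results).
    activations = [[rng.gauss(0.0, 1.0) for _ in range(256)] for _ in range(50)]
    quality = [rng.random() for _ in range(50)]

    moments = [first_four_moments(a) for a in activations]
    for idx, name in enumerate(["mean", "variance", "skewness", "kurtosis"]):
        rho = spearman([m[idx] for m in moments], quality)
        print(f"{name}: Spearman rho = {rho:+.3f}")
```

Because the synthetic scores are drawn independently of the activations, the printed coefficients only demonstrate the mechanics of the computation and say nothing about the paper's findings.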
