Efficient Benchmarking Is Just Feature Selection and Multiple Regression 文章

ArXiv CS.CL2026-06-01NEWSen作者: Sam Bowyer, Acyr Locatelli, Kris Cao

摘要

arXiv:2605.25773v2 Announce Type: replace-cross Abstract: Efficient benchmarking techniques aim to lower the computational cost of evaluating LLMs by predicting full benchmark scores using only a subset of a benchmark's questions. By reframing this problem as an instance of multiple regression with feature selection, we find that existing efficient benchmarking methods can be greatly improved by simply using kernel ridge regression at the prediction stage. Additionally, using an information-theoretic feature-selection algorithm called minimum redundancy maximum relevance (mRMR), we can further improve upon these methods by selecting question subsets that will be maximally useful for prediction.