Spiking the training data to correct for test set contamination 事件

PRODUCT_LAUNCH2026-05-26影响: MEDIUM

Spiking the training data to correct for test set contamination arXiv:2605.24818v1 Announce Type: cross Abstract: The literature on test set contamination largely focuses on detection, but the correction of contaminated test scores is underexplored. Our core proposal is to spike the training data by intentionally contaminating some test examples at known rates. The spiked examples can then be used to calibrate predictors of model memorization which enable principled statistical correction of in

Spiking the training data to correct for test set contamination · 相关人物