Gap-K%: Measuring Top-1 Prediction Gap for Detecting Pretraining Data 文章

ArXiv CS.CL2026-06-01NEWSen作者: Minseo Kwak, Jaehyung Kim

摘要

arXiv:2601.19936v2 Announce Type: replace-cross Abstract: The opacity of massive pretraining corpora in Large Language Models (LLMs) raises significant privacy and copyright concerns, making pretraining data detection a critical challenge. Existing state-of-the-art methods typically rely on token likelihoods, yet they often overlook the gap between the target token and the model's top-1 prediction, as well as local correlations between adjacent tokens. In this work, we propose Gap-K%, a novel pretraining data detection method grounded in the optimization dynamics of LLM pretraining. By analyzing the next-token prediction objective, we observe that discrepancies between the model's top-1 prediction and the target token induce strong gradient signals, which are explicitly penalized during training.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据