idSCD: Identifying Training Datasets through Semantic Correlation Descriptors 文章

ArXiv CS.AI2026-06-01NEWSen作者: Andrada Gobeaja, Ionut Hodoroaga, Elena Burceanu, Marius Leordeanu

摘要

arXiv:2605.30462v1 Announce Type: cross Abstract: Can a dataset be recognized from the spurious correlations it induces during training? We argue that datasets leave dataset-specific traces in a model's learned semantic correlation structure: incidental regularities that are predictive within a dataset, but not causal for the underlying task, can be internalized during training. We use this insight to study dataset-level membership inference, moving beyond existing methods that rely on behavioral or distributional evidence such as confidence scores, losses, margins, generated samples, or query responses. We introduce a white-box semantic fingerprinting approach based on semantic correlation descriptors (SCDs), which capture the semantic correlation structure learned by a model and make it comparable across dataset mixtures. In a controlled leave-one-dataset-out diagnostic, SCDs recover dataset-specific changes and perfectly separate matching from non-matching dataset pairs.

idSCD: Identifying Training Datasets through Semantic Correlation Descriptors 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术