FLIPS: Instance-Fingerprinting for LLMs via Pseudo-random Sequences 文章

ArXiv CS.AI2026-06-03NEWSen作者: Gurvan Richardeau, Gohar Dashyan, Erwan Le Merrer, Gilles Tredan

摘要

arXiv:2606.03330v1 Announce Type: cross Abstract: Literature reveals that a Large Language Model's (LLM) behavior is not only conditioned by its original weights but also its instance-level parameters, such as instructional prompt, sampling configuration or quantization. A model that generates safe outputs under one configuration may produce toxic content under another. However, current LLM identification techniques (such as fingerprinting) focus on intellectual property protection, and their design favors robustness to changes in these instance-level parameters. This poses a critical challenge for AI regulation in which compliance assessments target actual deployed behaviors, not model provenance. In this paper, we introduce instance-level fingerprinting, a regulator-oriented paradigm that distinguishes configurations of the same LLM.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据