Referential Security as a New Paradigm for AI Evaluations 文章

ArXiv CS.AI2026-05-26NEWSen作者: Dan Ristea, Vasilios Mavroudis

摘要

arXiv:2605.25673v1 Announce Type: cross Abstract: Security evaluations inherently depend on stable identifiers. Any finding, audit, or regulatory decision must remain attached to the specific artifact it pertains to. Continuously updated artificial intelligence systems violate this core assumption, with public model designations remaining static while underlying weights, prompts, retrieval mechanisms, misuse classifiers, inference settings, and serving infrastructures undergo unannounced modifications. Consequently, current evaluations frequently apply to superficial labels rather than identifiable and distinct systems. To resolve this, we propose referential security as a new paradigm for AI evaluation. The fundamental security question extends beyond whether a model is safe to whether subsequent parties can conclusively determine which system a specific safety claim addressed.

Referential Security as a New Paradigm for AI Evaluations 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术