An Embarrassingly Simple Detector for Model Extraction Attacks in Large Language Model API Traffic

Event: PRODUCT_LAUNCH | Date: 2026-06-05 | Impact: MEDIUM

arXiv:2606.05725v1 | Announce Type: cross

Abstract: Large language models (LLMs) are increasingly deployed through hosted APIs, making model extraction a practical threat to model ownership and service security. However, individual extraction queries often resemble benign requests, and existing evaluations often focus on single-query anomaly scoring or pure benign-versus-attacker user settings. We f