Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure 事件

PRODUCT_LAUNCH2026-05-27影响: MEDIUM

Representation-Aware Unlearning via Activation Signatures: From Suppression to Entity-Signature Erasure arXiv:2601.10566v5 Announce Type: replace Abstract: Entity-level unlearning is usually evaluated by what a model says: whether it stops naming the target, refuses a query, or shifts a Truth Ratio distribution. These output-level tests, however, do not show whether a subject's internal representation has been attenuated. We introduce the Entity Representation Unlearning Framework (ERUF), a rep