Concept Unlearning via Cross-Attention Activation Projection for Diffusion Models 文章

ArXiv CS.CV2026-05-26NEWSen作者: Saemi Moon, Suhyeon Jun, Seoyeon Lee, Dongwoo Kim

摘要

arXiv:2605.25765v1 Announce Type: new Abstract: Concept unlearning aims to erase a target concept from a pretrained text-to-image diffusion model without retraining. Closed-form methods are attractive in this setting because they apply a single deterministic edit to the cross-attention weights and add no inference-time cost. Existing closed-form methods, however, represent the target concept through the text encoder's response to a few short anchor prompts that name it, and paraphrased prompts that evoke the concept without naming it consistently bypass the edit. We argue that the target should instead be represented in the cross-attention activation space. Text embeddings describe the user's prompt, while cross-attention activations describe what the model is about to render, and the latter generalize to paraphrase the anchor templates do not cover.

Concept Unlearning via Cross-Attention Activation Projection for Diffusion Models 文章

摘要

相关事件查看全部 (1)

相关公司查看全部 (2)

相关人物

相关产品查看全部 (10)

相关技术查看全部 (23)