You Don't Need All That Attention: Surgical Memorization Mitigation in Text-to-Image Diffusion Models 文章

ArXiv CS.CV2026-06-02NEWSen作者: Kairan Zhao, Eleni Triantafillou, Peter Triantafillou

摘要

arXiv:2603.00133v2 Announce Type: replace Abstract: Generative models have been shown to "memorize" certain training data, leading to verbatim or near-verbatim generating images, which may cause privacy concerns or copyright infringement. We introduce Guidance Using Attractive-Repulsive Dynamics (GUARD), a novel framework for memorization mitigation in text-to-image diffusion models. GUARD adjusts the image denoising process to guide the generation away from an original training image and towards one that is distinct from training data while remaining aligned with the prompt, guarding against reproducing training data, without hurting image generation quality. We propose a concrete instantiation of this framework, where the positive target that we steer towards is given by a novel method for (cross) attention attenuation based on (i) a novel statistical mechanism that automatically identifies the prompt positions where cross attention must be attenuated and (ii) attenuating…

摘要可能不完整,可查看原文

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据