Equilibrated Diffusion: Frequency-aware Textual Embedding for Equilibrated Image Customization 文章

ArXiv CS.CV2026-06-02NEWSen作者: Liyuan Ma, Xueji Fang, Guo-Jun Qi

摘要

arXiv:2606.02129v1 Announce Type: new Abstract: Image customization learns target subjects from reference concept images and generates conditioned images per text prompts, mainly modifying styles or backgrounds. Prevailing methods adopt fine-tuning to pack diverse concept attributes into a unified latent embedding, yet entangled attributes hinder elimination of irrelevant disturbances from style and background. To address this issue, we propose Equilibrated Diffusion, a frequency-driven approach that disentangles tangled concept features for balanced customization and consistent text-visual matching. Unlike conventional methods learning full concepts with shared embeddings and unified tuning, our work utilizes the inherent link between image frequency components and semantics: low frequencies represent subject content and high frequencies correspond to styles. We decompose concepts in frequency space and optimize each embedding independently.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据