RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation 事件

PRODUCT_LAUNCH2026-06-02影响: MEDIUM

RichControl: Structure- and Appearance-Rich Training-Free Spatial Control for Text-to-Image Generation arXiv:2507.02792v5 Announce Type: replace Abstract: Text-to-image (T2I) diffusion models have shown remarkable success in generating high-quality images from text prompts. Recent efforts extend these models to incorporate conditional images (e.g., canny edge) for fine-grained spatial control. Among them, feature injection methods have emerged as a training-free alternative to traditional fine-