Towards Anatomically Plausible Human Image Generation via Synthetic Localized Preferences 文章

ArXiv CS.CV2026-05-26NEWSen作者: Bao Li, Yuliang Xiu, Zhen Liu

摘要

arXiv:2605.25759v1 Announce Type: new Abstract: Large-scale text-to-image foundation models have achieved remarkable visual realism, yet generating human images with correct anatomical structures remains challenging. Existing approaches enforce anatomical constraints through part-specific modules or localized loss weighting during supervised fine-tuning on high-quality human photos, but such datasets are limited and often provide ambiguous optimization signals due to confounding factors such as lighting, pose, and background. Preference-based alignment offers an alternative, but standard Direct Preference Optimization (DPO) treats all pixels equally and therefore fails to exploit the localized nature of anatomical artifacts. To address this, we propose the framework of Alignment via Synthetic Anatomical Preference (ASAP), which constructs controlled preference pairs through a localized degradation mechanism applied to high-fidelity human images.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据