Guess the Unified Model: How Much Can We Recover from Generated Images? 文章

ArXiv CS.CV2026-05-26NEWSen作者: Jasin Cekinmez, Ryo Mitsuhashi, Addison J. Wu, Yida Yin

摘要

arXiv:2605.25254v1 Announce Type: new Abstract: With unified model-generated images now widespread online, attributing their model of origin offers a path toward transparency and deeper insight into the characteristic behaviors of individual models. Prior work has explored provenance in LLM-generated text, diffusion model images, and datasets, but the separability of unified model-generated images remains an underexplored area. We address this gap by examining separability across corruption, domains, and prompt languages using images generated by seven unified models. We show that model attribution is highly feasible as our model achieves near-perfect accuracy with around 20K images per model. Corruptions and structural perturbations have only a modest effect on attribution performance, and cross-domain generalization reveals that semantic content contributes to separability but is not the dominant signal.