Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback 事件
PRODUCT_LAUNCH2026-06-05影响: MEDIUM
Where, What, Why, and Importance: Structured Defect Grounding for Text-to-Image Feedback arXiv:2606.06113v1 Announce Type: new Abstract: Despite generating increasingly photorealistic images, text-to-image (T2I) models still exhibit localized, subtle, and structurally complex failures. Diagnosing these failures requires instance-level feedback that answers where a defect occurs, what type it is, why it is defective, and its importance to overall image quality. While recent dense-feedback method