No Safe Dose: How Training Data Drives Unsafe Image Generation 事件

PRODUCT_LAUNCH2026-05-28影响: MEDIUM

No Safe Dose: How Training Data Drives Unsafe Image Generation arXiv:2605.28137v1 Announce Type: new Abstract: Text-to-image models trained on large-scale data often inevitably ingest unsafe content. While some people observe input-output amplifications, it remains unclear whether and how training data composition directly drives model output safety or by other factors. We shed light on this question by isolating this variable: we train the same text-to-image model on datasets that differ \emph