Beyond Objects: Contextual Synthetic Data Generation for Fine-Grained Classification 文章

ArXiv CS.CV2026-06-02NEWSen作者: William Yang, Xindi Wu, Zhiwei Deng, Esin Tureci, Olga Russakovsky

摘要

arXiv:2510.24078v2 Announce Type: replace Abstract: Text-to-image (T2I) models are increasingly used for synthetic dataset generation, but generating effective synthetic training data for classification remains challenging. Fine-tuning a T2I model with a few real examples can help improve the quality of synthetic training data; however, it may also cause overfitting and reduce diversity in the generated samples. We propose a fine-tuning strategy BOB (BeyondOBjects) to mitigate these concerns for fine-grained classification. Given a small set of real examples, we first extract class-agnostic attributes such as scene background and object pose. We then explicitly condition on these attributes during fine-tuning of the T2I model and marginalize them out during generation. This design mitigates overfitting, preserves the T2I model's generative prior, reduces estimation errors, and further minimizes unintended inter-class associations.

相关公司

暂无数据

相关人物

暂无数据

相关产品

暂无数据