Text-Only Data Synthesis for Vision Language Model Training

ArXiv CS.CV · 2026-05-28 · Authors: Xiaomin Yu, Wenjie Zhang, Ziyue Qiao, Chengwei Qin, Hui Xiong

Text-Only Data Synthesis for Vision Language Model Training · Related Technologies