MAGIC: Multimodal Alignment & Grounding-aware Instruction Coreset for Vision-Language Models 文章

ArXiv CS.CV2026-05-26NEWSen作者: Shristi Das Biswas, Kaushik Roy

MAGIC: Multimodal Alignment & Grounding-aware Instruction Coreset for Vision-Language Models · 相关技术

相关技术