Multilingual OCR-Aware Fine-Tuning and Prompt-Guided Chain-of-Thought Reasoning for Multimodal Large Language Models 文章

ArXiv CS.CV2026-05-26NEWSen作者: Qinwu Xu, Yifan Jiang, Haoyu Ren

Multilingual OCR-Aware Fine-Tuning and Prompt-Guided Chain-of-Thought Reasoning for Multimodal Large Language Models · 相关技术