VIDA: A dataset for Visually Dependent Ambiguity in Multimodal Machine Translation

arXiv cs.CL · 2026-05-27 · Authors: Jingheng Pan, Xintong Wang, Longyue Wang, Liang Ding, Weihua Luo, Chris Biemann
