IFMTBench: A Comprehensive Benchmark for Multilingual Translation Instruction Following 文章

ArXiv CS.CL2026-05-28NEWSen作者: Mingrui Sun, Mao Zheng, Zheng Li, Mingyang Song

摘要

arXiv:2605.28218v1 Announce Type: new Abstract: Modern translation workflows demand more than semantic equivalence. Users routinely require models to preserve JSON or HTML schemas, honor curated glossaries, disambiguate with provided context, and match prescribed registers, often several at once. Conventional metrics such as BLEU and xCOMET capture semantic fidelity but provide little signal on constraint adherence, while general instruction following benchmarks ignore the cross-lingual nature of translation. We introduce \bench, a benchmark for multilingual translation instruction following covering seven languages, with 4,506 single-constraint and 2,838 multi-constraint items spanning six constraint dimensions and five compositional patterns with instructions issued in all seven languages.

IFMTBench: A Comprehensive Benchmark for Multilingual Translation Instruction Following 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品查看全部 (1)

相关技术查看全部 (4)