Testing the Deliteralization Hypothesis in Human and Machine Translation 文章

ArXiv CS.CL2026-05-26NEWSen作者: Malik Marmonier, Rachel Bawden, Beno\^it Sagot

摘要

arXiv:2605.25686v1 Announce Type: new Abstract: The recent shift from dedicated NMT systems to general-purpose LLMs has reshaped machine translation, with LLMs reported to produce more fluent, less literal output than their predecessors. We test whether this shift extends to the deliteralization hypothesis, the long-standing claim from translation studies that translations become progressively less literal as they are drafted and revised. Using the WMT24++ dataset, we compare the literality of human translations and post-editions to that of two NMT systems and six LLMs across 54 language pairs and three tasks: direct translation, iterative self-revision, and post-editing of human drafts. Literality is measured via a validated Synthetic Literality Index built from six heuristics. We find that (i) human translations remain significantly less literal than those of all tested MT systems, though recent LLMs narrow the gap;

相关公司

暂无数据

相关人物

暂无数据