When the Gold Standard Isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content 文章

ArXiv CS.CL2026-06-02NEWSen作者: Lydia Nishimwe, Beno\^it Sagot, Rachel Bawden

详细信息

来源站点: ArXiv CS.CL
作者: Lydia Nishimwe, Beno\^it Sagot, Rachel Bawden
文章类型: NEWS
语言: en
发布日期: 2026-06-02

摘要

arXiv:2512.17738v3 Announce Type: replace Abstract: User-generated content (UGC) is characterised by frequent use of non-standard language, from spelling errors to expressive choices such as slang, character repetitions, and emojis. This makes evaluating UGC translation challenging: what counts as a "good" translation depends on the desired standardness level of the output. To explore this, we examine the human translation guidelines of four UGC datasets, and derive a taxonomy of twelve non-standard phenomena and five translation actions (NORMALISE, COPY, TRANSFER, OMIT, CENSOR). Our analysis reveals notable differences in how UGC is treated, resulting in a spectrum of standardness in reference translations. We show that translation scores of large language models are highly sensitive to prompts with explicit UGC translation instructions, and that they improve when they align with the dataset guidelines.

When the Gold Standard Isn't Necessarily Standard: Challenges of Evaluating the Translation of User-Generated Content 文章

详细信息

摘要

相关事件

相关公司

相关人物

相关产品

相关技术