From Outliers to Errors: Auditing Pali-to-English LLM Translations with Multi-Reference Adjudication 事件
PRODUCT_LAUNCH2026-06-02影响: MEDIUM
From Outliers to Errors: Auditing Pali-to-English LLM Translations with Multi-Reference Adjudication arXiv:2606.01136v1 Announce Type: new Abstract: Single-score translation metrics can conflate legitimate variation with error, a problem especially acute for classical languages where multiple defensible English renderings of the same passage coexist. We audit Pali-to-English output from four flagship large language models (LLMs): GPT-5.5, Claude Sonnet 4.6, Gemini 3.1 Pro, and Grok 4.3, on 1,70