Translation Analytics for Freelancers II: Benchmarking Local LLMs for Confidential Translation Workflows 文章

ArXiv CS.CL2026-06-01NEWSen作者: Yuri Balashov, Rex VanHorn, Mingxi Xu, Austin Downes

摘要

arXiv:2605.31452v1 Announce Type: new Abstract: Building on our previous work, this paper develops practical, low-barrier methods for freelance translators and smaller language service providers to evaluate translation technologies using rigorous yet accessible analytic methods. Here we address a high-stakes, specialized need: offline translation for confidentiality-sensitive domains in which privacy constraints preclude the use of cloud-based engines and commercial LLMs. We expand the Reeve Foundation Trilingual Corpus (RFTC) used in our previous work into a multilingual corpus (RFMC) by adding sentence-aligned German and Simplified Chinese reference translations. We then benchmark several locally runnable language models (via Ollama) across four language directions on 1000+ sentences selected from this corpus. We use consistent single-prompt calls without fine-tuning or domain adaptation, comparing local LLM outputs against commercial NMTs (DeepL, Baidu), a frontier LLM (GPT-5.

相关公司查看全部 (3)

D
DeepLCOMPANY

相关人物

暂无数据

相关技术

暂无数据