"Be My Cheese?": Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs 事件
PRODUCT_LAUNCH2026-05-29影响: MEDIUM
"Be My Cheese?": Cultural Nuance Benchmarking for Machine Translation in Multilingual LLMs arXiv:2602.04729v2 Announce Type: replace Abstract: We present a large-scale human evaluation benchmark for assessing cultural localisation in machine translation produced by state-of-the-art multilingual large language models (LLMs). Existing MT benchmarks emphasise token-level and grammatical accuracy, but often overlook the pragmatic and culturally grounded competencies required for real-world localisa