Translation Heads: Disentangling meaning from language in LLM-based machine translation

Event: PRODUCT_LAUNCH | 2026-06-04 | Impact: MEDIUM

arXiv:2602.04613v2. Announce Type: replace. Abstract: Mechanistic Interpretability (MI) seeks to explain how neural networks implement their capabilities, but the scale of Large Language Models (LLMs) has limited prior MI work in Machine Translation (MT) to word-level analyses. We study sentence-level MT from a mechanistic perspective by analyzing attention heads to understand how LLMs internally encode and di