Translation Heads: Disentangling meaning from language in LLM-based machine translation 事件
PRODUCT_LAUNCH2026-06-04影响: MEDIUM
Translation Heads: Disentangling meaning from language in LLM-based machine translation arXiv:2602.04613v2 Announce Type: replace Abstract: Mechanistic Interpretability (MI) seeks to explain how neural networks implement their capabilities, but the scale of Large Language Models (LLMs) has limited prior MI work in Machine Translation (MT) to word-level analyses. We study sentence-level MT from a mechanistic perspective by analyzing attention heads to understand how LLMs internally encode and di