Mix-MoE: Improving Multilingual Machine Translation of Large Language Models through Mixed MoEs (Event)

PRODUCT_LAUNCH | 2026-05-26 | Impact: MEDIUM

arXiv:2605.24681v1. Abstract: Large Language Models (LLMs) have shown great promise in multilingual machine translation (MT), even with limited bilingual supervision. However, fine-tuning LLMs with parallel corpora presents major challenges, namely parameter interference. To address these issues, we propose Mix-MoE, a mixed Mixture-of-Experts framework designed to train LLMs for mul