Learning When to Translate for Multilingual Reasoning 文章

ArXiv CS.CL2026-06-02NEWSen作者: Deokhyung Kang, Hyounghun Kim, Gary Geunbae Lee

摘要

arXiv:2606.02465v1 Announce Type: new Abstract: Reasoning language models (RLMs) achieve strong performance on complex reasoning tasks, but still exhibit substantial multilingual reasoning gaps, largely due to language-understanding failures in non-English inputs. English translation can mitigate these failures by expressing non-English inputs in a form that RLMs can more reliably interpret, yet translating every input is unnecessary when the model can reason reliably from the original query. To address this challenge, we propose Luar, a Language Understanding Boundary-aware Reinforcement Learning framework that trains RLMs to selectively invoke translation when direct understanding is unreliable. Luar trains the model to choose between solving the original input directly and reasoning over its English translation, encouraging translation only when translator-augmented reasoning is expected to substantially outperform direct reasoning.

相关事件查看全部 (1)

Learning When to Translate for Multilingual Reasoning
2026-06-02PRODUCT_LAUNCH影响: MEDIUM

相关公司

暂无数据

相关人物

暂无数据