Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models 文章

ArXiv CS.CL2026-06-01NEWSen作者: Athina Kyriakou, Dennis Ulmer, Ivan Titov

摘要

arXiv:2605.31220v1 Announce Type: new Abstract: Confidence estimation (CE), i.e. quantifying the reliability of a model's prediction, has attracted great interest in the context of large language models (LLMs). However, most studies focus on English, ignoring the multilingual reality of LLM usage, while many CE methods degrade or require retraining across languages. To address this gap, we investigate whether multilingual LLMs encode shared, language-transferable confidence features. We use a lightweight linear probe that predicts answer correctness directly from intermediate representations. Trained monolingually, the probe generalizes zero-shot to unseen, typologically diverse languages without target-language supervision. Learned layer weights and multiple ablations reveal that confidence features concentrate in middle layers across languages, suggesting a shared confidence subspace.

Shared Doubt: Zero-shot Cross-Lingual Confidence Estimation for Language Models 文章

摘要

相关事件查看全部 (1)

相关公司

相关人物

相关产品

相关技术查看全部 (3)