The Latin Substrate: How Language Models Represent and Mediate Script Choice
Event: PRODUCT_LAUNCH, 2026-06-01
Impact: MEDIUM
arXiv:2605.31363v1 Announce Type: new
Abstract: Many languages are written in multiple scripts, requiring large language models (LLMs) to generate equivalent linguistic content in distinct orthographic forms. While prior work suggests that LLMs route information through shared latent representations, how they internally mediate script variation remains poorly understood. We study this question by first examining per-