Characterizing Linear Alignment Across Language Models 文章

ArXiv CS.AI2026-05-26NEWSen作者: Matt Gorbett, Suman Jana

详细信息

来源站点: ArXiv CS.AI
作者: Matt Gorbett, Suman Jana
文章类型: NEWS
语言: en
发布日期: 2026-05-26

摘要

arXiv:2603.18908v4 Announce Type: replace Abstract: Language models increasingly appear to learn similar representations, despite differences in training objectives, architectures, and data modalities. This emerging compatibility between independently trained models introduces new opportunities for cross-model alignment to downstream objectives. Moreover, this capability unlocks new potential application domains, such as settings where security, privacy, or competitive constraints prohibit direct data or model sharing. In this work, we investigate the extent to which representational convergence enables practical linear alignment between large language models. Specifically, we learn affine transformations between the final hidden states of independent models and empirically evaluate these mappings across text generation, embedding classification, and out-of-distribution detection.

Characterizing Linear Alignment Across Language Models 文章

详细信息

摘要

相关事件

相关公司查看全部 (3)

相关人物

相关产品查看全部 (10)

相关技术查看全部 (20)