CroCo: Cross-Lingual Contrastive Preference Tuning on Self-Generations (Event)

PRODUCT_LAUNCH | 2026-05-27 | Impact: MEDIUM

arXiv:2605.26293v1. Announce Type: new.

Abstract: Prior work establishes that controlled contrastiveness between self-generated responses from large language models, set via reward scores, improves downstream preference tuning in English. We extend this method to multiple languages and evaluate two models across a total of 14 high- and low-resource languages on a diverse set of tasks. Our central finding is that cross-lingual c