EuroBERT: Scaling Multilingual Encoders for European Languages (Event)

PRODUCT_LAUNCH | 2026-06-02 | Impact: MEDIUM

arXiv:2503.05500v3
Announce Type: replace

Abstract: General-purpose multilingual vector representations, used in retrieval, regression and classification, are traditionally obtained from bidirectional encoder models. Despite their wide applicability, encoders have been recently overshadowed by advances in generative decoder-only models. However, many innovations driving this progress are not inherently tied to decoders. In this pape