Adapting Multilingual Embedding Models to Turkish via Cross-Lingual Tokenizer Surgery and Offline Distillation

arXiv cs.CL · 2026-05-29 · Authors: M. Ali Bayram, Banu Diri, Savaş Yıldırım
