English-to-Prakrit Machine Translation via Multilingual Transfer Learning 文章

ArXiv CS.CL2026-06-05NEWSen作者: Om Choksi, Smit Kareliya, Shrikant Malviya, Pruthwik Mishra

摘要

arXiv:2606.06038v1 Announce Type: new Abstract: We study English-to-Prakrit machine translation in a low-resource setting where the target language is unsupported by IndicTrans2. We adapt the multilingual model by mapping Prakrit to the Hindi language tag (hin_Deva) without modifying the tokenizer, vocabulary, or architecture. Using a 1,474-pair Maharashtri Prakrit parallel corpus and evaluation on a 20-sample Ardhamagadhi test set, we report corpus BLEU improvements over an untuned baseline. The results indicate that script-compatible language routing can enable feasible transfer to unsupported classical languages, while highlighting limitations due to data scarcity and dialect mismatch. Our code and trained models are released to the public for further exploration https://github.com/D3v1s0m/indictrans2-prakrit-mt.

相关公司

暂无数据

相关人物

暂无数据

相关技术

暂无数据