Bilingual Word Embeddings for Phrase-Based Machine Translation (paper)
2013, cited by 542
Natural Language Processing Techniques; Topic Modeling; Text Readability and Simplification
Abstract
We introduce bilingual word embeddings: semantic embeddings associated across two languages in the context of neural language models. We propose a method to learn bilingual embeddings from a large unlabeled corpus, while utilizing MT word alignments to constrain translational equivalence. The new embeddings significantly outperform baselines in word semantic similarity. A single semantic similarity feature induced with bilingual embeddings adds nearly half a BLEU point to the results of the NIST08 Chinese-English machine translation task.
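To make the idea of a "semantic similarity feature" more concrete, here is a minimal sketch of one way such a feature could be computed for a phrase pair. The abstract does not specify the computation, so everything here is an assumption for illustration: phrase vectors are taken as the average of their word vectors, similarity is cosine, the function names (`phrase_vector`, `cosine`, `similarity_feature`) are hypothetical, and the two-dimensional embedding tables are made-up toy values rather than learned embeddings.

```python
import math


def phrase_vector(words, embeddings):
    """Average the embeddings of the words that appear in the table.

    Returns None when no word in the phrase has an embedding.
    """
    vectors = [embeddings[w] for w in words if w in embeddings]
    if not vectors:
        return None
    dim = len(vectors[0])
    return [sum(v[i] for v in vectors) / len(vectors) for i in range(dim)]


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def similarity_feature(src_words, tgt_words, src_emb, tgt_emb):
    """Score a source/target phrase pair in a shared embedding space.

    Assumes both tables live in the same vector space, which is what
    bilingual embeddings are meant to provide.
    """
    s = phrase_vector(src_words, src_emb)
    t = phrase_vector(tgt_words, tgt_emb)
    if s is None or t is None:
        return 0.0  # back off when a phrase has no known words
    return cosine(s, t)


# Toy embedding tables (invented values, for illustration only).
zh_emb = {"猫": [1.0, 0.0], "狗": [0.0, 1.0]}
en_emb = {"cat": [0.9, 0.1], "dog": [0.1, 0.9]}

good = similarity_feature(["猫"], ["cat"], zh_emb, en_emb)
bad = similarity_feature(["猫"], ["dog"], zh_emb, en_emb)
print(good > bad)
```

In a phrase-based decoder, a score like this would be added as one more feature alongside the usual translation and language model scores, with its weight tuned together with the others.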