Facebook FAIR’s WMT19 News Translation Task Submission 论文

2019引用 318
Topic ModelingNatural Language Processing TechniquesData Quality and Management

摘要

This paper describes Facebook FAIR's submission to the WMT19 shared news translation task. We participate in four language directions, English German and English Russian in both directions. Following our submission from last year, our baseline systems are large BPE-based transformer models trained with the FAIRSEQ sequence modeling toolkit. This year we experiment with different bitext data filtering schemes, as well as with adding filtered back-translated data. We also ensemble and fine-tune our models on domain-specific data, then decode using noisy channel model reranking. Our system improves on our previous system's performance by 4.5 BLEU points and achieves the best casesensitive BLEU score for the translation direction EnglishRussian.