ProphetNet: Predicting Future N-gram for Sequence-to-SequencePre-training 论文

2020引用 340
Topic ModelingNatural Language Processing TechniquesText and Document Classification Technologies

摘要

This paper presents a new sequence-tosequence pre-training model called Prophet-Net, which introduces a novel self-supervised objective named future n-gram prediction and the proposed n-stream self-attention mechanism.