HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization 论文

2019引用 360

Topic ModelingNatural Language Processing TechniquesAdvanced Text Analysis Techniques

Natural Language Processing Techniques Topic Modeling Advanced Text Analysis Techniques

作者

摘要

Neural extractive summarization models usually employ a hierarchical encoder for document encoding and they are trained using sentence-level labels, which are created heuristically using rule-based methods. Training the hierarchical encoder with these inaccurate labels is challenging. Inspired by the recent work on pre-training transformer sentence encoders We apply the pre-trained HIBERT to our summarization model and it outperforms its randomly initialized counterpart by 1.25 ROUGE on the CNN/Dailymail dataset and by 2.0 ROUGE on a version of New York Times dataset. We also achieve the state-of-the-art performance on these two datasets.

作者查看全部 (3)

Xingxing Zhang

Ming Zhou

Furu Wei

HIBERT: Document Level Pre-training of Hierarchical Bidirectional Transformers for Document Summarization 论文

摘要

作者查看全部 (3)

相关技术查看全部 (2)

相关事件

相关文章