Advances in Pre-Training Distributed Word Representations (paper)

Year: 2018. Citations: 331.

Topics: Topic Modeling; Natural Language Processing Techniques; Advanced Text Analysis Techniques

Abstract

Many Natural Language Processing applications nowadays rely on pre-trained word representations estimated from large text corpora such as news collections, Wikipedia and Web Crawl. In this paper, we show how to train high-quality word vector representations by using a combination of known tricks that are, however, rarely used together. The main result of our work is the new set of publicly available pre-trained models that outperform the current state of the art by a large margin on a number of tasks.

Authors (4 of 5 listed)

Armand Joulin

Christian Puhrsch

Piotr Bojanowski

Édouard Grave
