Word Representations: A Simple and General Method for Semi-Supervised Learning 论文

2010引用 1946

Topic ModelingNatural Language Processing TechniquesAdvanced Text Analysis Techniques

Natural Language Processing Techniques Topic Modeling Advanced Text Analysis Techniques

作者

摘要

If we take an existing supervised NLP system, a simple and general way to improve accuracy is to use unsupervised word representations as extra word features. We evaluate Brown clusters, Collobert and Weston (2008) embeddings, and HLBL (Mnih &amp; Hinton, 2009) embeddings of words on both NER and chunking. We use near state-of-the-art supervised baselines, and find that each of the three word representations improves the accuracy of these baselines. We find further improvements by combining different word representations. You can download our word features, for off-the-shelf use in existing NLP systems, as well as our code, here:

作者查看全部 (2)

Lev-Arie Ratinov

Joseph Turian

Word Representations: A Simple and General Method for Semi-Supervised Learning 论文

摘要

作者查看全部 (2)

相关技术查看全部 (2)

相关事件

相关文章