Automatic acquisition of domain knowledge for Information Extraction 论文

2000引用 227

Natural Language Processing TechniquesTopic ModelingSemantic Web and Ontologies

Natural Language Processing Techniques Topic Modeling Semantic Web and Ontologies

作者

摘要

In developing an Information Extraction (IE) system for a new class of events or relations, one of the major tasks is identifying the many ways in which these events or relations may be expressed in text. This has generally involved the manual analysis and, in some cases, the annotation of large quantities of text involving these events. This paper presents an alternative approach, based on an automatic discovery procedure, EXDISCO, which identifies a set of relevant documents and a set of event patterns from un-annotaled text, starting from a small set of "seed patterns." We evaluate EXDISCO by comparing the performance of discovered patterns against that of manually constructed systems on actual extraction tasks.

作者查看全部 (4)

Silja Huttunen

Pasi Tapanainen

Ralph Grishman

Roman Yangarber

Automatic acquisition of domain knowledge for Information Extraction 论文

摘要

作者查看全部 (4)

相关技术查看全部 (2)

相关事件

相关文章