InfoGather 论文

2012引用 227

Data Quality and ManagementWeb Data Mining and AnalysisSemantic Web and Ontologies

Semantic Web and Ontologies Data Quality and Management Web Data Mining and Analysis

作者

摘要

The Web contains a vast corpus of HTML tables, specifically entity attribute tables. We present three core operations, namely entity augmentation by attribute name, entity augmentation by example and attribute discovery, that are useful for "information gathering" tasks (e.g., researching for products or stocks). We propose to use web table corpus to perform them automatically. We require the operations to have high precision and coverage, have fast (ideally interactive) response times and be applicable to any arbitrary domain of entities. The naive approach that attempts to directly match the user input with the web tables suffers from poor precision and coverage.

作者查看全部 (4)

Surajit Chaudhuri

Kaushik Chakrabarti

Kris Ganjam

Mohamed Yakout

InfoGather 论文

摘要

作者查看全部 (4)

相关技术

相关事件

相关文章