Truth finding on the deep web 论文

2012Proceedings of the VLDB Endowment引用 240
Mobile Crowdsensing and CrowdsourcingData Quality and ManagementPrivacy-Preserving Technologies in Data

摘要

The amount of useful information available on the Web has been growing at a dramatic pace in recent years and people rely more and more on the Web to fulfill their information needs. In this paper, we study truthfulness of Deep Web data in two domains where we believed data are fairly clean and data quality is important to people's lives: Stock and Flight. To our surprise, we observed a large amount of inconsistency on data from different sources and also some sources with quite low accuracy. We further applied on these two data sets state-of-the-art data fusion methods that aim at resolving conflicts and finding the truth, analyzed their strengths and limitations, and suggested promising research directions. We wish our study can increase awareness of the seriousness of conflicting data on the Web and in turn inspire more research in our community to tackle this problem.

相关技术

暂无数据

相关事件

暂无数据

相关文章

暂无数据